北京航空航天大学学报 ›› 2015, Vol. 41 ›› Issue (4): 751-756.doi: 10.13700/j.bh.1001-5965.2014.0499

• 论文 • 上一篇    

基于多尺度梯度及深度神经网络的汉字识别

潘炜深, 金连文, 冯子勇   

  1. 华南理工大学 电子与信息学院, 广州 510641
  • 收稿日期:2014-04-28 修回日期:2014-11-27 出版日期:2015-04-20 发布日期:2015-05-08
  • 通讯作者: 金连文(1968—),男,贵州都匀人,教授,lianwen.jin@gmail.com,主要研究方向为手写汉字识别、机器学习、模式识别、云计算等. E-mail:lianwen.jin@gmail.com
  • 作者简介:潘炜深(1990—),男,广东惠州人,硕士生,panweishen2009@gmail.com
  • 基金资助:

    国家自然科学基金资助项目(61075021); 国家科技支撑计划资助项目(2013BAH65F01,2013BAH65F04); 广东省科技计划资助项目(2012A010701001)

Recognition of Chinese characters based on multi-scale gradient and deep neural network

PAN Weishen, JIN Lianwen, FENG Ziyong   

  1. Department of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China
  • Received:2014-04-28 Revised:2014-11-27 Online:2015-04-20 Published:2015-05-08

摘要:

介绍了一种基于多尺度滑动窗的方法提取文字的梯度直方图特征,并结合深度神经网络对印刷体汉字进行识别.针对梯度直方图的空间关系,使用可伸缩的滑动窗对图像进行分割,在不同尺度上获取文字的特征信息,有效融合汉字的全局特征和局部分块特征.实验采用5层的深度神经网络模型对国标一级3755个印刷体汉字进行分类,并应用Dropout技术防止训练过拟合,提高神经网络的泛化能力.实验准确率达到98.292%,有较好的识别性能,验证了本文多尺度梯度特征及深度神经网络模型在文字识别上的有效性.

关键词: 多尺度滑动窗, 梯度直方图, 深度神经网络, 泛化能力, 汉字识别

Abstract:

The method to extract the gradient histogram feature of the Chinese characters with a multi-scale sliding window and to recognize the printed Chinese characters with deep neural network was presented. In order to acquire the spatial information of the gradient histogram, a retractable sliding window technique was proposed for segmenting the images and getting the gradient feature information from different scales which can effectively combine all the global features and local block features of Chinese characters. The experiment was carried out by using a 5-layer deep neural network to classify 3755 categories of printed Chinese characters.A Dropout technique was applied so as to prevent over-fitting training and to improve the generalization ability of the neural network. The accuracy of the experiment reaches 98.292%, which has better recognition performance and demonstrates that the method of applying a multi-scale gradient feature and deep neural network model on the recognition of Chinese characters is effective.

Key words: multi-scale sliding window, gradient histogram, deep neural network, generalization ability, recognition of Chinese characters

中图分类号: 


版权所有 © 《北京航空航天大学学报》编辑部
通讯地址:北京市海淀区学院路37号 北京航空航天大学学报编辑部 邮编:100191 E-mail:jbuaa@buaa.edu.cn
本系统由北京玛格泰克科技发展有限公司设计开发