Journal on Communications ›› 2014, Vol. 35 ›› Issue (9): 184-189. doi: 10.3969/j.issn.1000-436x.2014.09.019

• Academic Communication •

大数据下的基于深度神经网的相似汉字识别

杨钊,陶大鹏,张树业,金连文   

  1. 华南理工大学 电子与信息学院,广东 广州 510641
  • 出版日期:2014-09-25 发布日期:2017-06-14
  • 基金资助:
    国家自然科学基金资助项目;国家科技支撑计划基金资助项目;广东省科技计划基金资助项目

Similar handwritten Chinese character recognition based on deep neural networks with big data

Zhao YANG, Da-peng TAO, Shu-ye ZHANG, Lian-wen JIN

  1. School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510641, China
  • Online: 2014-09-25 Published: 2017-06-14
  • Supported by:
    The National Natural Science Foundation of China; The National Science and Technology Support Plan; The Science and Technology Project of Guangdong Province

摘要:

针对传统相似手写汉字识别系统(SHCCR)受特征提取方法的限制,提出采用深度神经网(DNN)对相似汉字自动学习有效特征并进行识别,介绍相似字符集生成方法和针对相似汉字识别的深度神经网络的具体结构,研究对比不同的训练数据规模对识别性能的影响。实验表明,DNN 能有效地进行特征学习,避免了人工设计特征的不足,与传统基于梯度特征的支持向量机(SVM)和最近邻分类器(1-NN)方法相比,识别率有较大的提高;且随着训练样本增加的同时,DNN 在提高识别性能上表现得更为优秀,大数据训练对提升深度神经网络的识别率作用明显。

关键词: 大数据, 深度神经网, 深度学习, 相似手写汉字识别

Abstract:

Traditional similar handwritten Chinese character recognition (SHCCR) systems achieve limited recognition rates because they are constrained by their feature extraction methods. To improve recognition accuracy, a method based on deep neural networks (DNN) is proposed that learns effective features automatically and performs recognition. A method for generating similar handwritten Chinese character sets is introduced, the architecture of the DNN for SHCCR is presented, and performance is compared across different training data scales. The experimental results show that the DNN learns features effectively, avoiding the shortcomings of hand-designed features, and achieves a considerably higher recognition rate than a support vector machine (SVM) and a nearest neighbor classifier (1-NN) based on gradient features. Moreover, as the number of training samples increases, the recognition rate of the DNN improves markedly, indicating that large-scale training data is crucial to DNN performance.

Key words: big data, deep neural networks, deep learning, similar handwritten Chinese character recognition
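
The abstract compares recognition performance across training data scales and uses a nearest neighbor classifier (1-NN) as one baseline. The following is a minimal sketch of that kind of evaluation protocol only, not the authors' DNN, features, or data. It assumes synthetic Gaussian feature vectors as a stand-in for one pair of similar characters, and all names (`make_class`, `predict_1nn`, `learning_curve`) and parameters (`dim`, `gap`, the sample sizes) are hypothetical.

```python
import random


def make_class(n, label, rng, dim=8, gap=1.0):
    """Draw n synthetic feature vectors for one class.

    Assumption: the two classes differ only by a small shift of the mean,
    a toy stand-in for a pair of visually similar characters.
    """
    return [([rng.gauss(label * gap, 1.0) for _ in range(dim)], label)
            for _ in range(n)]


def predict_1nn(train, x):
    """Return the label of the training sample nearest to x (squared Euclidean)."""
    nearest = min(train, key=lambda s: sum((a - b) ** 2 for a, b in zip(s[0], x)))
    return nearest[1]


def accuracy(train, test):
    """Fraction of test samples whose 1-NN prediction matches the true label."""
    correct = sum(predict_1nn(train, x) == y for x, y in test)
    return correct / len(test)


def learning_curve(sizes, seed=0):
    """Evaluate 1-NN on a fixed test set for each per-class training size."""
    rng = random.Random(seed)
    test = make_class(200, 0, rng) + make_class(200, 1, rng)
    pool0 = make_class(max(sizes), 0, rng)
    pool1 = make_class(max(sizes), 1, rng)
    # Smaller training sets are prefixes of the larger ones, so only the
    # amount of training data changes between runs.
    return [(n, accuracy(pool0[:n] + pool1[:n], test)) for n in sizes]


if __name__ == "__main__":
    for n, acc in learning_curve([10, 50, 250]):
        print(f"{n:4d} training samples per class -> accuracy {acc:.3f}")
```

Keeping the test set fixed and nesting the training subsets means any change in accuracy is attributable to training set size alone; the same loop could wrap any other classifier in place of `predict_1nn`.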
