网络与信息安全学报 ›› 2020, Vol. 6 ›› Issue (4): 1-13.doi: 10.11959/j.issn.2096-109x.2020010

• 综述 •    下一篇

基于深度学习的文本分类研究进展

杜思佳,于海宁(),张宏莉   

  1. 哈尔滨工业大学计算机科学与技术学院,黑龙江 哈尔滨 150001
  • 修回日期:2020-01-25 出版日期:2020-08-15 发布日期:2020-08-13
  • 作者简介:杜思佳(1995- ),女,浙江嘉兴人,哈尔滨工业大学硕士生,主要研究方向为网络舆情分析、网络安全|于海宁(1983- ),男,黑龙江鹤岗人,博士,哈尔滨工业大学助理研究员,主要研究方向为物联网安全搜索与隐私保护、云安全与隐私保护|张宏莉(1973- ),女,吉林榆树人,博士,哈尔滨工业大学教授、博士生导师,主要研究方向为网络与信息安全、网络测量与建模、网络计算、并行处理
  • 基金资助:
    国家自然科学基金(61601146);国家自然科学基金(61732022)

Survey of text classification methods based on deep learning

Sijia DU,Haining YU(),Hongli ZHANG   

  1. School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China
  • Revised:2020-01-25 Online:2020-08-15 Published:2020-08-13
  • Supported by:
    The National Natural Science Foundation of China(61601146);The National Natural Science Foundation of China(61732022)

摘要:

文本分类技术是自然语言处理领域的研究热点,其主要应用于舆情检测、新闻文本分类等领域。近年来,人工神经网络技术在自然语言处理的许多任务中有着很好的表现,将神经网络技术应用于文本分类取得了许多成果。在基于深度学习的文本分类领域,文本分类的数值化表示技术和基于深度学习的文本分类技术是两个重要的研究方向。对目前文本表示的有关词向量的重要技术和应用于文本分类的深度学习方法的实现原理和研究现状进行了系统的分析和总结,并针对当前的技术发展,分析了文本分类方法的不足和发展趋势。

关键词: 文本分类, 深度学习, 人工神经网络, 词向量

Abstract:

Text classification is a research hot spot in the field of natural language processing,which is mainly used in public opinion detection,news classification and other fields.In recent years,artificial neural networks has good performance in many tasks of natural language processing,the application of neural network technology to text classification has also made many achievements.In the field of text classification based on deep learning,numerical representation of text and deep-learning-based text classification are two main research directions.The important technology of word embedding in text representation and the implementation principle and research status of deep learning method applied in text classification were systematically analyzed and summarized.And the shortcomings and the development trend of text classification methods in view of the current technology development were analyzed.

Key words: text classification, deep learning, artificial neural network, word embedding

中图分类号: 

No Suggested Reading articles found!