Big Data Research ›› 2022, Vol. 8 ›› Issue (2): 134-144.doi: 10.11959/j.issn.2096-0271.2022019

• STUDA • Previous Articles     Next Articles

A semi-supervised learning financial news classification algorithm

Xiaolong ZHANG1,2, Long ZHI1,2, Jian GAO3, Zhongchen MIAO3, Yuefeng LIN3, Yali XIANG1,2, Yun XIONG1,2   

  1. 1 School of Computer Science and Technology, Fudan University, Shanghai 200438, China
    2 Shanghai Key Laboratory of Data Science, Shanghai 200438, China
    3 Shanghai Financial Futures Information Technology Co., Ltd., Shanghai 200120, China
  • Online:2022-03-15 Published:2022-03-01
  • Supported by:
    The National Natural Science Foundation of China(U1636207);The National Natural Science Foundation of China(U1936213)

Abstract:

Classifying financial texts is a common task for identifying financial risks.Traditional financial news classification requires a large number of labeled texts to train the classifier.However, labeling financial news requires not only professional financial background knowledge, but also time-consuming and labor-intensive.In order to reduce the dependence on labeled text, a semi-supervised learning financial text classification algorithm- SSF (semi-supervised learning financial news classification algorithm) was proposed, which uses a consistent training method of supervised learning and unsupervised learning to improve the use of unlabeled data.And unsupervised data augmentation for financial texts was introduced, that is, use specific target data augmentation methods for specific tasks to generate more effective data.Experiments on multiple financial news data sets were conducted to verify that the proposed SSF algorithm has a significant improvement in effectiveness compared with other text classification algorithms.

Key words: natural language processing, text classification, semi-supervised learning, finance

CLC Number: 

No Suggested Reading articles found!