Chinese Journal of Network and Information Security ›› 2020, Vol. 6 ›› Issue (5): 126-138.doi: 10.11959/j.issn.2096-109x.2020009

• Papers • Previous Articles     Next Articles

Cyber security entity recognition method based on residual dilation convolution neural network

Bo XIE1,2,Guowei SHEN1,2(),Chun GUO1,2,Yan ZHOU1,2,Miao YU3   

  1. 1 College of Computer Science and Technology,Guizhou University,Guiyang 550025,China
    2 Guizhou Provincial Key Laboratory of Public Big Data,Guiyang 550025,China
    3 Institute of Information Engineering,Chinese Academy of Sciences,Beijing 100093,China
  • Revised:2020-01-07 Online:2020-10-15 Published:2020-10-19
  • Supported by:
    The National Natural Science Foundation of China(61802081);The Natural Science Foundation of Guizhou Province,China(20161052);The Natural Science Foundation of Guizhou Province,China(20167428);The Natural Science Foundation of Guizhou Province,China(20171051);The Major Scientific and Technological Special Project of Guizhou Province,China(20183001)

Abstract:

In recent years,cybersecurity threats have increased,and data-driven security intelligence analysis has become a hot research topic in the field of cybersecurity.In particular,the artificial intelligence technology represented by the knowledge graph can provide support for complex cyberattack detection and unknown cyberattack detection in multi-source heterogeneous threat intelligence data.Cybersecurity entity recognition is the basis for the construction of threat intelligence knowledge graphs.The composition of security entities in open network text data is very complex,which makes traditional deep learning methods difficult to identify accurately.Based on the pre-training language model of BERT (pre-training of deep bidirectional transformers),a cybersecurity entity recognition model BERT-RDCNN-CRF based on residual dilation convolutional neural network and conditional random field was proposed.The BERT model was used to train the character-level feature vector representation.Combining the residual convolution and the dilation neural network model to effectively extract the important features of the security entity,and finally obtain the BIO annotation of each character through CRF.Experiments on the large-scale cybersecurity entity annotation dataset constructed show that the proposed method achieves better results than the LSTM-CRF model,the BiLSTM-CRF model and the traditional entity recognition model.

Key words: cybersecurity,entity recognition, residual connection, dilation convolution neural network, BERT pre-train model

CLC Number: 

No Suggested Reading articles found!