Journal on Communications ›› 2020, Vol. 41 ›› Issue (2): 165-175. doi: 10.11959/j.issn.1000-436x.2020033

• Papers •

Research on coreference resolution technology of entity in information security

Han ZHANG1,2, Yongjin HU1, Yuanbo GUO1, Jicheng CHEN3

    1 Department of Cryptogram Engineering, Information Engineering University, Zhengzhou 450001, China
    2 Software College, Zhengzhou University, Zhengzhou 450000, China
    3 Institute of Information Technology, Information Engineering University, Zhengzhou 450001, China
  • Revised: 2019-12-27 Online: 2020-02-25 Published: 2020-03-09
  • Supported by:
    The National Natural Science Foundation of China (61501515); The Project of Henan Provincial Key Scientific and Technology (172102210002); The Young Scholar teachers project of Zhengzhou University (2017ZDGGJS048)

Abstract:

To address coreference resolution in the information security domain, a hybrid method was proposed. Building on the BiLSTM-attention-CRF model, a domain-dictionary matching mechanism was introduced and combined with the document-level attention mechanism. The resulting dictionary-based attention mechanism computed word features to address the weak recognition of rare entities and long entities when extracting candidates from text. In addition, based on a summary of the features of domain texts, the candidates were resolved by rules and machine learning according to their part of speech, in order to improve accuracy. Experiments on a security data set showed the superiority of the method in both coreference resolution and candidate extraction from text.

Key words: coreference resolution, hybrid method, domain-dictionary matching mechanism, BiLSTM-attention-CRF, information security
