Chinese Journal of Intelligent Science and Technology ›› 2020, Vol. 2 ›› Issue (2): 144-152.doi: 10.11959/j.issn.2096-6652.202016

• Regular Papers • Previous Articles     Next Articles

Fusion of discourse structural position encoding for neural machine translation

Xiaomian KANG1,2,Chengqing ZONG1,2()   

  1. 1 National Laboratory of Pattern Recognition,Institute of Automation,Chinese Academy of Sciences,Beijing 100190,China
    2 University of Chinese Academy of Sciences,Beijing 100049,China
  • Revised:2020-04-01 Online:2020-06-20 Published:2020-07-14
  • Supported by:
    The National Natural Science Foundation of China(U1836221)

Abstract:

Most of existing document-level neural machine translation (DocNMT) methods focus on exploring the utilization of the lexical information of context,which ignore the structural relationships among the cross-sentence discourse semantic units.Therefore,multiple discourse structural position encoding strategies were proposed to represent the positional relationships among the words in discourse units over the discourse tree based on rhetorical structure theory (RST).Experimental results show that the source-side discourse structural position information is effectively fused into the DocNMT models underlying the Transformer architecture by the position encoding,and the translation quality is improved significantly.

Key words: neural machine translation, discourse structure, position encoding, discourse analysis, rhetorical structure theory

CLC Number: 

No Suggested Reading articles found!