Journal on Communications ›› 2019, Vol. 40 ›› Issue (12): 41-50. doi: 10.11959/j.issn.1000-436x.2019195

• Academic paper •

Self-correcting complex semantic analysis method based on pre-training mechanism

Qing LI1, Jiang ZHONG1, Lili LI2, Qi LI3

  1 College of Computer Science, Chongqing University, Chongqing 400044, China
  2 School of Civil Engineering, Chongqing University, Chongqing 400044, China
  3 Department of Computer Science and Engineering, Shaoxing University, Shaoxing 312000, China
  • Revised: 2019-10-30 Online: 2019-12-25 Published: 2020-01-16
  • About the authors: Qing LI (李青, 1989- ), female, born in Xi'an, Shaanxi, is a doctoral student at Chongqing University; her main research interests are natural language processing, complex event detection, and medical informatics. | Jiang ZHONG (钟将, 1974- ), male, born in Chongqing, Ph.D., is a professor at Chongqing University; his main research interests are natural language processing and data mining. | Lili LI (李立力, 1989- ), male, born in Tongchuan, Shaanxi, is a doctoral student at Chongqing University; his main research interests are bridge health monitoring and data mining. | Qi LI (李琪, 1987- ), male, born in Xuyi, Jiangsu, Ph.D., is a lecturer at Shaoxing University; his main research interests are graph computing and data mining.
  • Supported by:
    Fundamental Research Funds for the Central Universities (2018CDYJSY0055); The National Key Research and Development Program of China (2017YFB1402400); Chongqing Graduate Research and Innovation Foundation (CYB18058); Chongqing Technological Innovation and Application Demonstration Project (cstc2018jszx-cyzdX0086)

Abstract:

Knowledge services require fragmented management of content resources that is intelligent, knowledge-oriented, fine-grained, and recombinable. To meet this need, a self-correcting complex semantic analysis method based on a pre-training mechanism, PT-Sem2SQL, is proposed. The method analyzes and mines the knowledge, techniques, experience, and information latent in the semantic hidden layers, with the aim of breaking through the bottleneck of existing text-to-SQL (structured query language) semantic parsing techniques. An MT-DNN pre-training mechanism combined with Kullback-Leibler divergence is designed to deepen the understanding of contextual semantics. A dedicated enhancement module is designed to capture the position of contextual semantic information within a sentence. A self-correcting method optimizes the execution of the generation model and resolves erroneous output during decoding. Experimental results show that PT-Sem2SQL effectively improves parsing performance on complex semantics and achieves higher accuracy than related work.

Key words: Text-to-SQL, semantic parsing, natural language processing, complex event processing
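
The abstract does not say how the self-correcting step works. As a rough illustration only, the sketch below shows one common way to reject erroneous decoder output in Text-to-SQL: execute the candidate queries in ranked order and keep the first one the database accepts. It is not the authors' algorithm. The function name `first_executable`, the toy `paper` table, and the candidate list are hypothetical and are not taken from the paper.

```python
import sqlite3


def first_executable(candidates, conn):
    """Return the first candidate SQL string that executes without error.

    `candidates` is assumed to be ordered best-first (e.g. by decoder score).
    Returns None if every candidate fails.
    """
    for sql in candidates:
        try:
            conn.execute(sql).fetchall()
        except sqlite3.Error:
            continue  # reject this candidate and fall back to the next one
        return sql
    return None


# Hypothetical toy schema and decoder output, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE paper (title TEXT, year INTEGER)")
conn.execute("INSERT INTO paper VALUES ('PT-Sem2SQL', 2019)")

candidates = [
    "SELECT titel FROM paper WHERE year = 2019",  # misspelled column: fails
    "SELECT title FROM paper WHERE year = 2019",  # valid query
]
chosen = first_executable(candidates, conn)
```

Because this check actually runs each statement, a real system would restrict candidates to read-only queries or run them against a scratch copy of the database.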
