Big Data Research ›› 2021, Vol. 7 ›› Issue (6): 19-29.doi: 10.11959/j.issn.2096-0271.2021057

• TOPIC: BIG DATA-ENABLED INTELLIGENT APPLICATIONS • Previous Articles     Next Articles

Legal element extraction method based on BERT reading comprehension framework

Hui HUANG1, Yongbin QIN1,2, Yanping CHEN1,2, Ruizhang HUANG1,2   

  1. 1 School of Computer Science and Technology, Guizhou University, Guiyang 550025, China
    2 State Key Laboratory of Public Big Data, Guiyang 550025, China
  • Online:2021-11-15 Published:2021-11-01
  • Supported by:
    The National Natural Science Foundation of China(62066008);The Key Projects of Science and Technology of Guizhou Province([2020]1Z055)

Abstract:

Extraction of legal elements is an important basis for judicial intelligent auxiliary applications, and its purpose is to identify the key elements involved in the judgment document.In the past, extracting legal elements usually used multi-label classification methods for modeling.These methods mainly relied on the text features of the judgment document, thereby ignoring the label features.Besides, due to the imbalanced data problem in judicial data sets, the classification method will lead to poor model performance because of too many negative examples.To solve the above problems, a legal element extraction method based on BERT reading comprehension framework was proposed.This method constructed auxiliary questions with label information and legal prior knowledge, and used the machine reading comprehension model based on BERT to establish the semantic associations between question and judgment document.And this method added special tokens before and after the label in the question to enhance the learning ability of the model.Experiments were conducted on the legal element extraction data sets of the CAIL2019.Experiment results show that the performance is improved significantly, and the F1 value has been increased by 2.7%, 11.3%, and 5.6% respectively on the data sets of marriage and family case, labor dispute case, and loan contract dispute case.

Key words: element extraction, machine reading comprehension, neural network, BERT

CLC Number: 

No Suggested Reading articles found!