大数据

• •    

数字人文视域中的古籍文本标注方法研究——以MARKUS为例

于亚秀1 李欣2   

  1. 1.华东师范大学图书馆 2.华东师范大学数据科学与工程学院, 上海 200062
  • 作者简介:于亚秀(1985-),女,硕士,华东师范大学副研究馆员,主要研究方向为数字人文、知识组织与管理、智慧图书馆建设。yxyu@library.ecnu.edu.cn。 李欣(1961-),女,华东师范大学研究馆员,主要研究方向为语义网知识组织与管理、数字人文、推荐系统。xli@dase.ecnu.edu.cn(通讯作者)

Research on Text Annotation Method of Ancient Works from the Perspective of Digital Humanism——A Case Study on MARKUS#br#

Yaxiu YU   xin Li   

  1. 1. East China Normal University Library, Shanghai 200062,China
    2. School of Data Science and Engineering, East China Normal University, Shanghai 200062,China

摘要: 文本标注是文本分析挖掘中的重要一步,面对大规模古籍资源,人工标注无法满足人文研究需求,且古籍特殊的语法结构和语言特点,现代文标注技术很难直接应用于古籍研究。本文在分析人文研究者进行古籍文本标注中所面临的难点和痛点基础上,提出普适性的古籍标注标准流程,给出基于MARKUS的文本标注模型,并通过具体实践,探索基于该模型的古籍文本标注方法,将助推借助数字人文工具改变古籍人文研究方式和研究规模的应用深度。

关键词: 数字人文, 古籍, 文本标注, MARKUS

Abstract: Text annotation is an important step in text analysis and mining, faced with large-scale text resources, simple manual labeling can no longer meet the needs of humanistic research, and due to the special grammatical structure and language characteristics of ancient works, the text annotation technology on modern corpora cannot be directly applied to the ancient works. In this paper, based on the analysis of the challenges faced by humanities researchers, we proposes a universal standard text annotation process of ancient works, and gives a model based on Markus,and through the analysis of specific practical cases, explores this model boosts the application depth of using digital humanistic tools change the way and scale of humanistic research.

No Suggested Reading articles found!