Big Data Research ›› 2019, Vol. 5 ›› Issue (6): 3-18.doi: 10.11959/j.issn.2096-0271.2019046

• TOPIC:BIG DATA WRANGLING • Previous Articles     Next Articles

Progress on human-in-the-loop data preparation

Ju FAN1,2,Yueguo CHEN1,2,Xiaoyong DU1,2   

  1. 1 DEKE Lab &Information School,Renmnin University of China,Beijing 100872,China
    2 School of Information,Renmnin University of China,Beijing 100872,China
  • Online:2019-11-15 Published:2020-01-10
  • Supported by:
    The National Natural Science Foundation of China(61602488);The National Natural Science Foundation of China(61632016);The National Natural Science Foundation of China(U1711261)

Abstract:

With the rapid development of data analytics,data preparation has become a major bottleneck.The two essential challenges for data preparation on cost and time were analyzed.To address the challenges,the research progress on human-in-theloop data preparation was reviewed.Firstly,interactive data preparation was reviewed,which aimed to reduce the time for data preparation by predictively interacting with the end users.Then,crowdsourced data preparation was introduced,which utilize human’s computational power from the crowd to support foundamental data preparation tasks,and developed algorithms for controlling result quality and reducing crowdsourcing cost.Finally,future research directions were summarized and discussed.

Key words: data governance, data preparation, crowdsourcing, interactive mechanism

CLC Number: 

No Suggested Reading articles found!