电信科学 ›› 2013, Vol. 29 ›› Issue (3): 90-100.doi: 10.3969/j.issn.1000-0801.2013.03.017

• 研究与开发 • 上一篇    下一篇

云计算环境下一种面向科学工作流不确定数据源的视图构造方法

胡海洋,刘占晨,胡华   

  1. 杭州电子科技大学计算机学院 杭州 310018
  • 出版日期:2013-03-20 发布日期:2017-06-16
  • 基金资助:
    国家自然科学基金资助项目;国家自然科学基金资助项目;浙江省自然科学基金资助项目;浙江省自然科学基金资助项目

Constructing View of Uncertain Data Provenance for Scientific Workflow in Cloud Computing

Haiyang Hu,Zhanchen Liu,Hua Hu   

  1. Schoo1 of Computer Science and Techno1ogy, Hangzhou Dianzi University, Hangzhou 310018, China)
  • Online:2013-03-20 Published:2017-06-16

摘要:

科学工作流的数据源视图根据数据源中任务间的数据流关系,将它们划分为多个复合模块,并在此基础上进行数据抽象与封装,从而可有效降低科研工作者的数据分析工作量并节省数据查询时间。 然而在云计算环境中开发与应用科学工作流系统时,由于受数据采集的准确度和服务器的可靠性影响,将会导致工作流数据源图的不确定性,因此需要提供有效的机制在不确定数据源图中构建合理性视图。针对此方面,首先给出了不确定数据源图及其合理性视图的定义,在此基础上提出了一种检测不合理视图的方法;还进一步分析了数据源图中任务节点与其一阶前序节点之间存在的多种数据流关系及复合任务的局部期望支持度,给出了合理视图的构造方法。设计了相应的多项式时间算法,并分析算法的时间复杂度。最后,对相关方法给出示例,并进行实验分析,验证了其可行性与有效性。

关键词: 云计算, 科学工作流, 数据源, 视图

Abstract:

The view of data provenance in scientific workf1ow provides an approach of data abstraction and encapsu1ation by partitioning tasks in the data provenance graph(DPG)into a set of composite modu1es due to the data f1ow re1ations among them, so as to efficient1y decrease the work1oad consumed by researchers making ana1ysis on the data provenance and the time needed in doing data querying.Neverthe1ess, deve1oping and app1ying the scientific workf1ow systems in c1oud computing environments suffers the prob1em of uncertainty brought by the inaccuracy of data co11ection and unre1iabi1ity of data servers distributed in the internet.Concentrating on this scenario, the definitions of uncertain DPG and its sound view were presented first1y, and then a method for detecting the unsound view of DPG was proposed.A1so, a method for constructing sound and high-support view was presented, which is based on the data f1ow re1ations among the tasks and their first-order preceding tasks in the graph, and the 1oca1 expected support of the composite modu1es.A po1ynomia1-time a1gorithm was designed, and its maxima1 time comp1exity was a1so ana1yzed.Additiona11y, an examp1e and conduct comprehensive experiments were given to show the feasibi1ity and effectiveness of the method.

Key words: c1oud computing, scientific workf1ow, data provenance, view

No Suggested Reading articles found!