Big Data Research ›› 2020, Vol. 6 ›› Issue (3): 87-100.doi: 10.11959/j.issn.2096-0271.2020026

• TOPIC:DATAFLOW COMPUTING TECHNIQUES FOR BIG DATA PROCESSING • Previous Articles     Next Articles

State-of-art research of cluster resource management in dataflow computing model

Xiaochun TANG,Ying FU,Zhao DING,Anqi MAO,Zhanhuai LI   

  1. School of Computer Science,Northwestern Polytechnical University,Xi’an 710129,China
  • Online:2020-05-15 Published:2020-05-15
  • Supported by:
    The National Key Research and Development Program of China(2018YFB1003400)

Abstract:

The development of cluster-based high-performance computing has undergone three stages of evolution.With the widespread use of dataflow programming models such as Spark and Flink in the field of big data computing,how to ensure the fair share with the cluster resources by various dataflow computing applications is extremely important.It is also a main means to reduce the cost of infrastructures.As the drawbacks of traditional cluster resource management have becoming increasingly apparent in dataflow computing model,many alternative cluster resource management,including HoD,centralized scheduling,two-level scheduling,distributed scheduling,and hybrid scheduling management,have been proposed in recent years.Their respective advantages and disadvantages were introduced,and a certain reference for the uses or researches in development of cluster resource management and scheduling in a dataflow computing environment was provided.

Key words: dataflow model, cluster resource, schedule framework, big data

CLC Number: 

No Suggested Reading articles found!