Telecommunications Science ›› 2016, Vol. 32 ›› Issue (7): 115-120. doi: 10.11959/j.issn.1000-0801.2016203

• Viewpoint aggregation •

Hadoop bottleneck detection algorithm based on information gain

Zaole TAN¹, Zhifeng HAO¹, Ruichu CAI¹, Xiaojun XIAO², Yu LU²

  ¹ School of Computers, Guangdong University of Technology, Guangzhou 510006, China
  ² Guangzhou Useease Information Technology Co., Ltd., Guangzhou 510630, China
  • Online: 2016-07-20  Published: 2017-04-26

Abstract:

Hadoop has become a major platform for big data storage and large-scale data mining. Although the Hadoop platform achieves high-performance parallel computing through a distributed cluster of machines, bottlenecks inevitably appear on individual machines as the cluster load increases, because the cluster is composed of inexpensive hosts. To address this problem, a bottleneck detection algorithm based on information gain is proposed. The algorithm detects the cluster's bottleneck resource by computing the information gain of each resource. Experiments show that the bottleneck detection algorithm is feasible.
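
The abstract does not spell out how the information gain of each resource is computed, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes (hypothetically) that each monitoring sample records the utilization of several resources, that each sample is labeled as "slow" or "normal", and that utilization is discretized with a fixed threshold. The resource names, the labels, the threshold, and the function names are all assumptions made for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log2(c / total)
                for c in Counter(labels).values())


def information_gain(feature_values, labels):
    """Entropy of the labels minus their entropy conditioned on the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def discretize(values, threshold):
    """Map utilization in [0, 1] to 'high'/'low' (assumed threshold)."""
    return ["high" if v >= threshold else "low" for v in values]


def rank_resources(samples, labels, threshold=0.8):
    """Rank resources by information gain; the top one is the candidate bottleneck."""
    gains = {name: information_gain(discretize(vals, threshold), labels)
             for name, vals in samples.items()}
    return sorted(gains.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical monitoring data: utilization per resource for six samples.
    samples = {
        "cpu":    [0.95, 0.90, 0.30, 0.20, 0.85, 0.40],
        "memory": [0.50, 0.90, 0.85, 0.40, 0.30, 0.90],
    }
    labels = ["slow", "slow", "normal", "normal", "slow", "normal"]
    for name, gain in rank_resources(samples, labels):
        print(f"{name}: {gain:.4f}")
```

In this toy data, high CPU utilization coincides exactly with the "slow" samples, so CPU receives the largest information gain and would be flagged as the likely bottleneck resource under these assumptions.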

Key words: big data, Hadoop, information gain, bottleneck detection
