Telecommunications Science ›› 2015, Vol. 31 ›› Issue (7): 152-157. doi: 10.11959/j.issn.1000-0801.2015174

• Operation Technology Wide Angle •

Comparison of Open-Source Distributed Computing Framework for Big Data

(Chinese title, translated: Analysis and Comparative Evaluation of Mainstream Open-Source Big Data Processing Architectures)

Ai Fang (方艾), Xiong Xu (徐雄), Bing Liang (梁冰), Yuzhong Zhang (张玉忠), Yiping Yang (杨翊平)

  1. Guangzhou Research Institute of China Telecom Co., Ltd., Guangzhou 510630, China
  • Online: 2015-08-21 Published: 2015-08-21

Abstract:

Driven by the practical big data processing needs of telecom value-added services, three mainstream open-source distributed big data processing frameworks (Hive, Impala and Spark) were analyzed at their core and benchmarked. Tests were run against real business demands to compare the frameworks' strengths, weaknesses and suitable scenarios, and the implementation cost of meeting business requirements was also discussed. The results offer a reference for selecting an architecture for big data analysis.

Key words: big data, Hive, MapReduce, Impala, Spark
