Telecommunications Science ›› 2017, Vol. 33 ›› Issue (12): 107-113. doi: 10.11959/j.issn.1000-0801.2017341

• Column: Big Data Technology and Application •

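Method of improving big data value density based on heterogeneous association

Shaomin WANG, Zheng WANG

  1. Shanghai Research Institute of China Telecom Co., Ltd., Shanghai 200122, China
  • Revised: 2017-12-04 Online: 2017-12-01 Published: 2018-01-12
  • About the authors: Shaomin WANG (1983-), female, is an engineer at the Shanghai Research Institute of China Telecom Co., Ltd. Her main research interests are big data architecture, data mining and analysis, and artificial intelligence technology. | Zheng WANG (1973-), male, is an engineer at the Shanghai Research Institute of China Telecom Co., Ltd. and head of its artificial intelligence interaction team. His main research interests are big data architecture, data mining and analysis, and artificial intelligence technology.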

Abstract:

The big data held by telecom operators is usually stored across many separate systems, such as DPI, OIDD and CRM, and its format, representation and rules differ from one system to another. As a result, the different types of data that describe the same object in different systems are hard to identify and to use in full. This severely limits the sample size and feature dimensions available to big data analysis and lowers the credibility and accuracy of its results. A heterogeneous association method for telecom big data and an architecture for implementing it are proposed, together with a worked example and a validation of the method's workflow. The method fuses data from multiple systems along the user dimension and optimizes the data sample space of applications such as user profiling, thereby greatly improving the value density of telecom big data.

Key words: big data, telecom service big data, multi-source and heterogeneous, heterogeneous association
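
The abstract does not spell out the association procedure itself, so the following is only a minimal sketch of what user-dimension fusion across systems could look like. It assumes that each system identifies a user by a phone number written in a different format; the field names, the sample records and the helpers `normalize_msisdn` and `associate` are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def normalize_msisdn(raw):
    """Map differently formatted phone numbers to one key.

    Assumption for this sketch: the last 11 digits identify the user
    (mainland China mobile numbers have 11 digits).
    """
    digits = "".join(ch for ch in raw if ch.isdigit())
    return digits[-11:]


def associate(sources, id_fields):
    """Merge per-system records into one record per user.

    sources   maps a system name to its list of records (dicts).
    id_fields maps a system name to the field holding the user identifier.
    Every other field is kept, prefixed with the system name so that
    identically named fields from different systems do not collide.
    """
    merged = defaultdict(dict)
    for system, records in sources.items():
        id_field = id_fields[system]
        for record in records:
            user = normalize_msisdn(record[id_field])
            for key, value in record.items():
                if key != id_field:
                    merged[user][f"{system}.{key}"] = value
    return dict(merged)


# Hypothetical sample data: three systems, three identifier formats.
sources = {
    "DPI": [{"msisdn": "+86 138-0000-0001", "top_app": "video"}],
    "OIDD": [{"phone": "8613800000001", "cell_id": "C17"}],
    "CRM": [
        {"mobile_no": "13800000001", "plan": "4G-99"},
        {"mobile_no": "13900000002", "plan": "4G-59"},
    ],
}
id_fields = {"DPI": "msisdn", "OIDD": "phone", "CRM": "mobile_no"}

profiles = associate(sources, id_fields)
```

In this sketch the first user ends up with one record that carries fields from all three systems, while the second user, known only to CRM, keeps a single-system record. Widening each user's feature set in this way is the kind of sample-space improvement the abstract describes, although a real deployment would need far more careful identifier matching than digit stripping.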

CLC number:
