Telecommunications Science ›› 2015, Vol. 31 ›› Issue (4): 77-85.doi: 10.11959/j.issn.1000-0801.2015100

• research and development • Previous Articles     Next Articles

Fast Single Pbase Algoritbm for Utility Mining in Big Data

Junqiang Liu1,Qingfeng Zhou1,Wenhui Wang2,Lei Shi1   

  1. 1 Zhejiang Gongshang University,Hangzhou 310018,China
    2 Zhejiang University of Water Resources and Electric Power,Hangzhou 310018,China
  • Online:2015-04-15 Published:2015-04-15
  • Supported by:
    The National Natural Science Foundation of China;The Zhejiang Provincial Natural Science Foundation of China

Abstract:

Most of the latest works on utility mining generates a huge number of candidates in dealing with big data,which suffers from the scalability issue.Some work does not generate candidates,but suffers from the efficiency issue due to lack of strong pruning and high computation overhead.A novel algorithm that finds high utility patterns in a single phase without generating candidates was proposed.The novelties lie in a prefix growth strategy with strong pruning,and a sparse matrix based representation of transactions with pseudo projection.The proposed algorithm works in a depth first manner and does not materialize high utility patterns in memory,which further improves the scalability.Extensive experiments on synthetic and rea1-world data show that the proposed algorithm outperforms the latest works in terms of running time,memory overhead,and scalability.

Key words: big data, utility mining, high utility pattern, frequent pattern

No Suggested Reading articles found!