Telecommunications Science ›› 2012, Vol. 28 ›› Issue (1): 91-95.doi: 10.3969/j.issn.1000-0801.2012.01.017

• research and development • Previous Articles     Next Articles

Study of Over-Sampling Method Based on Feature Selection

Huijuan Lu1,2,Jinwei Zhang2,Jinwei Zhang2,Jinwei Zhang2,Xiaoping Ma1,Xiaobing Yang2   

  1. 1 School of Information and Electrical Engineering,China University of Mining&Technology,Xuzhou 221008,China
    2 College of Information Engineering,China Jiliang University,Hangzhou 310018,China
  • Published:2017-02-07

Abstract:

To significantly improve the classification performance of the minority class,we present an over-sampling method based on feature selection.Firstly,feature selection is performed on the training data set in order to select a set of key columns.Then minority class samples are produced using selected key columns,and each sample consists of two kinds of features.One type of features is characteristic value that is corresponding to the selected key columns,the others is generated according to the principle of SMOTE.Comparing to SMOTE algorithm,results show that the new method performs better than SMOTE,and it can effectively reduce the imbalance of data and improve the classification accuracy of the minority class.

Key words: imbalanced data set, feature selection, over sampling, genetic algorithm

No Suggested Reading articles found!