Journal of Applied Sciences ›› 2013, Vol. 31 ›› Issue (6): 628-632.doi: 10.3969/j.issn.0255-8297.2013.06.012

• Computer Science and Applications • Previous Articles     Next Articles

Ensemble Pruning Based on Frequent Patterns

ZHOU Hong-fang1, WANG Xiao1, ZHAO Xue-han1, RAO Yuan2   

  1. 1. School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
    2. School of Soft Engineering, Xi’an Jiaotong University, Xi’an 710049, China
  • Received:2013-04-21 Revised:2013-06-06 Online:2013-11-29 Published:2013-06-06

Abstract: Most ensemble learning methods have high computational complexity, excessive base classifiers and unsatisfactory classification accuracy in case of large-scale data sets. This paper proposes an ensemble pruning algorithm based on frequent patterns. Using the theory of frequent patterns mining, the method
maps the un-pruned ensemble classifier and corresponding sample space to a transactional database, and stores the corresponding classification results in a boolean matrix. After extracting frequent base classifiers from the Boolean matrix and composing a pruning ensemble, the algorithm gives the final pruning ensemble.Experimental results show that this algorithm reduces the number of base classifiers, improves classification accuracy and increases classification efficiency compared with ensemble algorithms of Bagging, AdaBoost, WAVE and RFW.

Key words: large-scale data set, frequent pattern, ensemble pruning, transactional database, Boolean matrix

CLC Number: