Journal of Applied Sciences ›› 2015, Vol. 33 ›› Issue (2): 203-214. doi: 10.3969/j.issn.0255-8297.2015.02.010

• Computer Science and Applications •

AdaBoost Algorithm with Classification Belief

YAN Chao (严超), WU Yue (吴悦), YUE Xiao-dong (岳晓冬)

  1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
  • Online: 2015-03-30 Published: 2015-03-30
  • About the author: WU Yue, Ph.D., professor; research interests include data mining and intelligent information processing. E-mail: ywu@shu.edu.cn
  • Funding:

    Project supported by the National Science Foundation of China (No. 61103067)

Abstract: Ensemble learning is widely accepted and used in machine learning. This paper
proposes a new multi-class ensemble learning algorithm named AdaBoost belief. The algorithm
improves the multi-class ensemble method AdaBoost·SAMME by giving every base classifier a
weight for each class. These per-class weights, called class beliefs, are computed from the
accuracy on each class in each round of the iteration. We compare the algorithm with the
original AdaBoost·SAMME in terms of prediction accuracy, generalization ability, and
theoretical support. Experiments on Gaussian data sets, several UCI data sets, and a
log-based multi-class intrusion detection application show that the proposed algorithm
achieves higher prediction accuracy and better generalization. Moreover, as the number of
classes increases and the classes become harder to predict, its classification error rate
rises more slowly than that of AdaBoost·SAMME.

Key words: ensemble learning, multi-class, class belief, class weight, AdaBoost·SAMME
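
The idea of class beliefs can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything below other than the standard SAMME classifier weight is an assumption made for illustration: the belief of a base classifier for a class is taken to be its weighted accuracy on samples of that class, and a vote for class c from classifier t is assumed to count alpha_t times that belief. The function names samme_alpha, class_beliefs, and belief_weighted_vote are hypothetical and do not come from the paper.

```python
import math
from collections import defaultdict


def samme_alpha(weighted_error, n_classes):
    """Classifier weight used by AdaBoost·SAMME: ln((1 - err) / err) + ln(K - 1)."""
    return math.log((1.0 - weighted_error) / weighted_error) + math.log(n_classes - 1)


def class_beliefs(y_true, y_pred, sample_weights, classes):
    """Per-class belief of one base classifier.

    Assumption (not from the paper): the belief for class c is the
    weighted fraction of samples whose true label is c that the
    classifier predicted correctly.
    """
    correct = defaultdict(float)
    total = defaultdict(float)
    for truth, pred, weight in zip(y_true, y_pred, sample_weights):
        total[truth] += weight
        if pred == truth:
            correct[truth] += weight
    return {c: (correct[c] / total[c] if total[c] > 0 else 0.0) for c in classes}


def belief_weighted_vote(predictions, alphas, beliefs, classes):
    """Combine one prediction per base classifier into a final label.

    Assumption (not from the paper): a vote for class c from classifier t
    contributes alpha_t * belief_t[c] to the score of class c. Setting
    every belief to 1.0 reduces this to the plain SAMME weighted vote.
    """
    scores = {c: 0.0 for c in classes}
    for pred, alpha, belief in zip(predictions, alphas, beliefs):
        scores[pred] += alpha * belief[pred]
    return max(classes, key=lambda c: scores[c])
```

Under these assumptions, a base classifier with a large overall weight but a low belief for one class contributes little when it votes for that class, so a weaker classifier that is more reliable on that class can decide the outcome.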

CLC Number: