The machine learning algorithm is used to predict the customer loss problem in business activities. Inspired by the idea of Bagging ensemble method, we proposed a Stacking ensemble learning based on bootstrap sampling. By multiple bootstrap sampling of the data set and adding attribute disturbance, multiple copies of the base classifier are trained with the data subset, and the decision result of the base classifier is determined by the vote of the corresponding copy of the base classifier. Experimental results show that the method we proposed in this paper has better performance than all base classifiers and the classical Stacking ensemble method of the same structure in terms of accuracy, precision rate and F1-score.
ZHENG Hong, YE Cheng, JIN Yonghong, CHENG Yunhui
. Customer Churn Prediction Method Based on Stacking Ensemble Learning[J]. Journal of Applied Sciences, 2020
, 38(6)
: 944
-954
.
DOI: 10.3969/j.issn.0255-8297.2020.06.011
[1] Mohammed H, Ali T, Tariq E, et al. Customer churn in mobile markets:a comparison of techniques[J]. International Business Research, 2015, 8(6):224-237.
[2] Soltani Z, Navimipour N J. Customer relationship management mechanisms:a systematic review of the state of the art literature and recommendations for future research[J]. Computers in Human Behavior, 2016, 61:667-688.
[3] Gillies C, Rigby D, Reichheld F. The story behind successful customer relations management[J]. European Business Journal, 2002, 14(2):73-77.
[4] Tiwari A, Sam R, Shaikh S. Analysis and prediction of churn customers for telecommunication industry[C]//2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud). IEEE, 2017:218-222.
[5] 于瑞云, 薛林, 安轩邈, 等. 基于改进GA-BP的移动通信用户流失预测算法[J]. 东北大学学报(自然科学版), 2019, 40(2):180-185. Yu R Y, Xue L, An X M, et al. Mobile communications customer churn prediction algorithm based on improved GA-BP network[J]. Journal of Northeastern University (Natural Science), 2019, 40(2):180-185. (in Chinese)
[6] Dalvi P K, Khandge S K, Deomore A, et al. Analysis of customer churn prediction in telecom industry using decision trees and logistic regression[C]//2016 Symposium on Colossal Data Analysis and Networking (CDAN). IEEE, 2016:1-4.
[7] Qiu Y F, Li C. Research on E-commerce user churn prediction based on logistic regression[C]//2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE, 2017:87-91.
[8] 朱姗姗. 数据挖掘在电信行业客户细分的应用研究[D]. 辽宁:辽宁大学, 2012.
[9] Farquad M A H, Ravi V, Raju S B. Churn prediction using comprehensible support vector machine:an analytical CRM application[J]. Applied Soft Computing, 2014, 19:31-40.
[10] Kisioglu P, Topcu Y I. Applying Bayesian belief network approach to customer churn analysis:a case study on the telecom industry of Turkey[J]. Expert Systems with Applications, 2011, 38(6):7151-7157.
[11] Artit W, Cyrille B, Rujikorn P. Churn analysis using deep convolutional neural networks and autoencoders[DB/OL].[2016]. https://arxiv.org/pdf/1703.02596.pdf
[12] Martins H. Predicting user churn on streaming services using recurrent neural networks[D]. KTH Royal Institute of Technology, 2017.
[13] Liu X, Dai Y, Zhang Y, et al. A preprocessing method of AdaBoost for mislabeled data classification[C]//2017 29th Chinese Control and Decision Conference (CCDC). IEEE, 2017:2738-2742.
[14] 章品正, 王健弘. 一种应用机器学习的车牌定位方法[J]. 应用科学学报, 2011, 29(2):147-152. Zhang P Z, Wang J H. Vehicle license plate location based on machine learning[J]. Journal of Applied Sciences, 2011, 29(2):147-152. (in Chinese)
[15] Olson M A, Wyner A J. Making sense of random forest probabilities:a kernel perspective[DB/OL].[2018]. https://arxiv.org/abs/1812.05792?context=stat.ML
[16] Divina F, Gilson A, GomÉZ-Vela F, et al. Stacking ensemble learning for short-term electricity consumption forecasting[J]. Energies, 2018, 11(4):949-980.
[17] Zheng H, Li H, Lu X, et al. A multiple kernel learning approach for air quality prediction[J]. Advances in Meteorology, 2018, 2018:1-15.