应用科学学报 ›› 2019, Vol. 37 ›› Issue (4): 529-540.doi: 10.3969/j.issn.0255-8297.2019.04.010

• 计算机科学与应用 • 上一篇    下一篇

基于主成分回归算法的城市客流聚集风险预测

王聚全1,2, 王伟3, 马慧民4, 杨博1, 杜渂1,2   

  1. 1. 迪爱斯信息技术股份有限公司, 上海 200032;
    2. 电信科学技术第一研究所有限公司, 上海 200032;
    3. 上海市公安局科技处, 上海 200042;
    4. 上海市北高新股份有限公司, 上海 200436
  • 收稿日期:2018-11-26 修回日期:2018-12-18 出版日期:2019-07-31 发布日期:2019-10-11
  • 通信作者: 杜渂,教授级高工,研究方向:大数据、数据挖掘、物联网、软件体系架构,E-mail:duwen@dscomm.com.cn E-mail:duwen@dscomm.com.cn
  • 基金资助:
    工业和信息化部2018年大数据产业发展试点示范项目基金;上海市人工智能创新发展专项基金(No.2018-RGZN-01013);上海市科技支撑计划项目基金(No.15dz1207400);上海市信息化发展专项资金(No.201502001);上海市科技创新行动计划项目基金(No.16511101000)资助

Urban Passenger Flow Aggregation Risk Forecasting Based on Principal Component Regression Algorithm

WANG Juquan1,2, WANG Wei3, MA Huimin4, YANG Bo1, DU Wen1,2   

  1. 1. DS Information Technology Co., Ltd., Shanghai 200032, China;
    2. The First Research Institute of Telecommunications Technology Co., Ltd., Shanghai 200032, China;
    3. Science and Technology Department of Shanghai Public Security Bureau, Shanghai 200042, China;
    4. Shanghai Shibei Hi-tech Co., Ltd., Shanghai 200436, China
  • Received:2018-11-26 Revised:2018-12-18 Online:2019-07-31 Published:2019-10-11

摘要: 为解决特大城市突发公共事件客流聚集预警准确度不高的问题,采用主成分回归算法对固定区域内运营商提供的手机用户数据和真实客流数据进行回归拟合,并根据多种统计检验方法对模型进行检验与评估.通过主成分分析,有效克服了运营商提供的手机用户数据的多重共线性问题,充分利用了手机用户数据的各个维度信息,并且降低了算法的复杂度,有效提高了城市客流聚集风险预测的精准度.

关键词: 主成分分析, 多重共线性, 统计检验, 客流聚集

Abstract: In order to solve the problem of the low accuracy of early warning of public events in mega-cities, this paper proposes a principal component regression algorithm to fit the mobile user data and real passenger flow data of fixed regions provided by operators, and uses a variety of statistical methods to test and evaluate the model. The principal component analysis can effectively overcome the multicollinearity problem of the mobile phone user data provided by the operators. By making full use of all-dimension information of the mobile phone user data, the complexity of the algorithm is reduced, and the accuracy of the prediction of the urban passenger flow aggregation risk is effectively improved.

Key words: principal component analysis, multicollinearity, statistical test, passenger flow aggregation

中图分类号: