山东科学 ›› 2021, Vol. 34 ›› Issue (2): 54-64.doi: 10.3976/j.issn.1002-4026.2021.02.008

• 交通运输 • 上一篇    下一篇

基于因果分析和相似日选择的共享单车需求量预测组合模型

徐长兴1,汪伟平1*,昌锡铭1,包旭2,吴建军1   

  1. 1.北京交通大学 轨道交通控制与安全国家重点实验室,北京 1000442.淮阴工学院 交通工程学院,江苏 淮安 223000
  • 收稿日期:2020-07-23 出版日期:2021-04-13 发布日期:2021-04-14
  • 通信作者: 汪伟平,男,副教授,研究方向为复杂交通系统建模与管理。E-mail: wpwang@bjtu.edu.cn
  • 作者简介:徐长兴(1990—),男,硕士研究生,研究方向为交通大数据。E-mail:18120755@bjtu.edu.cn
  • 基金资助:
    国家自然科学基金重大项目(7189097271890970

A combined model for forecasting shared bikes demand based on causal analysis and similar day selection

XU Chang-xing1, WANG Wei-ping 1*, CHANG Xi-ming1, BAO Xu2 , WU Jian-jun 1   

  1. 1.State Key Laboratory of Rail Traffic Control and Safety, Beijing Jiaotong University, Beijing 100044,China;2. School of Traffic Engineering, Huaiyin Institute of Technology, Huaian 223000, China
  • Received:2020-07-23 Online:2021-04-13 Published:2021-04-14

摘要: 共享单车的需求量预测是优化车辆系统布局、实现车辆合理调度的基础。为了提高共享单车需求量预测模型的精度,建立了基于格兰杰因果分析和相似日选择的组合预测模型,研究了时间和天气因素对共享单车出行需求的影响。应用格兰杰因果检验方法,筛选出影响共享单车需求量变化的关键天气指标。然后,基于天气特征向量的灰色关联度指标,提取待预测日各时段的相似日样本集。综合随机森林回归、支持向量回归等机器学习算法,建立了Stacking策略的组合预测模型,对区域分时共享单车需求量进行预测。最后,对北京市共享单车用户的骑行数据进行实例分析。结果表明相较单个机器学习预测模型,提出的组合预测模型的平均绝对百分比误差下降了9.1%,提高了共享单车短时需求预测的科学性和准确性,可为实际车辆调度提供参考依据。

关键词: 共享单车, 出行需求, 因果分析, 灰色关联度, 相似日, 机器学习, Stacking策略

Abstract: Short-term demand of shared bikes forecasting plays an important role in optimizing the layout of free-floating bike sharing systems and bikes rebalancing. To improve the accuracy of demand forecasting methods for the emerging shared bikes business, this study establishes a combined forecasting model based on Stacking strategy and examines the impact of temporal variables and weather factors on shared bikes demand. In particular, the Granger causality test is used to identify the key weather indicators that cause demand fluctuations. We extract the set of similar samples for each period of the day for prediction based on the grey correlation index of weather variables. The Stacking strategy is then introduced to integrate random forest, support vector regression, and other machine learning algorithms for establishing a combined forecasting model to predict the short-term demand in different regions. Finally, using free-floating bike sharing data in Beijing, the proposed combined model is tested. The prediction results of combined model demonstrate that the mean absolute percentage error decreased by 9.1% compared with the single prediction model, thus improving the accuracy of the short-term demand forecast of shared bikes and providing useful information for bike relocation.

Key words: shared bikes, travel demand, causal analysis, grey correlation, similar day, machine learning, Stacking strategy

中图分类号: 

  • U491.14