山东科学 ›› 2017, Vol. 30 ›› Issue (6): 87-93.doi: 10.3976/j.issn.1002-4026.2017.06.014

• 交通运输 • 上一篇    下一篇

基于微博签到数据的出行行为分析

聂琦   

  1. 北京交通大学交通运输学院,北京 100044
  • 收稿日期:2017-05-17 出版日期:2017-12-20 发布日期:2017-12-20
  • 作者简介:聂琦(1992—),男,硕士研究生,研究方向为交通运输系统科学、人类移动性。E-mail: nie_qi@bjtu.edu.cn
  • 基金资助:

    国家自然科学基金(71525002)

Travel behavior analysis based on Weibo checkin data

NIE Qi   

  1. School of Traffic and Transportation, Beijing Jiaotong University, Beijing 100044, China
  • Received:2017-05-17 Online:2017-12-20 Published:2017-12-20

摘要:

应用Python爬虫程序,通过新浪API端口爬取了新浪微博2012年的地点签到数据,共计5 028 980条。将这些数据按城市划分,共分为340个地级以上的城市或地区。通过统计发现,签到次数最多的3个城市为北京、上海和广州,说明微博用户更多地活跃在这三个城市。进一步通过相关性分析发现,这些城市的微博用户签到流量和当地GDP呈一定的相关性,说明经济发展水平会影响用户的旅行行为。此外,本文还按照用户的出行流量对各大城市进行了聚类划分,进一步印证了经济发达城市对微博用户签到的吸引会高于其他经济欠发达的城市。

关键词: 经济水平, 相关性, 聚类划分, 微博签到

Abstract:

Using Python crawler, the location checkin data of Sina Weibo in the year of 2012 were crawled through the Sina API port. The data set consisted of 5,028,980 records. These data were divided into 340 cities or regions above prefecture level. Data statistics showed that there was the largest number of check-in in 3 cities: Beijing, Shanghai and Guangzhou, which revealed that Weibo users were more active there. Furthermore, through correlation analysis, it was found that the Weibo users′ attendance flow in these cities was related to the local GDP, indicating that the level of city economic development would affect the users′ travel behavior. In addition, this paper also divided the major cities into clusters according to the users' trip volume, further confirming that the developed cities were more attractive to Weibo users than other economically underdeveloped cities.

Key words: economic level, correlation, Weibo check-in, clustering

中图分类号: 

  • U491