
arXiv: 1405.7769
The identification of urban mobility patterns is very important for predicting and controlling spatial events. In this study, we analyzed millions of geographical check-ins crawled from a leading Chinese location-based social networking service (Jiepang.com), which contains demographic information that facilitates group-specific studies. We determined the distinct mobility patterns of natives and non-natives in all five large cities that we considered. We used a mixed method to assign different algorithms to natives and non-natives, which greatly improved the accuracy of location prediction compared with the basic algorithms. We also propose so-called indigenization coefficients to quantify the extent to which an individual behaves like a native, which depends only on their check-in behavior, rather than requiring demographic information. Surprisingly, the hybrid algorithm weighted using the indigenization coefficients outperformed a mixed algorithm that used additional demographic information, suggesting the advantage of behavioral data in characterizing individual mobility compared with the demographic information. The present location prediction algorithms can find applications in urban planning, traffic forecasting, mobile recommendation, and so on.
19 pages, 5 figures and 7 tables
Social and Information Networks (cs.SI), FOS: Computer and information sciences, Physics - Physics and Society, Physics - Data Analysis, Statistics and Probability, FOS: Physical sciences, Computer Science - Social and Information Networks, Physics and Society (physics.soc-ph), Data Analysis, Statistics and Probability (physics.data-an)
Social and Information Networks (cs.SI), FOS: Computer and information sciences, Physics - Physics and Society, Physics - Data Analysis, Statistics and Probability, FOS: Physical sciences, Computer Science - Social and Information Networks, Physics and Society (physics.soc-ph), Data Analysis, Statistics and Probability (physics.data-an)
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 21 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
