TY - JOUR
T1 - Income estimation based on human mobility patterns and machine learning models
AU - Gao, Qi-Li
AU - Zhong, Chen
AU - Yue, Yang
AU - Cao, Rui
AU - Zhang, Bowen
N1 - Funding Information:
This research was supported by the National Natural Science Foundation of China (Grant No. 42001390), the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (Grant Agreement No. 949670), and the ESRC under JPI Urban Europe/NSFC (Grant No. ES/T000287/1).
Publisher Copyright:
© 2023 The Authors
PY - 2024/2
Y1 - 2024/2
N2 - Sustainable and inclusive urban development requires a thorough understanding of income distribution and poverty. Recent related research has extensively explored the use of automatically generated sensor data to proxy economic activities. Notably, human mobility patterns have been found to exhibit strong associations with socioeconomic attributes and great potential for income estimation. However, the representation of complex human mobility patterns and their effectiveness in income estimation need further investigation. To address this, we propose three representations of human mobility: mobility indicators, activity footprints, and travel graphs. These representations feed into various models, including XGBoost (a traditional machine learning model), a convolutional neural network (CNN), and a time-series graph neural network (GCRN). By leveraging public transit data from Shenzhen, our study demonstrates that graph-based representations and deep learning models outperform other approaches in income estimation. They excel in minimising information loss and handling complex data structures. Spatial contextual attributes, such as transport accessibility, are the most influential factors, while indicators related to activity extent, temporal rhythm, and intensity contribute comparatively less. In summary, this study highlights the potential of cutting-edge artificial intelligence tools and emerging human mobility data as an alternative approach to estimating income distribution and addressing poverty-related concerns.
KW - Income estimation
KW - Human mobility patterns
KW - Machine learning
KW - Public transit
UR - http://www.scopus.com/inward/record.url?scp=85180786503&partnerID=8YFLogxK
U2 - 10.1016/j.apgeog.2023.103179
DO - 10.1016/j.apgeog.2023.103179
M3 - Article
SN - 0143-6228
VL - 163
JO - Applied Geography
JF - Applied Geography
M1 - 103179
ER -