
arXiv: 2101.05924
handle: 11583/2996096 , 2318/1792438
There is a rumbling debate over the impact of gentrification: presumed gentrifiers have been the target of protests and attacks in some cities, while they have been welcome as generators of new jobs and taxes in others. Census data fails to measure neighborhood change in real-time since it is usually updated every ten years. This work shows that Airbnb data can be used to quantify and track neighborhood changes. Specifically, we consider both structured data (e.g., number of listings, number of reviews, listing information) and unstructured data (e.g., user-generated reviews processed with natural language processing and machine learning algorithms) for three major cities, New York City (US), Los Angeles (US), and Greater London (UK). We find that Airbnb data (especially its unstructured part) appears to nowcast neighborhood gentrification, measured as changes in housing affordability and demographics. Overall, our results suggest that user-generated data from online platforms can be used to create socioeconomic indices to complement traditional measures that are less granular, not in real-time, and more costly to obtain.
FOS: Computer and information sciences, Computer Science - Machine Learning, 330, J.4, gentrification, economics, K.4.0, airbnb, Machine Learning (cs.LG), Computer Science - Computers and Society, user-generated data, Computers and Society (cs.CY), natural language processing, airbnb; economics; gentrification; natural language processing; user-generated data, K.4.0; J.4
FOS: Computer and information sciences, Computer Science - Machine Learning, 330, J.4, gentrification, economics, K.4.0, airbnb, Machine Learning (cs.LG), Computer Science - Computers and Society, user-generated data, Computers and Society (cs.CY), natural language processing, airbnb; economics; gentrification; natural language processing; user-generated data, K.4.0; J.4
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 13 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
