
Open data and code for Berg et al. "Workforce Patterns and Income Disparities in a Growing US City" Input data are from the U.S. Census Bureau's Public Use Microdata Sample (PUMS) and are in the public domain. Derivative data and code is published under a Creative Commons Attribution 4.0 International License. For attribution, please cite the article: Berg, A. K, Howell, A., Bagchi-Sen, S., Workforce Patterns and Income Disparities in a Growing US City. (2026). Journal of Economic Geography. https://doi.org/10.1093/jeg/lbaf071 Files and what they do: Code - contains all data cleaning, modeling, and visualization R code (run in this order). Data processed in R version 4.5.1. usa_io_2013.r - USA 2013 - contains code for counting total workers in each IO, data cleaning for shift share usa_io_2019.r - USA 2019 - contains code for counting total workers in each IO, data cleaning for shift share atl_io_2013.r - Atlanta 2013 - contains code for counting total workers in each IO, data cleaning for shift share atl_io_2019.r - Atlanta 2019 - contains code for counting total workers in each IO, LQ calculations for 2019, and data cleaning for shift share atl_shiftshare_lq.r - contains code for the classification of IO catagories based on shift share and lq atl_summarystats.r - contains code to create summary statistics tables summarystats_tests.r - contains code performing Chi-Square, Wilcoxon, and Welch two-sample tests for the Appendix atl_models.r - contains all model code Data - contains all input, working, and output data from the analysis inputs usa_pums_2013.csv - raw 1-year 2013 ACS PUMS files for the US, downloaded via the U.S. Census Bureaus FTP archive usa_pums_2019.csv - raw 1-year 2019 ACS PUMS files for the US, downloaded via the U.S. Census Bureaus FTP archive working - contains derivative files created during the analysis. See codebooks for details about generation outputs - contains output tables created automatically during the analysis. Some tables in the manuscript were created manually from tables in the "working" folder
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
