
Abstract Causal inference lies at the heart of social science, and the 2019 Nobel Prize in Economics highlights the value of randomized variation for identifying causal effects and mechanisms. But causal inference cannot rely on randomized variation alone; it also requires good data. Yet the data-generating process has received less consideration from economists. We provide a simple framework to clarify how research inputs affect data quality and discuss several such inputs, including interviewer selection and training, survey design, and investments in linking across multiple data sources. More investment in research on the data quality production function would considerably improve casual inference generally, and poverty alleviation specifically.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 16 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
