
doi: 10.2139/ssrn.5618673
handle: 10419/333740
This paper proposes a novel methodology to identify the geographic market of local newspapers when information on their diffusion is not available or is not sufficiently granular. We illustrate the methodology using historical data from 154 newly digitized newspapers published in Italy between 1919 and 1922. Combining machine learning-augmented optical character recognition techniques, multi-way fixed-effect regressions, and GIS tools, our approach allows us to estimate markets based on news content. Text-based location of newspaper markets considerably improves over assuming that market boundaries coincide with administrative aggregations. We discuss how our technique strengthens the usage of newspapers as a granular and time-varying source of historical information and offers new avenues for identification strategies.
N01, media coverage, ddc:330, N94, C18, C81, text analysis, inter-war Italy, newspaper markets
N01, media coverage, ddc:330, N94, C18, C81, text analysis, inter-war Italy, newspaper markets
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
