
Data set of 2Kx2K image tiles cropped from maps of the David Rumsey collection for the ICDAR'24 Competition on Historical Map Text Detection, Recognition, and Linking. Annotations and images follow the format described at the competition website and can be evaluated using the official evaluation repository script. Important: v1.1 fixes an image channel order error, superseding the prior version. Train Validation Annotations rumsey_train.json rumsey_val.json Images train.zip val.zip Files rumsey/train/*.png rumsey/val/*.png Tiles 200 40 Map Sheets 196 40 Words 34,521 5,543 Label Groups 27,729 4,959 Illegible Words 1,741 291 Truncated Words 3,582 643 Valid Words 30,683 4,881 Annotations: Copyright 2024 UMN Knowledge Computing Lab, CC-BY-NC-SA 4.0 International.Images: David Rumsey Map Collection, David Rumsey Map Center, Stanford Libraries. CC-BY-NC-SA 3.0 Unported.
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 3 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
