
The latest developments in the field of digital humanities have increasingly enabled the construction of large data sets which can be easily accessed and used. These data sets often contain indirect spatial information, such as historical addresses. Historical geocoding is the process of transforming indirect spatial information into direct locations which can be placed on a map, thus allowing for spatial analysis and cross-referencing. There are many geocoders that work efficiently for current addresses. However, these do not tackle temporal information, and usually follow a strict hierarchy (country, city, street, house number, etc.) which is difficult—if not impossible—to use with historical data. Historical data is filled with uncertainty (pertaining to temporal, textual, and positional accuracy, as well as to the reliability of historical sources) which can neither be ignored nor entirely resolved. Our open source, open data, and extensible solution for geocoding is based on extracting a large number of simple gazetteers composed of geohistorical objects, from historical maps. Geocoding a historical address becomes the process of finding one or several geohistorical objects in the gazetteers which best match the historical address searched by the user. The matching criteria are customisable, weighted, and include several dimensions (fuzzy string, fuzzy temporal, level of detail, positional accuracy). Since our goal is to facilitate historical work, we also put forward web-based user interfaces which help geocode (one address or batch mode) and display results over current or historical maps. Geocoded results can then be checked and edited collaboratively (no source is modified). The system was tested on the city of Paris, France, for the 19th and 20th centuries. It showed high response rates and worked quickly enough to be used interactively.
FOS: Computer and information sciences, History, [SHS.GEO] Humanities and Social Sciences/Geography, [SHS.INFO] Humanities and Social Sciences/Library and information sciences, Computer Science - Computers and Society, Computer Science - Databases, citizen science, Computers and Society (cs.CY), [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], Digital Libraries (cs.DL), database, collaborative, Geography (General), historical dataset, Computer Science - Digital Libraries, Databases (cs.DB), geohistorical objects, GIS, geocoding, localisation, [SHS.HIST] Humanities and Social Sciences/History, G1-922, Arts and Humanities, digital humanities, crowd-sourced
FOS: Computer and information sciences, History, [SHS.GEO] Humanities and Social Sciences/Geography, [SHS.INFO] Humanities and Social Sciences/Library and information sciences, Computer Science - Computers and Society, Computer Science - Databases, citizen science, Computers and Society (cs.CY), [INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB], Digital Libraries (cs.DL), database, collaborative, Geography (General), historical dataset, Computer Science - Digital Libraries, Databases (cs.DB), geohistorical objects, GIS, geocoding, localisation, [SHS.HIST] Humanities and Social Sciences/History, G1-922, Arts and Humanities, digital humanities, crowd-sourced
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 22 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Top 10% | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Top 10% |
