Toponym-assisted map georeferencing: Evaluating the use of toponyms for the digitization of map collections.

Karim Bahgat, Dan Runfola
Author Information
  1. Karim Bahgat: Department of Applied Science, William & Mary, Williamsburg, VA, United States of America.
  2. Dan Runfola: Department of Applied Science, William & Mary, Williamsburg, VA, United States of America.


A great deal of information is contained within archival maps, ranging from historic political boundaries to mineral resources to the locations of cultural landmarks. There are many ongoing efforts to preserve and digitize historic maps so that the information they contain can be stored and analyzed efficiently. A major barrier to such digitization efforts is that the geographic location of each map is typically unknown and must be determined through an often slow, manual process known as georeferencing. To reduce the time cost of georeferencing, this paper introduces a fully automated method based on map toponym (place name) labels. It is the first study to demonstrate these methods across a wide range of both simulated and real-world maps. We find that toponym-based georeferencing is sufficiently accurate to be used for data extraction purposes in nearly half of all cases. We make our implementation available to the wider research community through fully open-source replication code and an online georeferencing tool, and we highlight areas of improvement for future research. We hope that this research will enable larger-scale and more efficient processing and digitization of map information for researchers, institutions, and the general public.



MeSH Term

Data Collection
Geographic Mapping
Image Processing, Computer-Assisted
Information Storage and Retrieval
Maps as Topic
Pilot Projects
