A survey on geocoding: algorithms and datasets for toponym resolution

Zeyu Zhang, Steven Bethard

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Geocoding, the task of converting unstructured text to structured spatial data, has recently seen progress thanks to a variety of new datasets, evaluation metrics, and machine-learning algorithms. Geocoding plays a critical role in tasks such as tracking the evolution and emergence of infectious diseases, analyzing and searching documents by geography, geospatial analysis of historical events, and disaster response mechanisms. To assist those new to this area of research, we provide a survey that reviews, organizes and analyzes recent work on geocoding (also known as toponym resolution) where text is matched to geospatial coordinates and/or ontologies. We summarize the findings of this research, including the domains and databases covered by current geocoding corpora, point-based and polygon-based evaluation metrics, and features and architectures of geocoding systems.

Original languageEnglish (US)
JournalLanguage Resources and Evaluation
DOIs
StateAccepted/In press - 2024

Keywords

  • Geocoding
  • Geographical entity normalization
  • Toponym resolution

ASJC Scopus subject areas

  • Language and Linguistics
  • Education
  • Linguistics and Language
  • Library and Information Sciences

Fingerprint

Dive into the research topics of 'A survey on geocoding: algorithms and datasets for toponym resolution'. Together they form a unique fingerprint.

Cite this