Abstract
Geocoding, the task of converting unstructured text to structured spatial data, has recently seen progress thanks to a variety of new datasets, evaluation metrics, and machine-learning algorithms. Geocoding plays a critical role in tasks such as tracking the evolution and emergence of infectious diseases, analyzing and searching documents by geography, geospatial analysis of historical events, and disaster response mechanisms. To assist those new to this area of research, we provide a survey that reviews, organizes and analyzes recent work on geocoding (also known as toponym resolution) where text is matched to geospatial coordinates and/or ontologies. We summarize the findings of this research, including the domains and databases covered by current geocoding corpora, point-based and polygon-based evaluation metrics, and features and architectures of geocoding systems.
| Original language | English (US) |
|---|---|
| Article number | 103191 |
| Pages (from-to) | 1775-1796 |
| Number of pages | 22 |
| Journal | Language Resources and Evaluation |
| Volume | 59 |
| Issue number | 2 |
| DOIs | |
| State | Published - Jun 2025 |
Keywords
- Geocoding
- Geographical entity normalization
- Toponym resolution
ASJC Scopus subject areas
- Language and Linguistics
- Education
- Linguistics and Language
- Library and Information Sciences