A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution

This article describes a novel approach for toponym resolution with deep neural networks. The proposed approach does not involve matching references in the text against entries in a gazetteer, instead directly predicting geo-spatial coordinates. Multiple inputs are considered in the neural network a...

Full description

Bibliographic Details
Main Authors: Ana Bárbara Cardoso, Bruno Martins, Jacinto Estima
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:ISPRS International Journal of Geo-Information
Subjects:
Online Access:https://www.mdpi.com/2220-9964/11/1/28
_version_ 1797493535110332416
author Ana Bárbara Cardoso
Bruno Martins
Jacinto Estima
author_facet Ana Bárbara Cardoso
Bruno Martins
Jacinto Estima
author_sort Ana Bárbara Cardoso
collection DOAJ
description This article describes a novel approach for toponym resolution with deep neural networks. The proposed approach does not involve matching references in the text against entries in a gazetteer, instead directly predicting geo-spatial coordinates. Multiple inputs are considered in the neural network architecture (e.g., the surrounding words are considered in combination with the toponym to disambiguate), using pre-trained contextual word embeddings (i.e., ELMo or BERT) as well as bi-directional Long Short-Term Memory units, which are both regularly used for modeling textual data. The intermediate representations are then used to predict a probability distribution over possible geo-spatial regions, and finally to predict the coordinates for the input toponym. The proposed model was tested on three datasets used on previous toponym resolution studies, specifically the (i) <i>War of the Rebellion</i>, (ii) <i>Local–Global Lexicon</i>, and (iii) <i>SpatialML</i> corpora. Moreover, we evaluated the effect of using (i) geophysical terrain properties as external information, including information on elevation or terrain development, among others, and (ii) additional data collected from Wikipedia articles, to further help with the training of the model. The obtained results show improvements using the proposed method, when compared to previous approaches, and specifically when BERT embeddings and additional data are involved.
first_indexed 2024-03-10T01:21:23Z
format Article
id doaj.art-d70188213d4545da893ea303ca2c5df1
institution Directory Open Access Journal
issn 2220-9964
language English
last_indexed 2024-03-10T01:21:23Z
publishDate 2021-12-01
publisher MDPI AG
record_format Article
series ISPRS International Journal of Geo-Information
spelling doaj.art-d70188213d4545da893ea303ca2c5df12023-11-23T13:59:57ZengMDPI AGISPRS International Journal of Geo-Information2220-99642021-12-011112810.3390/ijgi11010028A Novel Deep Learning Approach Using Contextual Embeddings for Toponym ResolutionAna Bárbara Cardoso0Bruno Martins1Jacinto Estima2INESC-ID, Instituto Superior Técnico, Universidade de Lisboa, 1649-004 Lisboa, PortugalINESC-ID, Instituto Superior Técnico, Universidade de Lisboa, 1649-004 Lisboa, PortugalINESC-ID, Faculdade de Design, Tecnologia e Comunicação (IADE), Universidade Europeia, 1200-649 Lisboa, PortugalThis article describes a novel approach for toponym resolution with deep neural networks. The proposed approach does not involve matching references in the text against entries in a gazetteer, instead directly predicting geo-spatial coordinates. Multiple inputs are considered in the neural network architecture (e.g., the surrounding words are considered in combination with the toponym to disambiguate), using pre-trained contextual word embeddings (i.e., ELMo or BERT) as well as bi-directional Long Short-Term Memory units, which are both regularly used for modeling textual data. The intermediate representations are then used to predict a probability distribution over possible geo-spatial regions, and finally to predict the coordinates for the input toponym. The proposed model was tested on three datasets used on previous toponym resolution studies, specifically the (i) <i>War of the Rebellion</i>, (ii) <i>Local–Global Lexicon</i>, and (iii) <i>SpatialML</i> corpora. Moreover, we evaluated the effect of using (i) geophysical terrain properties as external information, including information on elevation or terrain development, among others, and (ii) additional data collected from Wikipedia articles, to further help with the training of the model. The obtained results show improvements using the proposed method, when compared to previous approaches, and specifically when BERT embeddings and additional data are involved.https://www.mdpi.com/2220-9964/11/1/28geographical text analysisresolving toponyms in textual documentsdeep learning for NLPcontextual word embeddingsmachine learning with neural networks
spellingShingle Ana Bárbara Cardoso
Bruno Martins
Jacinto Estima
A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
ISPRS International Journal of Geo-Information
geographical text analysis
resolving toponyms in textual documents
deep learning for NLP
contextual word embeddings
machine learning with neural networks
title A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
title_full A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
title_fullStr A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
title_full_unstemmed A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
title_short A Novel Deep Learning Approach Using Contextual Embeddings for Toponym Resolution
title_sort novel deep learning approach using contextual embeddings for toponym resolution
topic geographical text analysis
resolving toponyms in textual documents
deep learning for NLP
contextual word embeddings
machine learning with neural networks
url https://www.mdpi.com/2220-9964/11/1/28
work_keys_str_mv AT anabarbaracardoso anoveldeeplearningapproachusingcontextualembeddingsfortoponymresolution
AT brunomartins anoveldeeplearningapproachusingcontextualembeddingsfortoponymresolution
AT jacintoestima anoveldeeplearningapproachusingcontextualembeddingsfortoponymresolution
AT anabarbaracardoso noveldeeplearningapproachusingcontextualembeddingsfortoponymresolution
AT brunomartins noveldeeplearningapproachusingcontextualembeddingsfortoponymresolution
AT jacintoestima noveldeeplearningapproachusingcontextualembeddingsfortoponymresolution