A holistic approach to aligning geospatial data with multidimensional similarity measuring

Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced infor...

Full description

Bibliographic Details
Main Authors: Li Yu, Peiyuan Qiu, Xiliang Liu, Feng Lu, Bo Wan
Format: Article
Language:English
Published: Taylor & Francis Group 2018-08-01
Series:International Journal of Digital Earth
Subjects:
Online Access:http://dx.doi.org/10.1080/17538947.2017.1359688
Description
Summary:Semantically aligning the heterogeneous geospatial datasets (GDs) produced by different organizations demands efficient similarity matching methods. However, the strategies employed to align the schema (concept and property) and instances are usually not reusable, and the effects of unbalanced information tend to be neglected in GD alignment. To solve this problem, a holistic approach is presented in this paper to integrally align the geospatial entities (concepts, properties and instances) simultaneously. Spatial, lexical, structural and extensional similarity metrics are designed and automatically aggregated by means of approval voting. The presented approach is validated with real geographical semantic webs, Geonames and OpenStreetMap. Compared with the well-known extensional-based aligning system, the presented approach not only considers more information involved in GD alignment, but also avoids the artificial parameter setting in metric aggregation. It reduces the dependency on specific information, and makes the alignment more robust under the unbalanced distribution of various information.
ISSN:1753-8947
1753-8955