Matching biomedical ontologies with GCN-based feature propagation
With an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for conce...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2022-06-01
|
Series: | Mathematical Biosciences and Engineering |
Subjects: | |
Online Access: | https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTML |
_version_ | 1828801657056526336 |
---|---|
author | Peng Wang Shiyi Zou Jiajun Liu Wenjun Ke |
author_facet | Peng Wang Shiyi Zou Jiajun Liu Wenjun Ke |
author_sort | Peng Wang |
collection | DOAJ |
description | With an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for concepts and properties. These approaches require manually designed rules that not only fail to address the heterogeneity of domain ontology terminology and the ambiguity of multiple meanings of words, but also make it difficult to capture structural information in ontologies that contain a large amount of semantics during matching. Recently, various knowledge graph (KG) embedding techniques utilizing deep learning methods to deal with the heterogeneity in knowledge graphs (KGs), have quickly gained massive attention. However, KG embedding focuses mainly on entity alignment (EA). EA tasks and ontology matching (OM) tasks differ dramatically in terms of matching elements, semantic information and application scenarios, etc., hence these methods cannot be applied directly to biomedical ontologies that contain abstract concepts but almost no entities. To tackle these issues, this paper proposes a novel approach called BioOntGCN that directly learns embeddings of ontology-pairs for biomedical ontology matching. Specifically, we first generate a pair-wise connectivity graph (PCG) of two ontologies, whose nodes are concept-pairs and edges correspond to property-pairs. Subsequently, we learn node embeddings of the PCG to predicate the matching results through following phases: 1) A convolutional neural network (CNN) to extract the similarity feature vectors of nodes; 2) A graph convolutional network (GCN) to propagate the similarity features and obtain the final embeddings of concept-pairs. Consequently, the biomedical ontology matching problem is transformed into a binary classification problem. We conduct systematic experiments on real-world biomedical ontologies in Ontology Alignment Evaluation Initiative (OAEI), and the results show that our approach significantly outperforms other entity alignment methods and achieves state-of-the-art performance. This indicates that BioOntGCN is more applicable to ontology matching than the EA method. At the same time, BioOntGCN substantially achieves superior performance compared with previous ontology matching (OM) systems, which suggests that BioOntGCN based on the representation learning is more effective than the traditional approaches. |
first_indexed | 2024-12-12T06:50:20Z |
format | Article |
id | doaj.art-c97d42c82f244b93a9f3ca56072d13a5 |
institution | Directory Open Access Journal |
issn | 1551-0018 |
language | English |
last_indexed | 2024-12-12T06:50:20Z |
publishDate | 2022-06-01 |
publisher | AIMS Press |
record_format | Article |
series | Mathematical Biosciences and Engineering |
spelling | doaj.art-c97d42c82f244b93a9f3ca56072d13a52022-12-22T00:34:05ZengAIMS PressMathematical Biosciences and Engineering1551-00182022-06-011988479850410.3934/mbe.2022394Matching biomedical ontologies with GCN-based feature propagationPeng Wang0Shiyi Zou 1Jiajun Liu2Wenjun Ke31. School of Computer Science and Engineering, Southeast University, Nanjing 210018, China 2. Monash University Joint Graduate School, Southeast University, Suzhou 215123, China 3. School of Cyber Science and Engineering, Southeast University, Nanjing 210018, China2. Monash University Joint Graduate School, Southeast University, Suzhou 215123, China1. School of Computer Science and Engineering, Southeast University, Nanjing 210018, China4. Beijing Institute of Computer Technology and Application, Beijing 100854, ChinaWith an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for concepts and properties. These approaches require manually designed rules that not only fail to address the heterogeneity of domain ontology terminology and the ambiguity of multiple meanings of words, but also make it difficult to capture structural information in ontologies that contain a large amount of semantics during matching. Recently, various knowledge graph (KG) embedding techniques utilizing deep learning methods to deal with the heterogeneity in knowledge graphs (KGs), have quickly gained massive attention. However, KG embedding focuses mainly on entity alignment (EA). EA tasks and ontology matching (OM) tasks differ dramatically in terms of matching elements, semantic information and application scenarios, etc., hence these methods cannot be applied directly to biomedical ontologies that contain abstract concepts but almost no entities. To tackle these issues, this paper proposes a novel approach called BioOntGCN that directly learns embeddings of ontology-pairs for biomedical ontology matching. Specifically, we first generate a pair-wise connectivity graph (PCG) of two ontologies, whose nodes are concept-pairs and edges correspond to property-pairs. Subsequently, we learn node embeddings of the PCG to predicate the matching results through following phases: 1) A convolutional neural network (CNN) to extract the similarity feature vectors of nodes; 2) A graph convolutional network (GCN) to propagate the similarity features and obtain the final embeddings of concept-pairs. Consequently, the biomedical ontology matching problem is transformed into a binary classification problem. We conduct systematic experiments on real-world biomedical ontologies in Ontology Alignment Evaluation Initiative (OAEI), and the results show that our approach significantly outperforms other entity alignment methods and achieves state-of-the-art performance. This indicates that BioOntGCN is more applicable to ontology matching than the EA method. At the same time, BioOntGCN substantially achieves superior performance compared with previous ontology matching (OM) systems, which suggests that BioOntGCN based on the representation learning is more effective than the traditional approaches.https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTMLontology matchingbiomedical ontologyconvolutional neural networkgraph convolutional network |
spellingShingle | Peng Wang Shiyi Zou Jiajun Liu Wenjun Ke Matching biomedical ontologies with GCN-based feature propagation Mathematical Biosciences and Engineering ontology matching biomedical ontology convolutional neural network graph convolutional network |
title | Matching biomedical ontologies with GCN-based feature propagation |
title_full | Matching biomedical ontologies with GCN-based feature propagation |
title_fullStr | Matching biomedical ontologies with GCN-based feature propagation |
title_full_unstemmed | Matching biomedical ontologies with GCN-based feature propagation |
title_short | Matching biomedical ontologies with GCN-based feature propagation |
title_sort | matching biomedical ontologies with gcn based feature propagation |
topic | ontology matching biomedical ontology convolutional neural network graph convolutional network |
url | https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTML |
work_keys_str_mv | AT pengwang matchingbiomedicalontologieswithgcnbasedfeaturepropagation AT shiyizou matchingbiomedicalontologieswithgcnbasedfeaturepropagation AT jiajunliu matchingbiomedicalontologieswithgcnbasedfeaturepropagation AT wenjunke matchingbiomedicalontologieswithgcnbasedfeaturepropagation |