Matching biomedical ontologies with GCN-based feature propagation

With an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for conce...

Full description

Bibliographic Details
Main Authors: Peng Wang, Shiyi Zou, Jiajun Liu, Wenjun Ke
Format: Article
Language:English
Published: AIMS Press 2022-06-01
Series:Mathematical Biosciences and Engineering
Subjects:
Online Access:https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTML
_version_ 1828801657056526336
author Peng Wang
Shiyi Zou
Jiajun Liu
Wenjun Ke
author_facet Peng Wang
Shiyi Zou
Jiajun Liu
Wenjun Ke
author_sort Peng Wang
collection DOAJ
description With an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for concepts and properties. These approaches require manually designed rules that not only fail to address the heterogeneity of domain ontology terminology and the ambiguity of multiple meanings of words, but also make it difficult to capture structural information in ontologies that contain a large amount of semantics during matching. Recently, various knowledge graph (KG) embedding techniques utilizing deep learning methods to deal with the heterogeneity in knowledge graphs (KGs), have quickly gained massive attention. However, KG embedding focuses mainly on entity alignment (EA). EA tasks and ontology matching (OM) tasks differ dramatically in terms of matching elements, semantic information and application scenarios, etc., hence these methods cannot be applied directly to biomedical ontologies that contain abstract concepts but almost no entities. To tackle these issues, this paper proposes a novel approach called BioOntGCN that directly learns embeddings of ontology-pairs for biomedical ontology matching. Specifically, we first generate a pair-wise connectivity graph (PCG) of two ontologies, whose nodes are concept-pairs and edges correspond to property-pairs. Subsequently, we learn node embeddings of the PCG to predicate the matching results through following phases: 1) A convolutional neural network (CNN) to extract the similarity feature vectors of nodes; 2) A graph convolutional network (GCN) to propagate the similarity features and obtain the final embeddings of concept-pairs. Consequently, the biomedical ontology matching problem is transformed into a binary classification problem. We conduct systematic experiments on real-world biomedical ontologies in Ontology Alignment Evaluation Initiative (OAEI), and the results show that our approach significantly outperforms other entity alignment methods and achieves state-of-the-art performance. This indicates that BioOntGCN is more applicable to ontology matching than the EA method. At the same time, BioOntGCN substantially achieves superior performance compared with previous ontology matching (OM) systems, which suggests that BioOntGCN based on the representation learning is more effective than the traditional approaches.
first_indexed 2024-12-12T06:50:20Z
format Article
id doaj.art-c97d42c82f244b93a9f3ca56072d13a5
institution Directory Open Access Journal
issn 1551-0018
language English
last_indexed 2024-12-12T06:50:20Z
publishDate 2022-06-01
publisher AIMS Press
record_format Article
series Mathematical Biosciences and Engineering
spelling doaj.art-c97d42c82f244b93a9f3ca56072d13a52022-12-22T00:34:05ZengAIMS PressMathematical Biosciences and Engineering1551-00182022-06-011988479850410.3934/mbe.2022394Matching biomedical ontologies with GCN-based feature propagationPeng Wang0Shiyi Zou 1Jiajun Liu2Wenjun Ke31. School of Computer Science and Engineering, Southeast University, Nanjing 210018, China 2. Monash University Joint Graduate School, Southeast University, Suzhou 215123, China 3. School of Cyber Science and Engineering, Southeast University, Nanjing 210018, China2. Monash University Joint Graduate School, Southeast University, Suzhou 215123, China1. School of Computer Science and Engineering, Southeast University, Nanjing 210018, China4. Beijing Institute of Computer Technology and Application, Beijing 100854, ChinaWith an increasing number of biomedical ontologies being evolved independently, matching these ontologies to solve the interoperability problem has become a critical issue in biomedical applications. Traditional biomedical ontology matching methods are mostly based on rules or similarities for concepts and properties. These approaches require manually designed rules that not only fail to address the heterogeneity of domain ontology terminology and the ambiguity of multiple meanings of words, but also make it difficult to capture structural information in ontologies that contain a large amount of semantics during matching. Recently, various knowledge graph (KG) embedding techniques utilizing deep learning methods to deal with the heterogeneity in knowledge graphs (KGs), have quickly gained massive attention. However, KG embedding focuses mainly on entity alignment (EA). EA tasks and ontology matching (OM) tasks differ dramatically in terms of matching elements, semantic information and application scenarios, etc., hence these methods cannot be applied directly to biomedical ontologies that contain abstract concepts but almost no entities. To tackle these issues, this paper proposes a novel approach called BioOntGCN that directly learns embeddings of ontology-pairs for biomedical ontology matching. Specifically, we first generate a pair-wise connectivity graph (PCG) of two ontologies, whose nodes are concept-pairs and edges correspond to property-pairs. Subsequently, we learn node embeddings of the PCG to predicate the matching results through following phases: 1) A convolutional neural network (CNN) to extract the similarity feature vectors of nodes; 2) A graph convolutional network (GCN) to propagate the similarity features and obtain the final embeddings of concept-pairs. Consequently, the biomedical ontology matching problem is transformed into a binary classification problem. We conduct systematic experiments on real-world biomedical ontologies in Ontology Alignment Evaluation Initiative (OAEI), and the results show that our approach significantly outperforms other entity alignment methods and achieves state-of-the-art performance. This indicates that BioOntGCN is more applicable to ontology matching than the EA method. At the same time, BioOntGCN substantially achieves superior performance compared with previous ontology matching (OM) systems, which suggests that BioOntGCN based on the representation learning is more effective than the traditional approaches.https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTMLontology matchingbiomedical ontologyconvolutional neural networkgraph convolutional network
spellingShingle Peng Wang
Shiyi Zou
Jiajun Liu
Wenjun Ke
Matching biomedical ontologies with GCN-based feature propagation
Mathematical Biosciences and Engineering
ontology matching
biomedical ontology
convolutional neural network
graph convolutional network
title Matching biomedical ontologies with GCN-based feature propagation
title_full Matching biomedical ontologies with GCN-based feature propagation
title_fullStr Matching biomedical ontologies with GCN-based feature propagation
title_full_unstemmed Matching biomedical ontologies with GCN-based feature propagation
title_short Matching biomedical ontologies with GCN-based feature propagation
title_sort matching biomedical ontologies with gcn based feature propagation
topic ontology matching
biomedical ontology
convolutional neural network
graph convolutional network
url https://www.aimspress.com/article/doi/10.3934/mbe.2022394?viewType=HTML
work_keys_str_mv AT pengwang matchingbiomedicalontologieswithgcnbasedfeaturepropagation
AT shiyizou matchingbiomedicalontologieswithgcnbasedfeaturepropagation
AT jiajunliu matchingbiomedicalontologieswithgcnbasedfeaturepropagation
AT wenjunke matchingbiomedicalontologieswithgcnbasedfeaturepropagation