A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement
Name disambiguation becomes increasingly important in information retrieval in the era of big data, while on how to further improve the accuracy of disambiguating duplicate names, the existing models are encountering many problems or facing many challenges, such as 1) how to capture and sufficiently...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2020-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8986661/ |
_version_ | 1831679299944448000 |
---|---|
author | Chunhui Deng Huifang Deng Chaoran Li |
author_facet | Chunhui Deng Huifang Deng Chaoran Li |
author_sort | Chunhui Deng |
collection | DOAJ |
description | Name disambiguation becomes increasingly important in information retrieval in the era of big data, while on how to further improve the accuracy of disambiguating duplicate names, the existing models are encountering many problems or facing many challenges, such as 1) how to capture and sufficiently use the global structure features of the network; 2) how to handle the complexity of feature data sets; 3) how to differentiate different types of feature relations, and 4) how to deal with the feature information missing etc. All these challenges are very important issues in further improving the accuracy. In order to address the above issues, this paper proposed a novel model which is called HRFAENE (Heterogeneous Relation Fusion and Attribute Enhanced Network Embedding Model). This model considers both feature network structure information (multiple relations) and document attribute features. The feature network structure information is represented by a scalable loss function which is designed based on pairwise constraints. The document attribute features are comprehensively extracted through multiple heterogeneous information networks which are constructed based on strong features. In order to better identify the disambiguation entity, this model uses weak features as node attributes in strong feature networks and iteratively learns network structure information and node attribute information. The experimental results show that the proposed model significantly outperforms the existing models in the disambiguation accuracy and has good stability, indicating that the model is very effective as expected and can be applied in reality. |
first_indexed | 2024-12-20T05:18:15Z |
format | Article |
id | doaj.art-a5e22c0ad42f41cc82165453fe529c63 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-12-20T05:18:15Z |
publishDate | 2020-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-a5e22c0ad42f41cc82165453fe529c632022-12-21T19:52:07ZengIEEEIEEE Access2169-35362020-01-018283752838410.1109/ACCESS.2020.29723728986661A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute EnhancementChunhui Deng0https://orcid.org/0000-0001-8297-9934Huifang Deng1https://orcid.org/0000-0001-6394-4496Chaoran Li2https://orcid.org/0000-0001-9546-2934School of Computer Engineering, Guangzhou College, South China University of Technology, Guangzhou, ChinaSchool of Computer Science and Engineering, South China University of Technology, Guangzhou, ChinaSchool of Computer Science and Engineering, South China University of Technology, Guangzhou, ChinaName disambiguation becomes increasingly important in information retrieval in the era of big data, while on how to further improve the accuracy of disambiguating duplicate names, the existing models are encountering many problems or facing many challenges, such as 1) how to capture and sufficiently use the global structure features of the network; 2) how to handle the complexity of feature data sets; 3) how to differentiate different types of feature relations, and 4) how to deal with the feature information missing etc. All these challenges are very important issues in further improving the accuracy. In order to address the above issues, this paper proposed a novel model which is called HRFAENE (Heterogeneous Relation Fusion and Attribute Enhanced Network Embedding Model). This model considers both feature network structure information (multiple relations) and document attribute features. The feature network structure information is represented by a scalable loss function which is designed based on pairwise constraints. The document attribute features are comprehensively extracted through multiple heterogeneous information networks which are constructed based on strong features. In order to better identify the disambiguation entity, this model uses weak features as node attributes in strong feature networks and iteratively learns network structure information and node attribute information. The experimental results show that the proposed model significantly outperforms the existing models in the disambiguation accuracy and has good stability, indicating that the model is very effective as expected and can be applied in reality.https://ieeexplore.ieee.org/document/8986661/Name disambiguationscholar entity identificationnetwork representation modelheterogeneous relation fusionattribute enhanced network embeddingnode (document) attributes |
spellingShingle | Chunhui Deng Huifang Deng Chaoran Li A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement IEEE Access Name disambiguation scholar entity identification network representation model heterogeneous relation fusion attribute enhanced network embedding node (document) attributes |
title | A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement |
title_full | A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement |
title_fullStr | A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement |
title_full_unstemmed | A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement |
title_short | A Scholar Disambiguation Method Based on Heterogeneous Relation-Fusion and Attribute Enhancement |
title_sort | scholar disambiguation method based on heterogeneous relation fusion and attribute enhancement |
topic | Name disambiguation scholar entity identification network representation model heterogeneous relation fusion attribute enhanced network embedding node (document) attributes |
url | https://ieeexplore.ieee.org/document/8986661/ |
work_keys_str_mv | AT chunhuideng ascholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement AT huifangdeng ascholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement AT chaoranli ascholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement AT chunhuideng scholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement AT huifangdeng scholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement AT chaoranli scholardisambiguationmethodbasedonheterogeneousrelationfusionandattributeenhancement |