Diting: An Author Disambiguation Method Based on Network Representation Learning

It is important to disambiguate names among persons in many scenarios. In this work, we propose an unsupervised method Diting and a semi-supervised method Diting++ for author disambiguation. In Diting, we learn a low-dimensional vector to represent each paper in networks, which...

Bibliographic Details
Main Authors: Liwen Peng, Siqi Shen, Jun Xu, Yongquan Fu, Dongsheng Li, Adele Lu Jia
Format: Article
Language: English
Published: IEEE 2019-01-01
Series: IEEE Access
Subjects: Network representation learning; network embedding; author disambiguation
Online Access: https://ieeexplore.ieee.org/document/8844683/
collection DOAJ
description Disambiguating personal names is important in many scenarios. In this work, we propose an unsupervised method, Diting, and a semi-supervised method, Diting++, for author disambiguation. In Diting, we learn a low-dimensional vector to represent each paper in networks that are formed by connecting papers through multiple types of relationships (such as co-authorship). During representation learning, we focus on maximizing the gap between positive edges and negative edges. Further, we propose a clustering algorithm that associates papers with their real-life authors. To make full use of authorship information, which is easy to obtain from the authors’ homepages, we design Diting++ to improve name disambiguation performance. Diting++ uses the authorship information listed on the authors’ homepages to construct label networks, and uses a network representation learning method to learn paper representations from the label networks together with the other networks. Further, Diting++ uses a semi-supervised clustering method to partition the learned paper representations into disjoint groups, each of which belongs to a distinct author. By making use of the label information, the clustering method places papers written by the same author in the same group and papers written by different authors in different groups. Through extensive experiments, we show that our methods significantly outperform state-of-the-art author disambiguation methods.
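The description above says that representation learning maximizes the gap between positive edges and negative edges, but this record does not give the objective itself. As an illustration only, the sketch below uses a generic pairwise hinge (margin) loss over dot-product edge scores. The function name `margin_gap_loss`, the dot-product scoring, the one-to-one pairing of positive and negative edges, and the `margin` parameter are all assumptions made for this example, not details taken from the paper.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]
Edge = Tuple[int, int]  # (paper index, paper index)


def dot(u: Vector, v: Vector) -> float:
    """Dot product, used here as an assumed edge score."""
    return sum(a * b for a, b in zip(u, v))


def margin_gap_loss(
    emb: List[Vector],
    pos_edges: List[Edge],
    neg_edges: List[Edge],
    margin: float = 1.0,
) -> float:
    """Mean hinge loss over paired positive and negative edges.

    Each pair contributes max(0, margin - (pos_score - neg_score)), so the
    loss reaches zero once every positive edge outscores its paired
    negative edge by at least `margin`.
    """
    if not pos_edges or len(pos_edges) != len(neg_edges):
        raise ValueError("need equally many (and at least one) positive and negative edges")
    total = 0.0
    for (i, j), (k, l) in zip(pos_edges, neg_edges):
        gap = dot(emb[i], emb[j]) - dot(emb[k], emb[l])
        total += max(0.0, margin - gap)
    return total / len(pos_edges)


# Hypothetical toy embeddings: papers 0 and 1 are close, paper 2 is orthogonal.
emb = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
loss_good = margin_gap_loss(emb, [(0, 1)], [(0, 2)])  # gap already equals the margin
loss_bad = margin_gap_loss(emb, [(0, 2)], [(0, 1)])   # positive edge scores below negative
```

In a training loop, a loss of this kind would be minimized with respect to the embeddings, which widens the score gap between linked and unlinked paper pairs. The clustering step mentioned in the description is not sketched here.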
first_indexed 2024-12-10T11:09:33Z
format Article
id doaj.art-51109c0f8d1d4415b39e811d8161bdf5
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-10T11:09:33Z
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-51109c0f8d1d4415b39e811d8161bdf5; 2022-12-22T01:51:27Z; eng; IEEE; IEEE Access; ISSN 2169-3536; 2019-01-01; vol. 7, pp. 135539-135555; DOI 10.1109/ACCESS.2019.2942477; IEEE Xplore document 8844683
Diting: An Author Disambiguation Method Based on Network Representation Learning
Liwen Peng (https://orcid.org/0000-0003-1202-9583), School of Computer, National University of Defense Technology, Changsha, China
Siqi Shen, School of Computer, National University of Defense Technology, Changsha, China
Jun Xu, Ant Financial Services Group, Hangzhou, China
Yongquan Fu, School of Computer, National University of Defense Technology, Changsha, China
Dongsheng Li, School of Computer, National University of Defense Technology, Changsha, China
Adele Lu Jia, College of Information and Electrical Engineering, China Agricultural University, Beijing, China
https://ieeexplore.ieee.org/document/8844683/
Network representation learning; network embedding; author disambiguation
title Diting: An Author Disambiguation Method Based on Network Representation Learning
topic Network representation learning
network embedding
author disambiguation
url https://ieeexplore.ieee.org/document/8844683/