Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels

The problem of data incompleteness has become an intractable problem for network representation learning(NRL) methods,which makes existing NRL algorithms fail to achieve the expected results.Despite numerous efforts have done to solve the issue,most of previous methods mainly focused on the lack of...

Full description

Bibliographic Details
Main Author: FU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi
Format: Article
Language:zho
Published: Editorial office of Computer Science 2022-11-01
Series:Jisuanji kexue
Subjects:
Online Access:https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-11-109.pdf
_version_ 1797845105756864512
author FU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi
author_facet FU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi
author_sort FU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi
collection DOAJ
description The problem of data incompleteness has become an intractable problem for network representation learning(NRL) methods,which makes existing NRL algorithms fail to achieve the expected results.Despite numerous efforts have done to solve the issue,most of previous methods mainly focused on the lack of label information,and rarely consider data imbalance phenomenon,especially the completely imbalance problem that a certain class labels are completely missing.Learning algorithms to solve such problems are still explored,for example,some neighborhood feature aggregation process prefers to focus on network structure information,while disregarding relationships between attribute features and semantic features,of which utilization may enhance representation results.To address the above problems,a semantic information enhanced network embedding with completely imbalanced labels(SECT)method that combines attribute features and structural features is proposed in this paper.Firstly,SECT introduces attention mechanism in the supervised learning for obtaining the semantic information vector on precondition of considering the relationship between the attribute space and the semantic space.Secondly,a variational autoencoder is applied to extract structural features under an unsupervised mode to enhance the robustness of the algorithm.Finally,both semantic and structural information are integrated in the embedded space.Compared with two state-of-the-art algorithms,the node classification results on public data sets Cora and Citeseer indicate the network vector obtained by SECT algorithm outperforms others and increases by 0.86%~1.97% under Mirco-F1.As well as the node visualization results exhibit that compared with other algorithms,the vector distances among different-class clusters obtained by SECT are larger,the clusters of same class are more compact,and the class boundaries are more obvious.All these experimental results demonstrate the effectiveness of SECT,which mainly benefited from a better fusion of semantic information in the low-dimensional embedding space,thus extremely improves the performance of node classification tasks under completely imbalanced labels.
first_indexed 2024-04-09T17:33:10Z
format Article
id doaj.art-042f3c84648e45dbbd2316837006e390
institution Directory Open Access Journal
issn 1002-137X
language zho
last_indexed 2024-04-09T17:33:10Z
publishDate 2022-11-01
publisher Editorial office of Computer Science
record_format Article
series Jisuanji kexue
spelling doaj.art-042f3c84648e45dbbd2316837006e3902023-04-18T02:32:50ZzhoEditorial office of Computer ScienceJisuanji kexue1002-137X2022-11-01491110911610.11896/jsjkx.210900101Semantic Information Enhanced Network Embedding with Completely Imbalanced LabelsFU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi0College of Artificial Intelligence,Hebei University of Technology,Tianjin 300401,China ;Key Laboratory of Big Data Computing,Tianjin 300401,ChinaThe problem of data incompleteness has become an intractable problem for network representation learning(NRL) methods,which makes existing NRL algorithms fail to achieve the expected results.Despite numerous efforts have done to solve the issue,most of previous methods mainly focused on the lack of label information,and rarely consider data imbalance phenomenon,especially the completely imbalance problem that a certain class labels are completely missing.Learning algorithms to solve such problems are still explored,for example,some neighborhood feature aggregation process prefers to focus on network structure information,while disregarding relationships between attribute features and semantic features,of which utilization may enhance representation results.To address the above problems,a semantic information enhanced network embedding with completely imbalanced labels(SECT)method that combines attribute features and structural features is proposed in this paper.Firstly,SECT introduces attention mechanism in the supervised learning for obtaining the semantic information vector on precondition of considering the relationship between the attribute space and the semantic space.Secondly,a variational autoencoder is applied to extract structural features under an unsupervised mode to enhance the robustness of the algorithm.Finally,both semantic and structural information are integrated in the embedded space.Compared with two state-of-the-art algorithms,the node classification results on public data sets Cora and Citeseer indicate the network vector obtained by SECT algorithm outperforms others and increases by 0.86%~1.97% under Mirco-F1.As well as the node visualization results exhibit that compared with other algorithms,the vector distances among different-class clusters obtained by SECT are larger,the clusters of same class are more compact,and the class boundaries are more obvious.All these experimental results demonstrate the effectiveness of SECT,which mainly benefited from a better fusion of semantic information in the low-dimensional embedding space,thus extremely improves the performance of node classification tasks under completely imbalanced labels.https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-11-109.pdfnetwork representation learning|graph embedding|graph attention network|completely imbalanced label|varia-tional autoencoders
spellingShingle FU Kun, GUO Yun-peng, ZHUO Jia-ming, LI Jia-ning, LIU Qi
Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
Jisuanji kexue
network representation learning|graph embedding|graph attention network|completely imbalanced label|varia-tional autoencoders
title Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
title_full Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
title_fullStr Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
title_full_unstemmed Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
title_short Semantic Information Enhanced Network Embedding with Completely Imbalanced Labels
title_sort semantic information enhanced network embedding with completely imbalanced labels
topic network representation learning|graph embedding|graph attention network|completely imbalanced label|varia-tional autoencoders
url https://www.jsjkx.com/fileup/1002-137X/PDF/1002-137X-2022-49-11-109.pdf
work_keys_str_mv AT fukunguoyunpengzhuojiaminglijianingliuqi semanticinformationenhancednetworkembeddingwithcompletelyimbalancedlabels