Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement

As a kind of special graph of structured data, a hypergraph can intuitively describe not only the higher-order relation and complex connection mode between nodes but also the implicit relation between nodes. Aiming at the limitation of traditional distance measurement in high-dimensional data, a new...

Full description

Bibliographic Details
Main Authors: Jing Wang, Siwu Lan, Xiangyu Li, Meng Lu, Jingfeng Guo, Chunying Zhang, Bin Liu
Format: Article
Language:English
Published: MDPI AG 2023-10-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/20/4375
_version_ 1797573974556672000
author Jing Wang
Siwu Lan
Xiangyu Li
Meng Lu
Jingfeng Guo
Chunying Zhang
Bin Liu
author_facet Jing Wang
Siwu Lan
Xiangyu Li
Meng Lu
Jingfeng Guo
Chunying Zhang
Bin Liu
author_sort Jing Wang
collection DOAJ
description As a kind of special graph of structured data, a hypergraph can intuitively describe not only the higher-order relation and complex connection mode between nodes but also the implicit relation between nodes. Aiming at the limitation of traditional distance measurement in high-dimensional data, a new method of hypergraph construction based on set pair theory is proposed in this paper. By means of dividing the relationship between data attributes, the set pair connection degree between samples is calculated, and the set pair distance between samples is obtained. Then, on the basis of set pair distance, the combination technique of <i>k</i>-nearest neighbor and ε radius is used to construct a hypergraph, and high-dimensional expression and hypergraph clustering are demonstrated experimentally. By performing experiments on different datasets on the Kaggle open-source dataset platform, the comparison of cluster purity, the Rand coefficient, and normalized mutual information are shown to demonstrate that this distance measurement method is more effective in high-dimensional expression and exhibits a more significant performance improvement in spectral clustering.
first_indexed 2024-03-10T21:17:42Z
format Article
id doaj.art-8d04b37a81ee46d0a16d90b101a2f221
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T21:17:42Z
publishDate 2023-10-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-8d04b37a81ee46d0a16d90b101a2f2212023-11-19T16:20:50ZengMDPI AGElectronics2079-92922023-10-011220437510.3390/electronics12204375Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance MeasurementJing Wang0Siwu Lan1Xiangyu Li2Meng Lu3Jingfeng Guo4Chunying Zhang5Bin Liu6College of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, ChinaCollege of Science, North China University of Science and Technology, Tangshan 063210, ChinaCollege of Science, North China University of Science and Technology, Tangshan 063210, ChinaCollege of Science, North China University of Science and Technology, Tangshan 063210, ChinaCollege of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, ChinaCollege of Science, North China University of Science and Technology, Tangshan 063210, ChinaBig Data and Social Computing Research Center, Hebei University of Science and Technology, Shijiazhuang 050018, ChinaAs a kind of special graph of structured data, a hypergraph can intuitively describe not only the higher-order relation and complex connection mode between nodes but also the implicit relation between nodes. Aiming at the limitation of traditional distance measurement in high-dimensional data, a new method of hypergraph construction based on set pair theory is proposed in this paper. By means of dividing the relationship between data attributes, the set pair connection degree between samples is calculated, and the set pair distance between samples is obtained. Then, on the basis of set pair distance, the combination technique of <i>k</i>-nearest neighbor and ε radius is used to construct a hypergraph, and high-dimensional expression and hypergraph clustering are demonstrated experimentally. By performing experiments on different datasets on the Kaggle open-source dataset platform, the comparison of cluster purity, the Rand coefficient, and normalized mutual information are shown to demonstrate that this distance measurement method is more effective in high-dimensional expression and exhibits a more significant performance improvement in spectral clustering.https://www.mdpi.com/2079-9292/12/20/4375high-dimensional dataset pair distancehypergraph constructionhigh-dimensional representationhypergraph spectral clustering
spellingShingle Jing Wang
Siwu Lan
Xiangyu Li
Meng Lu
Jingfeng Guo
Chunying Zhang
Bin Liu
Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
Electronics
high-dimensional data
set pair distance
hypergraph construction
high-dimensional representation
hypergraph spectral clustering
title Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
title_full Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
title_fullStr Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
title_full_unstemmed Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
title_short Research on the Method of Hypergraph Construction of Information Systems Based on Set Pair Distance Measurement
title_sort research on the method of hypergraph construction of information systems based on set pair distance measurement
topic high-dimensional data
set pair distance
hypergraph construction
high-dimensional representation
hypergraph spectral clustering
url https://www.mdpi.com/2079-9292/12/20/4375
work_keys_str_mv AT jingwang researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT siwulan researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT xiangyuli researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT menglu researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT jingfengguo researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT chunyingzhang researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement
AT binliu researchonthemethodofhypergraphconstructionofinformationsystemsbasedonsetpairdistancemeasurement