Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs

A labeled graph is a special structure with node identification capability, which is often used in information networks, biological networks, and other fields. The subgraph query is widely used as an important means of graph data analysis. As the size of the labeled graph increases and changes dynam...

Full description

Bibliographic Details
Main Authors: Xiaohuan Shan, Chunjie Jia, Linlin Ding, Xingyan Ding, Baoyan Song
Format: Article
Language:English
Published: MDPI AG 2019-02-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/10/2/61
_version_ 1818503616812548096
author Xiaohuan Shan
Chunjie Jia
Linlin Ding
Xingyan Ding
Baoyan Song
author_facet Xiaohuan Shan
Chunjie Jia
Linlin Ding
Xingyan Ding
Baoyan Song
author_sort Xiaohuan Shan
collection DOAJ
description A labeled graph is a special structure with node identification capability, which is often used in information networks, biological networks, and other fields. The subgraph query is widely used as an important means of graph data analysis. As the size of the labeled graph increases and changes dynamically, users tend to focus on the high-match results that are of interest to them, and they want to take advantage of the relationship and number of results to get the results of the query quickly. For this reason, we consider the individual needs of users and propose a dynamic Top-K interesting subgraph query. This method establishes a novel graph topology feature index (GTSF index) including a node topology feature index (NTF index) and an edge feature index (EF index), which can effectively prune and filter the invalid nodes and edges that do not meet the restricted condition. The multi-factor candidate set filtering strategy is proposed based on the GTSF index, which can be further pruned to obtain fewer candidate sets. Then, we propose a dynamic Top-K interesting subgraph query method based on the idea of the sliding window to realize the dynamic modification of the matching results of the subgraph in the dynamic evolution of the label graph, to ensure real-time and accurate results of the query. In addition, considering the factors, such as frequent Input/Output (I/O) and network communication overheads, the optimization mechanism of the graph changes and an incremental maintenance strategy for the index are proposed to reduce the huge cost of redundant operation and global updates. The experimental results show that the proposed method can effectively deal with a dynamic Top-K interesting subgraph query on a large-scale labeled graph, at the same time the optimization mechanism of graph changes and the incremental maintenance strategy of the index can effectively reduce the maintenance overheads.
first_indexed 2024-12-10T21:26:19Z
format Article
id doaj.art-0ecfd4772f104f94b097ba979a388804
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-12-10T21:26:19Z
publishDate 2019-02-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-0ecfd4772f104f94b097ba979a3888042022-12-22T01:32:58ZengMDPI AGInformation2078-24892019-02-011026110.3390/info10020061info10020061Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled GraphsXiaohuan Shan0Chunjie Jia1Linlin Ding2Xingyan Ding3Baoyan Song4School of Information, Liaoning University, Shenyang, 110036, ChinaSchool of Information, Liaoning University, Shenyang, 110036, ChinaSchool of Information, Liaoning University, Shenyang, 110036, ChinaSchool of Information, Liaoning University, Shenyang, 110036, ChinaSchool of Information, Liaoning University, Shenyang, 110036, ChinaA labeled graph is a special structure with node identification capability, which is often used in information networks, biological networks, and other fields. The subgraph query is widely used as an important means of graph data analysis. As the size of the labeled graph increases and changes dynamically, users tend to focus on the high-match results that are of interest to them, and they want to take advantage of the relationship and number of results to get the results of the query quickly. For this reason, we consider the individual needs of users and propose a dynamic Top-K interesting subgraph query. This method establishes a novel graph topology feature index (GTSF index) including a node topology feature index (NTF index) and an edge feature index (EF index), which can effectively prune and filter the invalid nodes and edges that do not meet the restricted condition. The multi-factor candidate set filtering strategy is proposed based on the GTSF index, which can be further pruned to obtain fewer candidate sets. Then, we propose a dynamic Top-K interesting subgraph query method based on the idea of the sliding window to realize the dynamic modification of the matching results of the subgraph in the dynamic evolution of the label graph, to ensure real-time and accurate results of the query. In addition, considering the factors, such as frequent Input/Output (I/O) and network communication overheads, the optimization mechanism of the graph changes and an incremental maintenance strategy for the index are proposed to reduce the huge cost of redundant operation and global updates. The experimental results show that the proposed method can effectively deal with a dynamic Top-K interesting subgraph query on a large-scale labeled graph, at the same time the optimization mechanism of graph changes and the incremental maintenance strategy of the index can effectively reduce the maintenance overheads.https://www.mdpi.com/2078-2489/10/2/61labeled graphdynamic Top-K queryinteresting subgraphsliding window
spellingShingle Xiaohuan Shan
Chunjie Jia
Linlin Ding
Xingyan Ding
Baoyan Song
Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
Information
labeled graph
dynamic Top-K query
interesting subgraph
sliding window
title Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
title_full Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
title_fullStr Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
title_full_unstemmed Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
title_short Dynamic Top-K Interesting Subgraph Query on Large-Scale Labeled Graphs
title_sort dynamic top k interesting subgraph query on large scale labeled graphs
topic labeled graph
dynamic Top-K query
interesting subgraph
sliding window
url https://www.mdpi.com/2078-2489/10/2/61
work_keys_str_mv AT xiaohuanshan dynamictopkinterestingsubgraphqueryonlargescalelabeledgraphs
AT chunjiejia dynamictopkinterestingsubgraphqueryonlargescalelabeledgraphs
AT linlinding dynamictopkinterestingsubgraphqueryonlargescalelabeledgraphs
AT xingyanding dynamictopkinterestingsubgraphqueryonlargescalelabeledgraphs
AT baoyansong dynamictopkinterestingsubgraphqueryonlargescalelabeledgraphs