Distributed Subgraph Query Processing Using Filtering Scores on Spark
As various services have been generating large-scale graphs to represent multiple relationships between objects, studies have been conducted to obtain subgraphs with particular patterns. In this paper, we propose a distributed query processing method to efficiently search a subgraph for a large grap...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-08-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/12/17/3645 |
_version_ | 1797582680458526720 |
---|---|
author | Kyoungsoo Bok Minyoung Kim Hyeonbyeong Lee Dojin Choi Jongtae Lim Jaesoo Yoo |
author_facet | Kyoungsoo Bok Minyoung Kim Hyeonbyeong Lee Dojin Choi Jongtae Lim Jaesoo Yoo |
author_sort | Kyoungsoo Bok |
collection | DOAJ |
description | As various services have been generating large-scale graphs to represent multiple relationships between objects, studies have been conducted to obtain subgraphs with particular patterns. In this paper, we propose a distributed query processing method to efficiently search a subgraph for a large graph on Spark. To reduce unnecessary processing costs, the search order is determined by filtering scores using the probability distribution. The partitioned queries are searched in parallel in the distributed graph of each slave node according to the search order, and the local search results obtained from each slave node are combined and returned. The query is partitioned in triplets based on the determined search order. The performance of the proposed method is compared with the performance of existing methods to demonstrate its superiority. |
first_indexed | 2024-03-10T23:24:52Z |
format | Article |
id | doaj.art-99b53b91826648318ff439c619dc2934 |
institution | Directory Open Access Journal |
issn | 2079-9292 |
language | English |
last_indexed | 2024-03-10T23:24:52Z |
publishDate | 2023-08-01 |
publisher | MDPI AG |
record_format | Article |
series | Electronics |
spelling | doaj.art-99b53b91826648318ff439c619dc29342023-11-19T08:02:05ZengMDPI AGElectronics2079-92922023-08-011217364510.3390/electronics12173645Distributed Subgraph Query Processing Using Filtering Scores on SparkKyoungsoo Bok0Minyoung Kim1Hyeonbyeong Lee2Dojin Choi3Jongtae Lim4Jaesoo Yoo5Department of Artificial Intelligence Convergence, Wonkwang University, Iksandae 460, Iksan 54538, Jeonbuk, Republic of KoreaDepartment of Information and Communication Engineering, Chungbuk National University, Chungdae-ro 1, Seowon-gu, Cheongju 28644, Chungbuk, Republic of KoreaDepartment of Information and Communication Engineering, Chungbuk National University, Chungdae-ro 1, Seowon-gu, Cheongju 28644, Chungbuk, Republic of KoreaDepartment of Computer Engineering, Changwon National University, Changwondaehak-ro 20, Uichang-gu, Changwon 51140, Gyeongsangnam, Republic of KoreaDepartment of Information and Communication Engineering, Chungbuk National University, Chungdae-ro 1, Seowon-gu, Cheongju 28644, Chungbuk, Republic of KoreaDepartment of Information and Communication Engineering, Chungbuk National University, Chungdae-ro 1, Seowon-gu, Cheongju 28644, Chungbuk, Republic of KoreaAs various services have been generating large-scale graphs to represent multiple relationships between objects, studies have been conducted to obtain subgraphs with particular patterns. In this paper, we propose a distributed query processing method to efficiently search a subgraph for a large graph on Spark. To reduce unnecessary processing costs, the search order is determined by filtering scores using the probability distribution. The partitioned queries are searched in parallel in the distributed graph of each slave node according to the search order, and the local search results obtained from each slave node are combined and returned. The query is partitioned in triplets based on the determined search order. The performance of the proposed method is compared with the performance of existing methods to demonstrate its superiority.https://www.mdpi.com/2079-9292/12/17/3645subgraph querysearch orderdistributed graphfiltering score |
spellingShingle | Kyoungsoo Bok Minyoung Kim Hyeonbyeong Lee Dojin Choi Jongtae Lim Jaesoo Yoo Distributed Subgraph Query Processing Using Filtering Scores on Spark Electronics subgraph query search order distributed graph filtering score |
title | Distributed Subgraph Query Processing Using Filtering Scores on Spark |
title_full | Distributed Subgraph Query Processing Using Filtering Scores on Spark |
title_fullStr | Distributed Subgraph Query Processing Using Filtering Scores on Spark |
title_full_unstemmed | Distributed Subgraph Query Processing Using Filtering Scores on Spark |
title_short | Distributed Subgraph Query Processing Using Filtering Scores on Spark |
title_sort | distributed subgraph query processing using filtering scores on spark |
topic | subgraph query search order distributed graph filtering score |
url | https://www.mdpi.com/2079-9292/12/17/3645 |
work_keys_str_mv | AT kyoungsoobok distributedsubgraphqueryprocessingusingfilteringscoresonspark AT minyoungkim distributedsubgraphqueryprocessingusingfilteringscoresonspark AT hyeonbyeonglee distributedsubgraphqueryprocessingusingfilteringscoresonspark AT dojinchoi distributedsubgraphqueryprocessingusingfilteringscoresonspark AT jongtaelim distributedsubgraphqueryprocessingusingfilteringscoresonspark AT jaesooyoo distributedsubgraphqueryprocessingusingfilteringscoresonspark |