Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries

Evaluating Regular Path Queries (RPQs) have been of interest since they were used as a powerful way to explore paths and patterns in graph databases. Traditional automata-based approaches are restricted in the graph size and/or highly complex queries, which causes a high evaluation cost (e.g., memor...

Full description

Bibliographic Details
Main Authors: Van-Quyet Nguyen, Van-Hau Nguyen, Minh-Quy Nguyen, Quyet-Thang Huynh, Kyungbaek Kim
Format: Article
Language:English
Published: MDPI AG 2021-04-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/10/9/990
_version_ 1797536915514195968
author Van-Quyet Nguyen
Van-Hau Nguyen
Minh-Quy Nguyen
Quyet-Thang Huynh
Kyungbaek Kim
author_facet Van-Quyet Nguyen
Van-Hau Nguyen
Minh-Quy Nguyen
Quyet-Thang Huynh
Kyungbaek Kim
author_sort Van-Quyet Nguyen
collection DOAJ
description Evaluating Regular Path Queries (RPQs) have been of interest since they were used as a powerful way to explore paths and patterns in graph databases. Traditional automata-based approaches are restricted in the graph size and/or highly complex queries, which causes a high evaluation cost (e.g., memory space and response time) on large graphs. Recently, although using the approach based on the threshold rare label for large graphs has been achieving some success, they could not often guarantee the minimum searching cost. Alternatively, the Unit-Subquery Cost Matrix (USCM) has been studied and obtained the viability of the usage of subqueries. Nevertheless, this method has an issue, which is, it does not cumulate the cost among subqueries that causes the long response time on a large graph. In order to overcome this issue, this paper proposes a method for estimating joining cost of subqueries to accelerate the USCM based parallel evaluation of RPQs on a large graph, namely USCM-Join. Through real-world datasets, we experimentally show that the USCM-Join outperforms others and estimating the joining cost enhances the USCM based approach up to around 20% in terms of response time.
first_indexed 2024-03-10T12:07:39Z
format Article
id doaj.art-76c8d580c5a44919835b95d2b719251e
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T12:07:39Z
publishDate 2021-04-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-76c8d580c5a44919835b95d2b719251e2023-11-21T16:29:07ZengMDPI AGElectronics2079-92922021-04-0110999010.3390/electronics10090990Efficiently Estimating Joining Cost of Subqueries in Regular Path QueriesVan-Quyet Nguyen0Van-Hau Nguyen1Minh-Quy Nguyen2Quyet-Thang Huynh3Kyungbaek Kim4Faculty of Information Technology, Hung Yen University of Technology and Education, Hung Yen 160000, VietnamFaculty of Information Technology, Hung Yen University of Technology and Education, Hung Yen 160000, VietnamFaculty of Information Technology, Hung Yen University of Technology and Education, Hung Yen 160000, VietnamSchool of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, VietnamDepartment of Artificial Intelligence Convergence, Chonnam National University, Gwangju 61186, KoreaEvaluating Regular Path Queries (RPQs) have been of interest since they were used as a powerful way to explore paths and patterns in graph databases. Traditional automata-based approaches are restricted in the graph size and/or highly complex queries, which causes a high evaluation cost (e.g., memory space and response time) on large graphs. Recently, although using the approach based on the threshold rare label for large graphs has been achieving some success, they could not often guarantee the minimum searching cost. Alternatively, the Unit-Subquery Cost Matrix (USCM) has been studied and obtained the viability of the usage of subqueries. Nevertheless, this method has an issue, which is, it does not cumulate the cost among subqueries that causes the long response time on a large graph. In order to overcome this issue, this paper proposes a method for estimating joining cost of subqueries to accelerate the USCM based parallel evaluation of RPQs on a large graph, namely USCM-Join. Through real-world datasets, we experimentally show that the USCM-Join outperforms others and estimating the joining cost enhances the USCM based approach up to around 20% in terms of response time.https://www.mdpi.com/2079-9292/10/9/990graph queriesUSCMparallel evaluationestimating joining cost
spellingShingle Van-Quyet Nguyen
Van-Hau Nguyen
Minh-Quy Nguyen
Quyet-Thang Huynh
Kyungbaek Kim
Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
Electronics
graph queries
USCM
parallel evaluation
estimating joining cost
title Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
title_full Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
title_fullStr Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
title_full_unstemmed Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
title_short Efficiently Estimating Joining Cost of Subqueries in Regular Path Queries
title_sort efficiently estimating joining cost of subqueries in regular path queries
topic graph queries
USCM
parallel evaluation
estimating joining cost
url https://www.mdpi.com/2079-9292/10/9/990
work_keys_str_mv AT vanquyetnguyen efficientlyestimatingjoiningcostofsubqueriesinregularpathqueries
AT vanhaunguyen efficientlyestimatingjoiningcostofsubqueriesinregularpathqueries
AT minhquynguyen efficientlyestimatingjoiningcostofsubqueriesinregularpathqueries
AT quyetthanghuynh efficientlyestimatingjoiningcostofsubqueriesinregularpathqueries
AT kyungbaekkim efficientlyestimatingjoiningcostofsubqueriesinregularpathqueries