Query execution time estimation in graph databases based on graph neural networks

Query execution time estimation is an essential task for databases, accurate estimation results can help administrators to manage and monitor systems. This study proposes an interaction-aware and dependency-aware query execution time estimation approach that utilizes graph neural networks to capture...

Full description

Bibliographic Details
Main Authors: Zhenzhen He, Jiong Yu, Tiquan Gu, Dexian Yang
Format: Article
Language:English
Published: Elsevier 2024-04-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157824001071
_version_ 1797234079821725696
author Zhenzhen He
Jiong Yu
Tiquan Gu
Dexian Yang
author_facet Zhenzhen He
Jiong Yu
Tiquan Gu
Dexian Yang
author_sort Zhenzhen He
collection DOAJ
description Query execution time estimation is an essential task for databases, accurate estimation results can help administrators to manage and monitor systems. This study proposes an interaction-aware and dependency-aware query execution time estimation approach that utilizes graph neural networks to capture dependence and interaction relationships. We divide graph query execution time estimation tasks into three stages: workload generation and running, graph-based feature modeling and representation, training and estimation. Specifically, we generate query workloads and run them to collect the database and plan information when queries are executed. Then, the collected plan and database components are modeled into vertexes, the interaction and dependency between them are modeled into edges of graph-based feature representation. We develop an estimation model based on graph neural networks, in which the vertex embedding network is proposed to deal with the vertex heterogeneity, and the message passing network is proposed to aggregate the local representation into the global representation to obtain an embedding that can represent the higher-order feature information of the whole graph, and the estimation network is proposed to estimate execution times. The experiment results on datasets show that our estimation approach can improve estimation quality and outperform other estimation approaches in terms of estimation accuracy.
first_indexed 2024-04-24T16:26:22Z
format Article
id doaj.art-f3107d8cf9364559bef5657a375da5d3
institution Directory Open Access Journal
issn 1319-1578
language English
last_indexed 2024-04-24T16:26:22Z
publishDate 2024-04-01
publisher Elsevier
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj.art-f3107d8cf9364559bef5657a375da5d32024-03-31T04:37:11ZengElsevierJournal of King Saud University: Computer and Information Sciences1319-15782024-04-01364102018Query execution time estimation in graph databases based on graph neural networksZhenzhen He0Jiong Yu1Tiquan Gu2Dexian Yang3School of Information Science and Engineering, Xinjiang University, Urumqi 830049, China; Corresponding author.School of Information Science and Engineering, Xinjiang University, Urumqi 830049, China; School of Software, Xinjiang University, Urumqi 830046, ChinaSchool of Information Science and Engineering, Xinjiang University, Urumqi 830049, ChinaSchool of Information Science and Engineering, Xinjiang University, Urumqi 830049, ChinaQuery execution time estimation is an essential task for databases, accurate estimation results can help administrators to manage and monitor systems. This study proposes an interaction-aware and dependency-aware query execution time estimation approach that utilizes graph neural networks to capture dependence and interaction relationships. We divide graph query execution time estimation tasks into three stages: workload generation and running, graph-based feature modeling and representation, training and estimation. Specifically, we generate query workloads and run them to collect the database and plan information when queries are executed. Then, the collected plan and database components are modeled into vertexes, the interaction and dependency between them are modeled into edges of graph-based feature representation. We develop an estimation model based on graph neural networks, in which the vertex embedding network is proposed to deal with the vertex heterogeneity, and the message passing network is proposed to aggregate the local representation into the global representation to obtain an embedding that can represent the higher-order feature information of the whole graph, and the estimation network is proposed to estimate execution times. The experiment results on datasets show that our estimation approach can improve estimation quality and outperform other estimation approaches in terms of estimation accuracy.http://www.sciencedirect.com/science/article/pii/S1319157824001071Neo4j database management systemsDeep learningGraph neural networkGraph queriesExecution time estimation
spellingShingle Zhenzhen He
Jiong Yu
Tiquan Gu
Dexian Yang
Query execution time estimation in graph databases based on graph neural networks
Journal of King Saud University: Computer and Information Sciences
Neo4j database management systems
Deep learning
Graph neural network
Graph queries
Execution time estimation
title Query execution time estimation in graph databases based on graph neural networks
title_full Query execution time estimation in graph databases based on graph neural networks
title_fullStr Query execution time estimation in graph databases based on graph neural networks
title_full_unstemmed Query execution time estimation in graph databases based on graph neural networks
title_short Query execution time estimation in graph databases based on graph neural networks
title_sort query execution time estimation in graph databases based on graph neural networks
topic Neo4j database management systems
Deep learning
Graph neural network
Graph queries
Execution time estimation
url http://www.sciencedirect.com/science/article/pii/S1319157824001071
work_keys_str_mv AT zhenzhenhe queryexecutiontimeestimationingraphdatabasesbasedongraphneuralnetworks
AT jiongyu queryexecutiontimeestimationingraphdatabasesbasedongraphneuralnetworks
AT tiquangu queryexecutiontimeestimationingraphdatabasesbasedongraphneuralnetworks
AT dexianyang queryexecutiontimeestimationingraphdatabasesbasedongraphneuralnetworks