Optimization for Multi-Join Queries on the GPU

Multi-join queries are important operations in data management systems and data integration systems, and their efficiency has attracted the attention of researchers. In recent years, graphics processing units (GPUs) have developed rapidly and become a powerful tool for parallel computing, providing...

Full description

Bibliographic Details
Main Authors: Xue-Xuan Hu, Jian-Qing Xi, De-You Tang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9117127/
Description
Summary:Multi-join queries are important operations in data management systems and data integration systems, and their efficiency has attracted the attention of researchers. In recent years, graphics processing units (GPUs) have developed rapidly and become a powerful tool for parallel computing, providing a new idea for multi-join query optimization. This paper studies the use of GPU technology to optimize multi-join queries and focuses on two points: 1) a multi-phase optimization strategy and 2) optimization methods of each stage. For the first point, we discuss a two-phase optimization strategy on the GPU and prove the effectiveness of this strategy. For the second point, we provide an establishment method of a minimum cost join tree on the GPU, the parallel execution methods of intra-join and inter-join on the GPU, and a strategy of scheduling multiple joins to execute in parallel on the GPU. Experimental results show that the multi-join query optimization proposed in this paper improves the efficiency of multi-join queries, especially in the case of high load and complex join queries, achieving higher throughput than that of previous optimization algorithms.
ISSN:2169-3536