Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP

The data cube is a critical tool for accelerating online analysis in big data. Due to its exponential space overhead, the quotient cube, as the main data cube compression approach, was proposed to significantly reduce the number of data cells if they are aggregated over the same base tuple set, i.e....

Full description

Bibliographic Details
Main Authors: Quankun Wang, Jinguo You, Benyuan Zou, Yu Chen, Xingrui Huang, Lianyin Jia
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9570331/
_version_ 1828406426734690304
author Quankun Wang
Jinguo You
Benyuan Zou
Yu Chen
Xingrui Huang
Lianyin Jia
author_facet Quankun Wang
Jinguo You
Benyuan Zou
Yu Chen
Xingrui Huang
Lianyin Jia
author_sort Quankun Wang
collection DOAJ
description The data cube is a critical tool for accelerating online analysis in big data. Due to its exponential space overhead, the quotient cube, as the main data cube compression approach, was proposed to significantly reduce the number of data cells if they are aggregated over the same base tuple set, i.e. they are cover equivalent to form an equivalence class. Nevertheless, it still poses challenges to efficiently analyze massive data due to high storage space consumption. This paper proposes the reduced quotient cube (RQC) based on the following observation. (i) there are equivalence classes of various sizes in a quotient cube; (ii) the small equivalence classes usually dominate; (iii) the big equivalence classes are more capable of query answering since they can induce more data cells. Unlike the quotient cube, which preserves all the equivalence classes of equal priority, the reduced quotient cube preferentially does those with larger query answering capacity and smaller space occupied capacity. Further, we design its efficient constructing and querying algorithms. The extensive experimental results show that compared with the quotient cube, the reduced quotient cube space is only 11.3%, while the maximum query capacity is 95.9%. The query time of the reduced quotient cube is reduced by 51.24% on average compared to the quotient cube.
first_indexed 2024-12-10T11:11:30Z
format Article
id doaj.art-871a18fc99ad47718f31f365d9bb6993
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-12-10T11:11:30Z
publishDate 2021-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-871a18fc99ad47718f31f365d9bb69932022-12-22T01:51:24ZengIEEEIEEE Access2169-35362021-01-01914152414153510.1109/ACCESS.2021.31202789570331Reduced Quotient Cube: Maximize Query Answering Capacity in OLAPQuankun Wang0https://orcid.org/0000-0002-3478-7855Jinguo You1https://orcid.org/0000-0002-9118-3775Benyuan Zou2https://orcid.org/0000-0002-9164-2927Yu Chen3Xingrui Huang4Lianyin Jia5https://orcid.org/0000-0002-0269-9017Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaFaculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, ChinaThe data cube is a critical tool for accelerating online analysis in big data. Due to its exponential space overhead, the quotient cube, as the main data cube compression approach, was proposed to significantly reduce the number of data cells if they are aggregated over the same base tuple set, i.e. they are cover equivalent to form an equivalence class. Nevertheless, it still poses challenges to efficiently analyze massive data due to high storage space consumption. This paper proposes the reduced quotient cube (RQC) based on the following observation. (i) there are equivalence classes of various sizes in a quotient cube; (ii) the small equivalence classes usually dominate; (iii) the big equivalence classes are more capable of query answering since they can induce more data cells. Unlike the quotient cube, which preserves all the equivalence classes of equal priority, the reduced quotient cube preferentially does those with larger query answering capacity and smaller space occupied capacity. Further, we design its efficient constructing and querying algorithms. The extensive experimental results show that compared with the quotient cube, the reduced quotient cube space is only 11.3%, while the maximum query capacity is 95.9%. The query time of the reduced quotient cube is reduced by 51.24% on average compared to the quotient cube.https://ieeexplore.ieee.org/document/9570331/Data cubeequivalence classOLAPquotient cubestorage optimization
spellingShingle Quankun Wang
Jinguo You
Benyuan Zou
Yu Chen
Xingrui Huang
Lianyin Jia
Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
IEEE Access
Data cube
equivalence class
OLAP
quotient cube
storage optimization
title Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
title_full Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
title_fullStr Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
title_full_unstemmed Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
title_short Reduced Quotient Cube: Maximize Query Answering Capacity in OLAP
title_sort reduced quotient cube maximize query answering capacity in olap
topic Data cube
equivalence class
OLAP
quotient cube
storage optimization
url https://ieeexplore.ieee.org/document/9570331/
work_keys_str_mv AT quankunwang reducedquotientcubemaximizequeryansweringcapacityinolap
AT jinguoyou reducedquotientcubemaximizequeryansweringcapacityinolap
AT benyuanzou reducedquotientcubemaximizequeryansweringcapacityinolap
AT yuchen reducedquotientcubemaximizequeryansweringcapacityinolap
AT xingruihuang reducedquotientcubemaximizequeryansweringcapacityinolap
AT lianyinjia reducedquotientcubemaximizequeryansweringcapacityinolap