Instance-Optimized Database Indexes and Storage Layouts

For any modern database system, its physical design, which is composed of both the storage layout of the data itself and auxiliary data structures such as indexes, is a critical piece of maintaining high performance in the face of increasing data volumes. Existing physical design components are gene...

Full description

Bibliographic Details
Main Author: Ding, Jialin
Other Authors: Kraska, Tim
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/147396
_version_ 1826199470384087040
author Ding, Jialin
author2 Kraska, Tim
author_facet Kraska, Tim
Ding, Jialin
author_sort Ding, Jialin
collection MIT
description For any modern database system, its physical design, which is composed of both the storage layout of the data itself and auxiliary data structures such as indexes, is a critical piece of maintaining high performance in the face of increasing data volumes. Existing physical design components are general-purpose: they achieve adequate performance for the average use case but don’t achieve optimal performance for any individual use case. These physical design components expose numerous configuration knobs that users must manually tune to achieve better performance for their individual use case, but tuning complex systems is labor-intensive, and poor tuning can result in degraded performance and increased costs. In this thesis, we explore how database systems can maximize performance while minimizing manual effort through instance-optimization, which is the process of designing systems that are able to automatically self-adjust in order to achieve the best performance for a given use case. We leverage instance-optimization to introduce novel designs for database indexes and data storage layouts that outperform existing state-of-the-art indexes and data layouts by orders of magnitude. We also demonstrate how to incorporate multiple instance-optimized database components into an end-toend analytic database system that outperforms a well-tuned commercial cloud-based analytics system by up to 3×.
first_indexed 2024-09-23T11:20:25Z
format Thesis
id mit-1721.1/147396
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T11:20:25Z
publishDate 2023
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1473962023-01-20T03:40:00Z Instance-Optimized Database Indexes and Storage Layouts Ding, Jialin Kraska, Tim Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science For any modern database system, its physical design, which is composed of both the storage layout of the data itself and auxiliary data structures such as indexes, is a critical piece of maintaining high performance in the face of increasing data volumes. Existing physical design components are general-purpose: they achieve adequate performance for the average use case but don’t achieve optimal performance for any individual use case. These physical design components expose numerous configuration knobs that users must manually tune to achieve better performance for their individual use case, but tuning complex systems is labor-intensive, and poor tuning can result in degraded performance and increased costs. In this thesis, we explore how database systems can maximize performance while minimizing manual effort through instance-optimization, which is the process of designing systems that are able to automatically self-adjust in order to achieve the best performance for a given use case. We leverage instance-optimization to introduce novel designs for database indexes and data storage layouts that outperform existing state-of-the-art indexes and data layouts by orders of magnitude. We also demonstrate how to incorporate multiple instance-optimized database components into an end-toend analytic database system that outperforms a well-tuned commercial cloud-based analytics system by up to 3×. Ph.D. 2023-01-19T18:50:27Z 2023-01-19T18:50:27Z 2022-09 2022-10-19T19:08:09.217Z Thesis https://hdl.handle.net/1721.1/147396 In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/ application/pdf Massachusetts Institute of Technology
spellingShingle Ding, Jialin
Instance-Optimized Database Indexes and Storage Layouts
title Instance-Optimized Database Indexes and Storage Layouts
title_full Instance-Optimized Database Indexes and Storage Layouts
title_fullStr Instance-Optimized Database Indexes and Storage Layouts
title_full_unstemmed Instance-Optimized Database Indexes and Storage Layouts
title_short Instance-Optimized Database Indexes and Storage Layouts
title_sort instance optimized database indexes and storage layouts
url https://hdl.handle.net/1721.1/147396
work_keys_str_mv AT dingjialin instanceoptimizeddatabaseindexesandstoragelayouts