SageDB: A learned database system

© 2019 Conference on Innovative Data Systems Research (CIDR). All rights reserved. Modern data processing systems are designed to be general purpose, in that they can handle a wide variety of different schemas, data types, and data distributions, and aim to provide efficient access to that data via...

Full description

Bibliographic Details
Main Authors: Kraska, T, Alizadeh, M, Beutel, A, Chi, EH, Ding, J, Kristo, A, Leclerc, G, Madden, S, Mao, H, Nathan, V
Format: Article
Language:English
Published: 2021
Online Access:https://hdl.handle.net/1721.1/132282
_version_ 1811069377283358720
author Kraska, T
Alizadeh, M
Beutel, A
Chi, EH
Ding, J
Kristo, A
Leclerc, G
Madden, S
Mao, H
Nathan, V
author_facet Kraska, T
Alizadeh, M
Beutel, A
Chi, EH
Ding, J
Kristo, A
Leclerc, G
Madden, S
Mao, H
Nathan, V
author_sort Kraska, T
collection MIT
description © 2019 Conference on Innovative Data Systems Research (CIDR). All rights reserved. Modern data processing systems are designed to be general purpose, in that they can handle a wide variety of different schemas, data types, and data distributions, and aim to provide efficient access to that data via the use of optimizers and cost models. This general purpose nature results in systems that do not take advantage of the characteristics of the particular application and data of the user. With SageDB we present a vision towards a new type of a data processing system, one which highly specializes to an application through code synthesis and machine learning. By modeling the data distribution, workload, and hardware, SageDB learns the structure of the data and optimal access methods and query plans. These learned models are deeply embedded, through code synthesis, in essentially every component of the database. As such, SageDB presents radical departure from the way database systems are currently developed, raising a host of new problems in databases, machine learning and programming systems.
first_indexed 2024-09-23T08:10:03Z
format Article
id mit-1721.1/132282
institution Massachusetts Institute of Technology
language English
last_indexed 2024-09-23T08:10:03Z
publishDate 2021
record_format dspace
spelling mit-1721.1/1322822021-09-21T04:07:21Z SageDB: A learned database system Kraska, T Alizadeh, M Beutel, A Chi, EH Ding, J Kristo, A Leclerc, G Madden, S Mao, H Nathan, V © 2019 Conference on Innovative Data Systems Research (CIDR). All rights reserved. Modern data processing systems are designed to be general purpose, in that they can handle a wide variety of different schemas, data types, and data distributions, and aim to provide efficient access to that data via the use of optimizers and cost models. This general purpose nature results in systems that do not take advantage of the characteristics of the particular application and data of the user. With SageDB we present a vision towards a new type of a data processing system, one which highly specializes to an application through code synthesis and machine learning. By modeling the data distribution, workload, and hardware, SageDB learns the structure of the data and optimal access methods and query plans. These learned models are deeply embedded, through code synthesis, in essentially every component of the database. As such, SageDB presents radical departure from the way database systems are currently developed, raising a host of new problems in databases, machine learning and programming systems. 2021-09-20T18:21:39Z 2021-09-20T18:21:39Z 2021-01-11T16:32:16Z Article http://purl.org/eprint/type/ConferencePaper https://hdl.handle.net/1721.1/132282 en http://cidrdb.org/cidr2019/program.html CIDR 2019 - 9th Biennial Conference on Innovative Data Systems Research Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Other repository
spellingShingle Kraska, T
Alizadeh, M
Beutel, A
Chi, EH
Ding, J
Kristo, A
Leclerc, G
Madden, S
Mao, H
Nathan, V
SageDB: A learned database system
title SageDB: A learned database system
title_full SageDB: A learned database system
title_fullStr SageDB: A learned database system
title_full_unstemmed SageDB: A learned database system
title_short SageDB: A learned database system
title_sort sagedb a learned database system
url https://hdl.handle.net/1721.1/132282
work_keys_str_mv AT kraskat sagedbalearneddatabasesystem
AT alizadehm sagedbalearneddatabasesystem
AT beutela sagedbalearneddatabasesystem
AT chieh sagedbalearneddatabasesystem
AT dingj sagedbalearneddatabasesystem
AT kristoa sagedbalearneddatabasesystem
AT leclercg sagedbalearneddatabasesystem
AT maddens sagedbalearneddatabasesystem
AT maoh sagedbalearneddatabasesystem
AT nathanv sagedbalearneddatabasesystem