Materialization Strategies in a Column-Oriented DBMS

There has been renewed interest in column-oriented database architectures in recent years. For read-mostly query workloads such as those found in data warehouse and decision support applications, ``column-stores'' have been shown to perform particularly well relative to ``row-stores.'...

Full description

Bibliographic Details
Main Authors: Abadi, Daniel J., Myers, Daniel S., DeWitt, David J., Madden, Samuel R.
Other Authors: Sam Madden
Language:en_US
Published: 2006
Online Access:http://hdl.handle.net/1721.1/34929
_version_ 1826211951507668992
author Abadi, Daniel J.
Myers, Daniel S.
DeWitt, David J.
Madden, Samuel R.
author2 Sam Madden
author_facet Sam Madden
Abadi, Daniel J.
Myers, Daniel S.
DeWitt, David J.
Madden, Samuel R.
author_sort Abadi, Daniel J.
collection MIT
description There has been renewed interest in column-oriented database architectures in recent years. For read-mostly query workloads such as those found in data warehouse and decision support applications, ``column-stores'' have been shown to perform particularly well relative to ``row-stores.'' In order for column-stores to be readily adopted as a replacement for row-stores, however, they must present the same interface to client applications as do row stores, which implies that they must output row-store-style tuples.Thus, the input columns stored on disk must be converted to rows at some point in the query plan, but the optimal point at which to do the conversion is not obvious. This problem can be considered as the opposite of the projection problem in row-store systems: while row-stores need to determine where in query plans to place projection operators to make tuples narrower, column-stores need to determine when to combine single-column projections into wider tuples. This paper describes a variety of strategies for tuple construction and intermediate result representations and provides a systematic evaluation of these strategies.
first_indexed 2024-09-23T15:13:41Z
id mit-1721.1/34929
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T15:13:41Z
publishDate 2006
record_format dspace
spelling mit-1721.1/349292019-04-11T09:50:23Z Materialization Strategies in a Column-Oriented DBMS Abadi, Daniel J. Myers, Daniel S. DeWitt, David J. Madden, Samuel R. Sam Madden Database There has been renewed interest in column-oriented database architectures in recent years. For read-mostly query workloads such as those found in data warehouse and decision support applications, ``column-stores'' have been shown to perform particularly well relative to ``row-stores.'' In order for column-stores to be readily adopted as a replacement for row-stores, however, they must present the same interface to client applications as do row stores, which implies that they must output row-store-style tuples.Thus, the input columns stored on disk must be converted to rows at some point in the query plan, but the optimal point at which to do the conversion is not obvious. This problem can be considered as the opposite of the projection problem in row-store systems: while row-stores need to determine where in query plans to place projection operators to make tuples narrower, column-stores need to determine when to combine single-column projections into wider tuples. This paper describes a variety of strategies for tuple construction and intermediate result representations and provides a systematic evaluation of these strategies. 2006-11-28T19:34:39Z 2006-11-28T19:34:39Z 2006-11-27 MIT-CSAIL-TR-2006-078 http://hdl.handle.net/1721.1/34929 Extension of Publication (of the same title) in the Proceedings of ICDE 2007 en_US Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory 13 p. 952582 bytes 2497163 bytes application/pdf application/postscript application/pdf application/postscript
spellingShingle Abadi, Daniel J.
Myers, Daniel S.
DeWitt, David J.
Madden, Samuel R.
Materialization Strategies in a Column-Oriented DBMS
title Materialization Strategies in a Column-Oriented DBMS
title_full Materialization Strategies in a Column-Oriented DBMS
title_fullStr Materialization Strategies in a Column-Oriented DBMS
title_full_unstemmed Materialization Strategies in a Column-Oriented DBMS
title_short Materialization Strategies in a Column-Oriented DBMS
title_sort materialization strategies in a column oriented dbms
url http://hdl.handle.net/1721.1/34929
work_keys_str_mv AT abadidanielj materializationstrategiesinacolumnorienteddbms
AT myersdaniels materializationstrategiesinacolumnorienteddbms
AT dewittdavidj materializationstrategiesinacolumnorienteddbms
AT maddensamuelr materializationstrategiesinacolumnorienteddbms