Compression and query execution within column oriented databases

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005.

Bibliographic Details
Main Author: Ferreira, Miguel C. (Miguel Cacela Rosa Lopes Ferreira)
Other Authors: Samuel Madden.
Format: Thesis
Language: eng
Published: Massachusetts Institute of Technology 2006
Subjects: Electrical Engineering and Computer Science.
Online Access: http://hdl.handle.net/1721.1/33150
Summary: Compression is a known technique used by many database management systems ("DBMS") to increase performance [4, 5, 14]. However, little research has examined how compression can be used within column-oriented architectures. Storing data in columns increases the similarity between adjacent records, thus increasing the compressibility of the data. In addition, compression schemes not traditionally used in row-oriented DBMSs can be applied to column-oriented systems. This thesis presents a column-oriented query executor designed to operate directly on compressed data. We show that operating directly on compressed data can improve query performance. Additionally, the choice of compression scheme depends on the expected query workload, suggesting that for ad-hoc queries we may wish to store a column redundantly under different coding schemes. Furthermore, the executor is designed to be extensible so that the addition of new compression schemes does not impact operator implementation. The executor is part of a larger database system, known as CStore [10].
Department: Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Notes: Includes bibliographical references (p. 65-66). By Miguel C. Ferreira.
Degree: M.Eng.
Physical Description: 66 p.
Dates: 2005; 2006-06-19T17:45:29Z
Identifiers: http://hdl.handle.net/1721.1/33150; 62256545
Files: 3410300 bytes, 3412430 bytes (application/pdf)
Rights: M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582