On the use of NAND flash memory in high-performance relational databases

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008.

Detalles Bibliográficos
Autor principal: Myers, Daniel (Daniel Sumers)
Otros Autores: Samuel R. Madden.
Formato: Tesis
Lenguaje:eng
Publicado: Massachusetts Institute of Technology 2008
Materias:
Acceso en línea:http://hdl.handle.net/1721.1/43070
_version_ 1826204044763332608
author Myers, Daniel (Daniel Sumers)
author2 Samuel R. Madden.
author_facet Samuel R. Madden.
Myers, Daniel (Daniel Sumers)
author_sort Myers, Daniel (Daniel Sumers)
collection MIT
description Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008.
first_indexed 2024-09-23T12:48:02Z
format Thesis
id mit-1721.1/43070
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T12:48:02Z
publishDate 2008
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/430702019-04-10T08:30:30Z On the use of NAND flash memory in high-performance relational databases Myers, Daniel (Daniel Sumers) Samuel R. Madden. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. Electrical Engineering and Computer Science. Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008. Includes bibliographical references (p. 47-49). High-density NAND flash storage has become relatively inexpensive due to the popularity of various consumer electronics. Recently, several manufacturers have released IDE-compatible NAND flash-based drives in sizes up to 64 GB at reasonable (sub-$1000) prices. Because flash is significantly more durable than mechanical hard drives and requires considerably less energy, there is some speculation that large data centers will adopt these devices. As database workloads make up a substantial fraction of the processing done by data centers, it is interesting to ask how switching to flash-based storage will affect the performance of database systems. We evaluate this question using IDE-based flash drives from two major manufacturers. We measure their read and write performance and find that flash has excellent random read performance, acceptable sequential read performance, and quite poor write performance compared to conventional IDE disks. We then consider how standard database algorithms are affected by these performance characteristics and find that the fast random read capability dramatically improves the performance of secondary indexes and index-based join algorithms. We next investigate using logstructured filesystems to mitigate the poor write performance of flash and find an 8.2x improvement in random write performance, but at the cost of a 3.7x decrease in random read performance. Finally, we study techniques for exploiting the inherent parallelism of multiple-chip flash devices, and we find that adaptive coding strategies can yield a 2x performance improvement over static ones. We conclude that in many cases flash disk performance is still worse than on traditional drives and that current flash technology may not yet be mature enough for widespread database adoption if performance is a dominant factor. Finally, we briefly speculate how this landscape may change based on expected performance of next-generation flash memories. by Daniel Myers. S.M. 2008-11-07T18:58:44Z 2008-11-07T18:58:44Z 2008 2008 Thesis http://hdl.handle.net/1721.1/43070 244250008 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 49 p. application/pdf Massachusetts Institute of Technology
spellingShingle Electrical Engineering and Computer Science.
Myers, Daniel (Daniel Sumers)
On the use of NAND flash memory in high-performance relational databases
title On the use of NAND flash memory in high-performance relational databases
title_full On the use of NAND flash memory in high-performance relational databases
title_fullStr On the use of NAND flash memory in high-performance relational databases
title_full_unstemmed On the use of NAND flash memory in high-performance relational databases
title_short On the use of NAND flash memory in high-performance relational databases
title_sort on the use of nand flash memory in high performance relational databases
topic Electrical Engineering and Computer Science.
url http://hdl.handle.net/1721.1/43070
work_keys_str_mv AT myersdanieldanielsumers ontheuseofnandflashmemoryinhighperformancerelationaldatabases