Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory

New data storage technologies such as the recently introduced Intel® Optane™ DC Persistent Memory Module (PMM) offer exciting opportunities for optimizing the query processing performance of database workloads. In particular, the unique combination of low latency, byte-addressability, persistence, a...

Full description

Bibliographic Details
Main Authors: Shanbhag, Anil Atmanand, Tatbul Bitim, Emine Nesime, Cohen, David, Madden, Samuel R
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Published: Association for Computing Machinery (ACM) 2021
Online Access:https://hdl.handle.net/1721.1/130104
_version_ 1826204874865377280
author Shanbhag, Anil Atmanand
Tatbul Bitim, Emine Nesime
Cohen, David
Madden, Samuel R
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Shanbhag, Anil Atmanand
Tatbul Bitim, Emine Nesime
Cohen, David
Madden, Samuel R
author_sort Shanbhag, Anil Atmanand
collection MIT
description New data storage technologies such as the recently introduced Intel® Optane™ DC Persistent Memory Module (PMM) offer exciting opportunities for optimizing the query processing performance of database workloads. In particular, the unique combination of low latency, byte-addressability, persistence, and large capacity make persistent memory (PMem) an attractive alternative along with DRAM and SSDs. Exploring the performance characteristics of this new medium is the first critical step in understanding how it will impact the design and performance of database systems. In this paper, we present one of the first experimental studies on characterizing Intel® Optane™ DC PMM's performance behavior in the context of analytical database workloads. First, we analyze basic access patterns common in such workloads, such as sequential, selective, and random reads as well as the complete Star Schema Benchmark, comparing standalone DRAM- and PMem-based implementations. Then we extend our analysis to join algorithms over larger datasets, which require using DRAM and PMem in a hybrid fashion while paying special attention to the read-write asymmetry of PMem. Our study reveals interesting performance tradeoffs that can help guide the design of next-generation OLAP systems in presence of persistent memory in the storage hierarchy.
first_indexed 2024-09-23T13:02:27Z
format Article
id mit-1721.1/130104
institution Massachusetts Institute of Technology
last_indexed 2024-09-23T13:02:27Z
publishDate 2021
publisher Association for Computing Machinery (ACM)
record_format dspace
spelling mit-1721.1/1301042022-09-28T11:41:49Z Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory Shanbhag, Anil Atmanand Tatbul Bitim, Emine Nesime Cohen, David Madden, Samuel R Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory New data storage technologies such as the recently introduced Intel® Optane™ DC Persistent Memory Module (PMM) offer exciting opportunities for optimizing the query processing performance of database workloads. In particular, the unique combination of low latency, byte-addressability, persistence, and large capacity make persistent memory (PMem) an attractive alternative along with DRAM and SSDs. Exploring the performance characteristics of this new medium is the first critical step in understanding how it will impact the design and performance of database systems. In this paper, we present one of the first experimental studies on characterizing Intel® Optane™ DC PMM's performance behavior in the context of analytical database workloads. First, we analyze basic access patterns common in such workloads, such as sequential, selective, and random reads as well as the complete Star Schema Benchmark, comparing standalone DRAM- and PMem-based implementations. Then we extend our analysis to join algorithms over larger datasets, which require using DRAM and PMem in a hybrid fashion while paying special attention to the read-write asymmetry of PMem. Our study reveals interesting performance tradeoffs that can help guide the design of next-generation OLAP systems in presence of persistent memory in the storage hierarchy. 2021-03-08T21:38:09Z 2021-03-08T21:38:09Z 2020-06 Article http://purl.org/eprint/type/ConferencePaper 9781450380249 https://hdl.handle.net/1721.1/130104 Shanbhag, Anil et al. "Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory." DaMoN '20: Proceedings of the 16th International Workshop on Data Management on New Hardware, June 2020, Portland, Oregon, Association for Computing Machinery, June 2020. http://dx.doi.org/10.1145/3399666.3399933 DaMoN '20: Proceedings of the 16th International Workshop on Data Management on New Hardware Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf Association for Computing Machinery (ACM) Prof. Madden via Phoebe Ayers
spellingShingle Shanbhag, Anil Atmanand
Tatbul Bitim, Emine Nesime
Cohen, David
Madden, Samuel R
Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title_full Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title_fullStr Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title_full_unstemmed Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title_short Large-scale in-memory analytics on Intel ® Optane™ DC persistent memory
title_sort large scale in memory analytics on intel r optane™ dc persistent memory
url https://hdl.handle.net/1721.1/130104
work_keys_str_mv AT shanbhaganilatmanand largescaleinmemoryanalyticsoninteloptanedcpersistentmemory
AT tatbulbitimeminenesime largescaleinmemoryanalyticsoninteloptanedcpersistentmemory
AT cohendavid largescaleinmemoryanalyticsoninteloptanedcpersistentmemory
AT maddensamuelr largescaleinmemoryanalyticsoninteloptanedcpersistentmemory