AdaptDB: Adaptive Partitioning for Distributed Joins

Big data analytics often involves complex join queries over two or more tables. Such join processing is expensive in a distributed setting both because large amounts of data must be read from disk, and because of data shuffling across the network. Many techniques based on data partitioning have b...

Full description

Bibliographic Details
Main Authors: Jundal, Alekh, Lu, Yi, Shanbhag, Anil Atmanand, Madden, Samuel R
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Format: Article
Language:en_US
Published: Association for Computing Machinery (ACM) 2018
Online Access:http://hdl.handle.net/1721.1/116354
https://orcid.org/0000-0002-2718-9443
https://orcid.org/0000-0002-0925-1354
https://orcid.org/0000-0002-7470-3265