Estimating Frequency Distributions in Data Streams

Streaming algorithms allow for space-efficient processing of massive datasets. The distribution of the frequencies of items in a large dataset is often used to characterize that data: e.g., the data is heavy-tailed, the data follows a power law, or there are many elements that only appear only once...

Full description

Bibliographic Details
Main Author: Chen, Justin Y.
Other Authors: Indyk, Piotr
Format: Thesis
Published: Massachusetts Institute of Technology 2023
Online Access:https://hdl.handle.net/1721.1/150228