Streaming Similarity Search over One Billion Tweets Using Parallel Locality-Sensitive Hashing
Finding nearest neighbors has become an important operation on databases, with applications to text search, multimedia indexing, and many other areas. One popular algorithm for similarity search, especially for high-dimensional data (where spatial indexes like kd-trees do not perform well), is Locali...
Main Authors: | Sundaram, Narayanan; Turmukhametova, Aizana Z.; Satish, Nadathur; Mostak, Todd; Indyk, Piotr; Madden, Samuel R.; Dubey, Pradeep |
---|---|
Other Authors: | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory |
Format: | Article |
Language: | en_US |
Published: | Association for Computing Machinery (ACM), 2014 |
Online Access: | http://hdl.handle.net/1721.1/86923 https://orcid.org/0000-0002-7470-3265 https://orcid.org/0000-0002-7983-9524 |
Similar Items
- Diverse sampling of streaming data
  by: Turmukhametova, Aizana
  Published: (2014)
- Sketching via hashing: from heavy hitters to compressed sensing to sparse fourier transform
  by: Indyk, Piotr
  Published: (2014)
- Beyond locality-sensitive hashing
  by: Andoni, Alexandr, et al.
  Published: (2018)
- GenBase: A Complex Analytics Genomics Benchmark
  by: Taft, Rebecca, et al.
  Published: (2013)
- An online framework for civil unrest prediction using tweet stream based on tweet weight and event diffusion
  by: Islam, Md Kamrul, et al.
  Published: (2020)