Caveat emptor, computational social science: Large-scale missing data in a widely-published Reddit corpus

As researchers use computational methods to study complex social behaviors at scale, the validity of this computational social science depends on the integrity of the data. On July 2, 2015, Jason Baumgartner published a dataset advertised to include “every publicly available Reddit comment” which wa...

Bibliographic Details
Main Authors: Gaffney, Devin; Matias, J. Nathan
Format: Article
Language: English
Published: PLOS One 2020
Online Access: https://hdl.handle.net/1721.1/123458