Caveat emptor, computational social science: Large-scale missing data in a widely-published Reddit corpus
As researchers use computational methods to study complex social behaviors at scale, the validity of this computational social science depends on the integrity of the data. On July 2, 2015, Jason Baumgartner published a dataset advertised to include “every publicly available Reddit comment” which wa...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: | PLOS One, 2020 |
Online Access: | https://hdl.handle.net/1721.1/123458 |