The pushshift reddit dataset
J Baumgartner, S Zannettou, B Keegan… - Proceedings of the …, 2020 - aaai.org
Proceedings of the international AAAI conference on web and social media, 2020•aaai.org
Social media data has become crucial to the advancement of scientific understanding.
However, even though it has become ubiquitous, just collecting large-scale social media
data involves a high degree of engineering skill set and computational resources. In fact,
research is often times gated by data engineering problems that must be overcome before
analysis can proceed. This has resulted recognition of datasets as meaningful research
contributions in and of themselves.
However, even though it has become ubiquitous, just collecting large-scale social media
data involves a high degree of engineering skill set and computational resources. In fact,
research is often times gated by data engineering problems that must be overcome before
analysis can proceed. This has resulted recognition of datasets as meaningful research
contributions in and of themselves.
Abstract
Social media data has become crucial to the advancement of scientific understanding. However, even though it has become ubiquitous, just collecting large-scale social media data involves a high degree of engineering skill set and computational resources. In fact, research is often times gated by data engineering problems that must be overcome before analysis can proceed. This has resulted recognition of datasets as meaningful research contributions in and of themselves.
aaai.org