8 years ago
Download eight years' worth of Reddit comments
You can read every public comment ever posted on Reddit. Good luck with that. Those comments are saved as plain text, along with their authors' usernames, scores and subreddit locations, among other info. Archive.org even considered the feat notable enough to preserve for future generations.
Continue Reading http://www.engadget.com-
Complete Public Reddit Comments Corpus - archive.org
This is an archive of Reddit comments from October of 2007 until May of 2015 (complete month). This reflects 14 months of work and a lot of API calls. This dataset includes nearly every publicly available Reddit comment. Approximately 350,000 comments out of ~1.65 billion were unavailable.
I have every publicly available Reddit comment for research
I am currently doing a massive analysis of Reddit's entire publicly available comment dataset. The dataset is ~1.7 billion JSON objects complete with the comment, score, author, subreddit, position in comment tree and other fields that are available through Reddit's API.
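For a sense of what working with a dump like this might look like, here is a minimal Python sketch that streams one JSON object per line and tallies comments per subreddit. The one-object-per-line layout and the field names (`author`, `subreddit`, `score`, `body`) are assumptions for illustration, as are the sample records; the excerpt above only says that such fields exist.

```python
import json
from collections import Counter
from typing import Iterable


def tally_subreddits(lines: Iterable[str]) -> Counter:
    """Count comments per subreddit from newline-delimited JSON records."""
    counts: Counter = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # "subreddit" is an assumed field name, not confirmed by the excerpt.
        counts[record.get("subreddit", "<unknown>")] += 1
    return counts


# Made-up sample records standing in for lines read from a dump file.
sample = [
    '{"author": "alice", "subreddit": "science", "score": 3, "body": "Neat."}',
    '{"author": "bob", "subreddit": "funny", "score": 1, "body": "lol"}',
    '{"author": "carol", "subreddit": "science", "score": 7, "body": "Source?"}',
]
print(tally_subreddits(sample).most_common())
# → [('science', 2), ('funny', 1)]
```

Because the function accepts any iterable of lines, an open file handle could be passed in place of `sample`, so the whole dataset never has to sit in memory at once.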
Join the Discussion
Why would anyone want to read that? It's nothing but one liners and memes!
And reposts. It would be interesting to see exactly how many of those there are.
Can't you filter that on Karma Decay?
Easily, for a given post. But for 8 years' worth of data? Probably not. I wouldn't want to run a web query on a TB's worth of posts.
For science? We must make hard sacrifices for science! We must know how many mentions of "dickbutt", or something, IDK. ;)
But that knowledge is just too dangerous for mankind to have...
Oh god no...
Deletes account
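For what it's worth, the two questions raised in this thread (how many repeats, how many mentions of a given word) could in principle be answered with one local pass over the dump instead of a web query. The sketch below is only an illustration under assumptions: one JSON object per line, the comment text in a field called `body`, and "repeat" meaning identical text after lowercasing and collapsing whitespace. The function name and sample data are hypothetical.

```python
import hashlib
import json
from collections import Counter
from typing import Iterable, Tuple


def scan_comments(lines: Iterable[str], term: str) -> Tuple[int, int]:
    """Return (comments mentioning term, comments whose text was seen before)."""
    seen: Counter = Counter()
    mentions = 0
    for line in lines:
        if not line.strip():
            continue  # skip blank lines
        # "body" is an assumed field name for the comment text.
        text = json.loads(line).get("body", "")
        normalized = " ".join(text.lower().split())
        if term.lower() in normalized:
            mentions += 1
        # Store a fixed-size digest instead of the full comment text.
        seen[hashlib.sha1(normalized.encode("utf-8")).digest()] += 1
    # Every occurrence beyond the first of a given text counts as a repeat.
    repeats = sum(n - 1 for n in seen.values())
    return mentions, repeats


sample = [
    '{"body": "This."}',
    '{"body": "this."}',
    '{"body": "Came here to say   this."}',
    '{"body": "Nice"}',
]
print(scan_comments(sample, "this"))
# → (3, 1)
```

At the scale quoted above (well over a billion comments), an in-memory table of digests would likely be too large for a typical machine, so a real run would probably need to shard the data or keep the table on disk.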