Historical Reddit Data Collection
Good afternoon all, I'm making a call for help regarding collecting reddit data from a particular sub-reddit (/r/teachers). I'm looking to collect all of the posts (+comments, replies, etc.) starting August 1, 2022 to present for a project I am working on. The problem is that I don't code, so the many tutorials about how to do it with Python aren't helpful for me. I have been using a great tool called commualytic (developed by some Canadian scholars - also does Twitter, Telegram, and CrowdTangle [Facebook/Insta]) that generates lovely CSV files, but due to a recent change in the Reddit Pushshift API, only the last 31 days of subreddit data are available. If there is anyone out there that could help me with this, point me in the direction of another tool that I could use, or a capable student or colleague that might be able to help, I would be much obliged. Many thanks in advance, Luc Cousineau Luc S. Cousineau, Ph.D. (he/him | they/them) Postdoctoral Fellow in the Université du Québec à Montréal's International Network on Technology, Work and Family (INTWAF)<https://intwaf.esg.uqam.ca/> Twitter<http://www.twitter.com/LucCousineau> | Website<https://luccousineauphd.ca/> Newest Publications: The Right to Disconnect: A Policy Innovation First Step<https://ssir.org/articles/entry/the_right_to_disconnect>. In Stanford Social Innovation Review<https://ssir.org/> We need to pay better attention to the ways people talk about incels. The Conversation. https://theconversation.ca/we-need-to-pay-better-attention-to-the-ways-peopl...
Hi, What’s your timeline? The Social Media Archive at ICPSR is setting up a Reddit collection, but it won’t be available until early 2023. I can talk to our staff about it when we’re back in January. Pushshift has a complete (AFAIK) Reddit record: http://files.pushshift.io/reddit/ You’ll need to be able to write or reuse some code to get what you’re looking for though. SOMAR can help with that too, but not for a few weeks at least. Libby -- Libby Hemphill pronouns: she/her/hers Director, Resource Center for Minority Data <http://www.icpsr.umich.edu/RCMD>, ICPSR <http://www.icpsr.umich.edu/icpsrweb/> Director, Social Media Archive <http://socialmediaarchive.org/>, ICPSR <http://www.icpsr.umich.edu/icpsrweb/> Associate Director, Center for Social Media Responsibility <http://csmr.umich.edu/> Research Associate Professor, Institute for Social Research <http://home.isr.umich.edu/> Associate Professor, School of Information <https://www.si.umich.edu/> University of Michigan Libby On Thu, Dec 29, 2022 at 12:08 PM Luc Cousineau via Air-L < air-l@listserv.aoir.org> wrote:
Good afternoon all,
I'm making a call for help regarding collecting reddit data from a particular sub-reddit (/r/teachers). I'm looking to collect all of the posts (+comments, replies, etc.) starting August 1, 2022 to present for a project I am working on. The problem is that I don't code, so the many tutorials about how to do it with Python aren't helpful for me. I have been using a great tool called commualytic (developed by some Canadian scholars - also does Twitter, Telegram, and CrowdTangle [Facebook/Insta]) that generates lovely CSV files, but due to a recent change in the Reddit Pushshift API, only the last 31 days of subreddit data are available.
If there is anyone out there that could help me with this, point me in the direction of another tool that I could use, or a capable student or colleague that might be able to help, I would be much obliged.
Many thanks in advance,
Luc Cousineau
Luc S. Cousineau, Ph.D. (he/him | they/them) Postdoctoral Fellow in the Université du Québec à Montréal's International Network on Technology, Work and Family (INTWAF)< https://intwaf.esg.uqam.ca/> Twitter<http://www.twitter.com/LucCousineau> | Website< https://luccousineauphd.ca/> Newest Publications: The Right to Disconnect: A Policy Innovation First Step< https://ssir.org/articles/entry/the_right_to_disconnect>. In Stanford Social Innovation Review<https://ssir.org/> We need to pay better attention to the ways people talk about incels. The Conversation.
https://theconversation.ca/we-need-to-pay-better-attention-to-the-ways-peopl...
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
participants (2)
-
Libby Hemphill -
Luc Cousineau