Hi Xanat et al, I’ve also done some work in this area, in part using BigQuery but also some other tools. https://www.google.co.uk/amp/s/qz.com/1056319/what-is-the-alt-right-a-lingui... https://www.google.co.uk/amp/s/qz.com/1083444/analysis-of-500-million-reddit... https://www.google.co.uk/amp/s/qz.com/1092037/the-alt-right-is-creating-its-... You might also want to look at: https://www.google.co.uk/amp/s/fivethirtyeight.com/features/dissecting-trump... And some recent slides from a talk I did which have links to a bunch of useful tools in them: https://docs.google.com/presentation/d/1TffY4BCt0CHxifq6_0DMCTvNLZ76CPgCmzgN... All the best, TJM Sent from my iPhone
On 15 May 2018, at 13:32, kalev leetaru <kalev.leetaru5@gmail.com> wrote:
Xanat, in case it is of interest, Google has also regularly loaded the Reddit dataset into BigQuery (you get some amount of free quota per month to use the BQ service):
https://www.reddit.com/r/bigquery/comments/5z957b/more_than_3_billion_reddit...
They've done some neat example analyses with it:
https://medium.com/@hoffa/reddit-favorite-sources-the-most-linked-sites-expa... https://medium.com/@hoffa/which-subreddits-have-the-most-energy-how-upvotes-... https://medium.freecodecamp.com/reddit-uptime-2008-2016-bigquery-b3d7b11046e... https://medium.com/google-cloud/reddit-s-presidential-race-candidate-mention... https://medium.com/google-cloud/a-short-story-of-the-comments-on-reddit-from...
Also, a more complex example combining it with TensorFlow, Cloud Dataflow and my GDELT data:
https://cloud.google.com/blog/big-data/2018/03/predicting-community-engageme... https://cloud.google.com/blog/big-data/2018/03/predicting-community-engageme... https://cloud.google.com/blog/big-data/2018/03/predicting-community-engageme...
Even if you don't use BigQuery at your institution, some of the examples above might give you ideas on some of the at-scale analyses that can be done and combining it with TF, etc.
K
On Tue, May 15, 2018 at 7:59 AM, Deen Freelon <dfreelon@gmail.com> wrote:
Hi Xanat,
You're in luck--all Reddit posts and comments from the beginning are available to download here: http://files.pushshift.io/reddit/ . However, they're in JSON format so you'll need to learn how to parse that before you can access them. Best, /DEEN
On 5/15/2018 12:54 AM, Xanat Meza wrote:
Hello members! In our department, we are planning a study about the expression of phobias in social networks. I would like to know any software recommendations to download: a) Reddit posts and b) Information about Reddit users. If anyone has literature related to Reddit and psychology, we would be grateful too. Regards, Xanat V. Meza
Ph.D. candidate - Kansei, Behavioral and Brain SciencesUniversity of Tsukuba M.A. Media and Communication Yeungnam University B.D. Graphic Communication Design Universidad Autonoma Metropolitana
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- Deen Freelon, Ph.D. Associate Professor School of Media and Journalism, UNC-Chapel Hill http://dfreelon.org | @dfreelon <https://twitter.com/dfreelon> | https://github.com/dfreelon _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/