Looking for CATA program for tweet analysis
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it. - - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
At the point you hit >1 GB, you probably want to invest time in setting up an actual database and using programmatic means to analyze the data. I recommend checking out PostgreSQL, which has great features for text search and analysis. --- Alexander Leavitt PhD Candidate USC Annenberg School for Communication & Journalism http://alexleavitt.com Twitter: @alexleavitt On Aug 26, 2014 11:57 PM, "Dave Voelker" <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- -- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/
Linguistic Inquiry and Word Count (LIWC)? http://www.liwc.net/ On Wed, Aug 27, 2014 at 8:46 AM, MC Cambre <mcambre@ualberta.ca> wrote:
I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc
On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing
a
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
--
-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About
http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers:
NVivo is a possibility. I haven't used it for years but it was the top program for qualitative data analysis a while back. http://www.qsrinternational.com/products_nvivo.aspx _______ Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities ________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- -- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org Join the Association of Internet Researchers: http://www.aoir.org/ The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie
Many thanks for all your messages and suggestions. I'm now checking all the messages. I'll test the tools you have proposed and I promise to share which is the best option to extract these data from Facebook! *Cristina Aced Toledano* PhD Candidate Doctoral Programme in Knowledge and Information Society Universitat Oberta de Catalunya | caced@uoc.edu www.cristinaaced.com/blog | Twitter: @blogocorp 2014-08-27 15:08 GMT+02:00 Natalie Harrower <N.Harrower@ria.ie>:
NVivo is a possibility. I haven't used it for years but it was the top program for qualitative data analysis a while back.
http://www.qsrinternational.com/products_nvivo.aspx
_______
Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland
www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities
________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [ mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis
I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc
On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing
a
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
--
-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About
http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
Sorry, my last message was not in connection with this thread. Cristina 2014-08-27 16:45 GMT+02:00 Cristina Aced <blogocorp@gmail.com>:
Many thanks for all your messages and suggestions. I'm now checking all the messages. I'll test the tools you have proposed and I promise to share which is the best option to extract these data from Facebook!
*Cristina Aced Toledano* PhD Candidate Doctoral Programme in Knowledge and Information Society Universitat Oberta de Catalunya | caced@uoc.edu www.cristinaaced.com/blog | Twitter: @blogocorp
2014-08-27 15:08 GMT+02:00 Natalie Harrower <N.Harrower@ria.ie>:
NVivo is a possibility. I haven't used it for years but it was the top
program for qualitative data analysis a while back.
http://www.qsrinternational.com/products_nvivo.aspx
_______
Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland
www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities
________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [ mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis
I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc
On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases
representing a
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.
- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
--
-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About
http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
participants (6)
-
Alex Leavitt -
Cristina Aced -
Dave Voelker -
Guo Zhang Freeman -
MC Cambre -
Natalie Harrower