Looking for CATA program for tweet analysis

newer
Re: [Air-L] Software to extract...

older
2014 ASIS&T Annual Meeting now open

Dave Voelker

27 Aug 2014 27 Aug '14

6:57 a.m.

I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it. - - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

Show replies by date

Alex Leavitt

27 Aug 27 Aug

7:07 a.m.

At the point you hit >1 GB, you probably want to invest time in setting up an actual database and using programmatic means to analyze the data. I recommend checking out PostgreSQL, which has great features for text search and analysis. --- Alexander Leavitt PhD Candidate USC Annenberg School for Communication & Journalism http://alexleavitt.com Twitter: @alexleavitt On Aug 26, 2014 11:57 PM, "Dave Voelker" <dvoelker@stanford.edu> wrote:

...

I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

MC Cambre

12:46 p.m.

I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:

...

I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

Guo Zhang Freeman

12:56 p.m.

Linguistic Inquiry and Word Count (LIWC)? http://www.liwc.net/ On Wed, Aug 27, 2014 at 8:46 AM, MC Cambre <mcambre@ualberta.ca> wrote:

...

I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc

On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:

...
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing

a

...
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

--

-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About

http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers:

Natalie Harrower

1:08 p.m.

NVivo is a possibility. I haven't used it for years but it was the top program for qualitative data analysis a while back. http://www.qsrinternational.com/products_nvivo.aspx _______ Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities ________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:

...

I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing a concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

-- -- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org Join the Association of Internet Researchers: http://www.aoir.org/ The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie

Cristina Aced

2:45 p.m.

Many thanks for all your messages and suggestions. I'm now checking all the messages. I'll test the tools you have proposed and I promise to share which is the best option to extract these data from Facebook! *Cristina Aced Toledano* PhD Candidate Doctoral Programme in Knowledge and Information Society Universitat Oberta de Catalunya | caced@uoc.edu www.cristinaaced.com/blog | Twitter: @blogocorp 2014-08-27 15:08 GMT+02:00 Natalie Harrower <N.Harrower@ria.ie>:

...

NVivo is a possibility. I haven't used it for years but it was the top program for qualitative data analysis a while back.

http://www.qsrinternational.com/products_nvivo.aspx

_______

Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland

www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities

________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [ mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis

I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc

On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:

...
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases representing

a

...
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

--

-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About

http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

Cristina Aced

2:48 p.m.

Sorry, my last message was not in connection with this thread. Cristina 2014-08-27 16:45 GMT+02:00 Cristina Aced <blogocorp@gmail.com>:

...

Many thanks for all your messages and suggestions. I'm now checking all the messages. I'll test the tools you have proposed and I promise to share which is the best option to extract these data from Facebook!

*Cristina Aced Toledano* PhD Candidate Doctoral Programme in Knowledge and Information Society Universitat Oberta de Catalunya | caced@uoc.edu www.cristinaaced.com/blog | Twitter: @blogocorp

2014-08-27 15:08 GMT+02:00 Natalie Harrower <N.Harrower@ria.ie>:

NVivo is a possibility. I haven't used it for years but it was the top

...
program for qualitative data analysis a while back.

http://www.qsrinternational.com/products_nvivo.aspx

_______

Dr. Natalie Harrower Manager, Education & Outreach Digital Repository of Ireland Royal Irish Academy 19 Dawson St., Dublin 2 www.dri.ie Twitter: @dri_ireland

www.ria.ie The Royal Irish Academy/Acadamh Ríoga na hÉireann Ireland's Academy for the sciences and humanities

________________________________________ From: Air-L [air-l-bounces@listserv.aoir.org] on behalf of MC Cambre [ mcambre@ualberta.ca] Sent: Wednesday, August 27, 2014 1:46 PM To: Dave Voelker Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Looking for CATA program for tweet analysis

I am not sure what it is called but I know that linguists are using a software like that for corpus analysis. cc

On Wed, Aug 27, 2014 at 2:57 AM, Dave Voelker <dvoelker@stanford.edu> wrote:

...
I'd like to know if someone can recommend -- or knows of -- any computer assisted text analysis (CATA) programs capable of applying predefined concept dictionaries (lists of words and phrases

representing a

...
concept) to large datasets of tweets. I'm referring here to traditional, conceptual, content analysis (social science), not the bottom-up, data mining methods (computer science) primarily being used to analyze big data. An example of the kind of program I'm talking about is the freeware Yoshikoder (http://sourceforge.net/p/yoshikoder/wiki/Home/), but I'm unable to open a large (~ 3 GB) text file of tweets with it.

- - - - - - - - - - - - - - David Voelker, Ph.D. Department of Communication Stanford University http://comm.stanford.edu/faculty/voelker/

_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

--

-- Carolina Cambre PhD Assistant Professor, King's University College @ Western University http://ualberta.academia.edu/mariacarolinacambre/About

http://www.bloomsbury.com/us/the-semiotics-of-che-guevara-9781472505293/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

The Royal Irish Academy is subject to the Freedom of Information Acts 1997 & 2003 and is compliant with the provisions of the Data Protection Acts 1988 & 2003. For further information see our website www.ria.ie _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org

Join the Association of Internet Researchers: http://www.aoir.org/

4327

Age (days ago)

4327

Last active (days ago)

List overview

Download

6 comments

6 participants

participants (6)

Alex Leavitt
Cristina Aced
Dave Voelker
Guo Zhang Freeman
MC Cambre
Natalie Harrower