I would be very happy to talk to anyone who has managed to capture content from a Facebook page for NVivo. I am specifically interested in Facebook pages as a dataset (not groups or conversations, and not as a PDF).
Many thanks in advance!
Mona Arslan, Teacher Assistant (AASTM) and PhD Candidate (Digital Media) Founder of The Egyptian Social Media Initiative @monarslan
I have done this a few times. What is the issue?
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
Hi Mona, I work for QSR, the developers of NVivo. I would be happy to talk to you off-line about using Facebook and NVivo.
Best wishes, Silvana
Silvana di Gregorio, PhD QSR Research, Director QSR International (UK) Limited Vanguard House, Keckwick Lane Daresbury, Cheshire WA4 4AB United Kingdom T +44 (0)1925 357 960 D +44 (0)1925 357 962 F +44 (0)1925 357 980 E s.digregorio@qsrinternational.com qsrinternational.com
Our research group at the University of Brasilia (Brazil) has used NVivo/NCapture to capture and analyze data from Facebook pages, but has recently run into two serious issues:
a) The dataset won't open in NVivo. When we asked for help, NVivo personnel said this happened because of date problems, which we have not been able to solve. We switched to Netvizz and then opened the resulting dataset in NVivo to do content analysis of posts.
b) We have noticed differences between the amount of information gathered by NCapture and by Netvizz. There are whole periods in which posts were not collected by NCapture (and we confirmed that the data is there), which surprised us, because we had not had this problem before.
I am happy to talk to you about these and other issues.
Marisa
-- Marisa von Bülow Professora Associada/Professor Instituto de Ciência Política/Political Science Institute IPOL - UnB/University of Brasilia
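A minimal sketch of how the gaps described in (b) could be located, assuming both tools' results have been exported to CSV. The column names `post_id` and `created_time`, the helper names, and the sample rows are all hypothetical; real NCapture or Netvizz exports may use different headers and date formats.

```python
import csv
import io
from collections import Counter


def load_posts(csv_file, id_col="post_id", date_col="created_time"):
    """Return {post_id: 'YYYY-MM'} from an open CSV file object.

    Assumes ISO-style timestamps (e.g. 2017-02-11T09:30:00); the column
    names are placeholders, not the headers of any particular tool.
    """
    posts = {}
    for row in csv.DictReader(csv_file):
        # Keep only the year-month prefix of the timestamp.
        posts[row[id_col]] = row[date_col][:7]
    return posts


def missing_by_month(reference, candidate):
    """Count posts per month that are in `reference` but not in `candidate`."""
    missing_ids = set(reference) - set(candidate)
    return Counter(reference[post_id] for post_id in missing_ids)


# Tiny in-memory stand-ins for two tool exports (invented sample data).
export_a = io.StringIO(
    "post_id,created_time\n"
    "1,2017-01-05T10:00:00\n"
    "2,2017-02-11T09:30:00\n"
    "3,2017-02-20T18:45:00\n"
    "4,2017-03-02T12:00:00\n"
)
export_b = io.StringIO(
    "post_id,created_time\n"
    "1,2017-01-05T10:00:00\n"
    "4,2017-03-02T12:00:00\n"
)

gaps = missing_by_month(load_posts(export_a), load_posts(export_b))
for month, count in sorted(gaps.items()):
    print(month, count)
```

Running the comparison in both directions (swapping the two exports) shows whether one tool is a strict subset of the other or whether each is missing posts the other collected.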
Hi, I would strongly advise against using NVivo to scrape Facebook data. I have seen the same issues that Marisa identified: missing data and import problems. Moreover, the NCapture tool only talks to NVivo, so you're not free to analyze the data elsewhere. People tend to have good results with Netvizz. I use Facepager, which is open source software created by and for academics. It requires a little knowledge of the APIs to make the right calls, but it works like a charm and exports results to CSV.
best,
Patricia Rossini Postdoctoral researcher | School of Information Studies Syracuse University www.patriciarossini.com <http://www.patriciarossini.com/>
Hi, I've been using Facepager for some time and I love it. It does require a little knowledge of how to make the calls, but it's a good tool to invest time in. There are some issues with the Mac version, and you need a little knowledge of the different API versions to get reactions downloaded correctly. I do agree with Bernhard that working with Facebook data is really complicated because of the lack of transparency.
best,
Fábio C Gouveia Scientometrics and Altmetrics Researcher Fundação Oswaldo Cruz - Brazil fgouveiafiocruz@gmail.com
If you're not afraid of a little Python, here's another data collection alternative (written by yours truly): https://github.com/dfreelon/fb_scrape_public
Just two lines of code and you're off to the races. It also collects all the new reactions ("haha," "love," etc.) and is robust to network errors. It's free, too, which helps.
/DEEN
-- Deen Freelon, Ph.D. Associate Professor School of Communication, American University Office: McKinley 325 freelon@american.edu | http://dfreelon.org | @dfreelon <https://twitter.com/dfreelon> New report: Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice <http://www.cmsimpact.org/blmreport>
Yes, Deen! Our research group has turned to Python (it could have been R, Maurice, but we chose Python for now) to collect data on Twitter, and we are now trying to learn how to use it for Facebook. Not easy... We will definitely check out your code, many thanks for sharing!
However, even Python and R scraping face the issues pointed out by Bernhard Rieder, don't they? I mean, we are still dealing with incomplete datasets, and we can only speculate about what is missing. Right? Have you followed his suggestion, that is, have you compared the data retrieved using Python, R, Netvizz ...?
Marisa
-- Marisa von Bülow Professora Associada/Professor Instituto de Ciência Política/Political Science Institute IPOL - UnB/University of Brasilia
Complete datasets will never be (and never were) possible for Facebook because of the degraded function of the Open Graph API. They are somewhat possible using Gnip-enabled services for historical Twitter, as long as you discount deleted Tweets in your measure of completeness, and they are truly possible with Gnip real-time services for Twitter data.
I would add to the discussion of completeness, specifically with respect to content analysis: unless you review the data using the Twitter or Facebook display, it is degraded and incomplete. A spreadsheet of Twitter or Facebook posts is a poor approximation of the rich visual content enabled in the live display. In many cases, the meaning of a Tweet or a Facebook post is incomprehensible without the visual referent that prompted the post.
~Stu
-- Dr. Stuart W. Shulman Founder and CEO, Texifter LinkedIn: http://www.linkedin.com/in/stuartwshulman
If the pages you're scraping from are public, you should be able to pull everything you can see on the web through the API. The main issue you run into re: incompleteness is with comments, as FB only allows apps to pull comments from users whose posts are world-readable. To do otherwise would be an unacceptable breach of privacy, so that limitation is a good thing. To Stu's point--yes, it's true that the understanding of some social media posts is not complete without the visual content. But the metadata for both Facebook and Twitter contain links to any pictures and video that may be present, which can be viewed for qualitative analysis should the researcher wish. What's difficult is doing so at scale, of course... Finally, let me lovingly thumb my nose at the R partisans on this thread--you can do excellent analysis with R or Python, and there are plenty of great research-grade libraries available for both (I say this as a user of both). R does have more advanced stats packages but Python is better at preprocessing and transforming text data at scale. I liken it to the Mac vs. PC debate--everyone's got their favorite and there's no objectively correct answer... /DEEN On 4/20/2017 7:54 AM, Marisa von Bülow wrote:
Yes, Deen! Our research group has turned to Python (could have been R, Maurice, but we chose Python for now) to collect data on Twitter and we are now trying to learn how to do use it for Facebook. Not easy... we will definitely check out your code, many thanks for sharing!
However, even Python and R scraping face the issues pointed out by Bernhard Rieder, don't they? I mean, we are still dealing with incomplete datasets, which we can only speculate about. Right? Have you followed his suggestion, that is, have you compared retrieved data using Python, R, Netvizz ...?
Marisa
On Wed, Apr 19, 2017 at 10:49 PM, Deen Freelon <dfreelon@gmail.com <mailto:dfreelon@gmail.com>> wrote:
If you're not afraid of a little Python, here's another data collection alternative (written by yours truly): https://github.com/dfreelon/fb_scrape_public <https://github.com/dfreelon/fb_scrape_public>
Just two lines of code and you're off to the races, plus it collects all the new reactions ("haha," "love," etc) and is robust to network errors. It's also free, which helps. /DEEN
On 4/19/2017 1:39 PM, Fabio Gouveia wrote:
Hi,
I´ve been using Facepager for some time and I love it. It really requires a little bit of knowledge on how to make the calls, but it´s really a good tool to invest time in. Some issues on the Mac version and a little bit of knowledge on the different versions of the API to get reactions downloaded right. I do agree with Bernhard that working with facebook data, due to the lack of transparency, it´s really complicated.
best,
Fábio C Gouveia Scientrometrics and Altmetrics Researcher Fundação Oswaldo Cruz - Brazil fgouveiafiocruz@gmail.com <mailto:fgouveiafiocruz@gmail.com>
2017-04-19 14:04 GMT-03:00 Patricia Rossini <patyrossini@gmail.com <mailto:patyrossini@gmail.com>>:
Hi,
I would strongly advise against using NVivo to scrape Facebook data - I have seen the same issues that Marisa has identified: missing data and problems importing. Moreover, the NCapture tool only talks to NVivo, so you're not 'free' to analyze the data elsewhere. People tend to have good results with Netvizz. I use Facepager, which is open-source software created by and for academics. It requires a little knowledge of the APIs to make the right calls, but works like a charm and exports results to CSV.
best,
Patricia Rossini Postdoctoral researcher | School of Information Studies Syracuse University www.patriciarossini.com
On 19 Apr 2017, at 12:43, Marisa von Bülow <marisavonbulow@gmail.com> wrote:
Our research group at the University of Brasilia (Brazil) has used NVivo/NCapture to capture and analyze data of Facebook pages, but recently has bumped into two serious issues:
a) the dataset won't open in NVivo (when we asked for help, NVivo personnel said this happened because of date problems, which we have not been able to solve). We just switched to Netvizz and then opened the dataset in NVivo to do content analysis of posts.
b) we have noticed differences in the amount of information gathered by NCapture in comparison with Netvizz. There are chunks of periods in which posts have not been collected by NCapture (and we confirmed that the data is there), which has surprised us, because we had not previously had this problem.
I am happy to talk to you about these and other issues.
Marisa
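A minimal sketch of how the kind of gap described in (b) might be spotted in a collected dataset, offered as an illustration rather than anything the tools above do themselves. It assumes each post's creation time has already been parsed into a Python datetime; the function name find_gaps, the seven-day threshold, and the sample dates are all hypothetical.

```python
from datetime import datetime, timedelta


def find_gaps(timestamps, max_gap=timedelta(days=7)):
    """Return (earlier, later) pairs of consecutive posts more than max_gap apart."""
    ordered = sorted(timestamps)
    gaps = []
    for earlier, later in zip(ordered, ordered[1:]):
        if later - earlier > max_gap:
            gaps.append((earlier, later))
    return gaps


# Hypothetical post creation times, standing in for one tool's export.
posts = [
    datetime(2017, 1, 2),
    datetime(2017, 1, 5),
    datetime(2017, 2, 20),
    datetime(2017, 2, 22),
]

for start, end in find_gaps(posts):
    print(f"No posts collected between {start:%Y-%m-%d} and {end:%Y-%m-%d}")
```

A flagged interval only says that the export is silent for that period; whether the page really posted nothing still has to be checked against the page itself.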
--
Marisa von Bülow Professora Associada/Professor Instituto de Ciência Política/Political Science Institute IPOL - UnB/University of Brasilia
-- Deen Freelon, Ph.D. Associate Professor School of Communication, American University Office: McKinley 325 freelon@american.edu | http://dfreelon.org | @dfreelon <https://twitter.com/dfreelon> New report: Beyond the Hashtags: #Ferguson, #Blacklivesmatter, and the Online Struggle for Offline Justice <http://www.cmsimpact.org/blmreport>
b) we have noticed differences in the amount of information gathered by NCapture in comparison with Netvizz. There are chunks of periods in which posts have not been collected by NCapture (and we confirmed that the data is there), which has surprised us, because we had not previously had this problem.
I fear that this may have little to do with NCapture or Netvizz per se, but with the way APIs have come to function nowadays. Platforms like Facebook have the imperative to provide service as fast as possible with as much uptime as possible. Data completeness is simply not a concern.

I imagine that Facebook has a tiered storage architecture that will hold some data readily available, while other elements are stored further down the hierarchy. Which elements are currently held in fast storage depends on the shard (storage unit) users are currently connecting to. Shards are possibly assigned on the basis of IP, app identifier (e.g. NCapture's app token), user identifier, main network affiliation, etc. When data is not in the fast storage you're connected to, it may be omitted.

This is all just speculation, but as the developer of Netvizz, I have observed that sometimes, particularly at peak hours, posts can be missing for one user, while another user can get everything without any problems. Liking a page seems to have some effect on this. Netvizz tries to alleviate some of these problems through caching, but given the size of Facebook compared to our meager resources, this is a losing battle.

My recommendation would be to compare retrieved data with the actual pages at least in a cursory fashion and, if possible, to check by downloading data from two different user accounts. Ultimately, any tool that uses the public API is just a dumb data exporter that sits on Facebook's vast and weird data infrastructure - which we know precious little about.

cheers,
Bernhard

--
Bernhard Rieder | Associate Professor | New Media and Digital Culture
University of Amsterdam | Turfdraagsterpad 9 | 1012 XT Amsterdam | The Netherlands
http://thepoliticsofsystems.net | http://rieder.polsys.net | https://www.digitalmethods.net | @RiederB
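The cross-check Bernhard recommends (downloading the same page from two accounts, or with two tools, and comparing) can be roughly sketched in Python. This is an illustration under stated assumptions, not part of Netvizz, NCapture, or any other tool mentioned here: it assumes both downloads were saved as CSV with a column holding the post ID, and the column name post_id, the function names, and the sample rows are hypothetical.

```python
import csv
import io


def load_post_ids(csv_file, id_column="post_id"):
    """Collect the set of post IDs from an open CSV file (column name is assumed)."""
    return {row[id_column] for row in csv.DictReader(csv_file)}


def compare_exports(ids_a, ids_b):
    """Report which post IDs appear in only one of the two downloads."""
    return {
        "only_in_a": sorted(ids_a - ids_b),
        "only_in_b": sorted(ids_b - ids_a),
        "in_both": len(ids_a & ids_b),
    }


# Hypothetical exports of the same page retrieved from two user accounts.
export_a = io.StringIO("post_id,message\n1_10,hello\n1_11,world\n1_12,again\n")
export_b = io.StringIO("post_id,message\n1_10,hello\n1_12,again\n1_13,extra\n")

report = compare_exports(load_post_ids(export_a), load_post_ids(export_b))
print(report)
```

With real files, the io.StringIO objects would be replaced by open(path, newline="", encoding="utf-8"). Posts missing from both downloads will not show up this way, so a cursory look at the actual page is still needed.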
participants (9)
- Bernhard Rieder
- Deen Freelon
- Fabio Gouveia
- Marisa von Bülow
- mona arslan
- Patricia Rossini
- Piergiorgio Degli Esposti
- Shulman, Stu
- Silvana di Gregorio