Twitter Data Sharing Update - Thou Shalt Not Share Collections of Tweets
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS. http://bit.ly/l8DSA3 To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time. It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they? I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team. ~Stu -- Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/> Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs*
This is very interesting, Stuart, I have tweeted it. Sadly, this is not a new phenomena. Many times the owners of a web service uses the TOS to put walls around knowledge, no matter if people is generating that knowledge. And usually happens after the "garage age"; before that age, owners begs for a little of attention. I allways says about this kind of conduct in digital projects owners: "All of them are hippies, until they enter Wall Street" Alejandro Tortolini Scitech journalist - Teacher Member of RADPC <http://www.radpc.org>
IIRC Twitter was sending regular data dumps to the Library of Congress. If that's the case, why not see if you can snag/grab/analyse the LoC's collection of UBL tweets instead? I would think that the LoC, unlike Twitter, would welcome such scholarly activities. -- rick On May 5, 2011, at 08:24 , Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
As far as I know (via casual conversation with a friend who works on the project), the Library of Congress is still figuring out storage and access issues around their Twitter collection - I would not expect to have access to that data for some time, and when it does come, you will most likely need to work with someone at the LOC to have them manage your request. -Amanda Amanda Lenhart Pew Research Center alenhart@pewinternet.org -----Original Message----- From: air-l-bounces@listserv.aoir.org [mailto:air-l-bounces@listserv.aoir.org] On Behalf Of Richard Forno Sent: Thursday, May 05, 2011 9:38 AM To: stuart.shulman@gmail.com Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Twitter Data Sharing Update - Thou Shalt Not Share Collections of Tweets IIRC Twitter was sending regular data dumps to the Library of Congress. If that's the case, why not see if you can snag/grab/analyse the LoC's collection of UBL tweets instead? I would think that the LoC, unlike Twitter, would welcome such scholarly activities. -- rick On May 5, 2011, at 08:24 , Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org Join the Association of Internet Researchers: http://www.aoir.org/
This is correct. And the original agreements between LoC and Twitter note a 6-month lag in making tweets available. -- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org On May 5, 2011, at 9:01 AM, Amanda Lenhart wrote:
As far as I know (via casual conversation with a friend who works on the project), the Library of Congress is still figuring out storage and access issues around their Twitter collection - I would not expect to have access to that data for some time, and when it does come, you will most likely need to work with someone at the LOC to have them manage your request.
-Amanda Amanda Lenhart Pew Research Center alenhart@pewinternet.org
-----Original Message----- From: air-l-bounces@listserv.aoir.org [mailto:air-l-bounces@listserv.aoir.org] On Behalf Of Richard Forno Sent: Thursday, May 05, 2011 9:38 AM To: stuart.shulman@gmail.com Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Twitter Data Sharing Update - Thou Shalt Not Share Collections of Tweets
IIRC Twitter was sending regular data dumps to the Library of Congress. If that's the case, why not see if you can snag/grab/analyse the LoC's collection of UBL tweets instead? I would think that the LoC, unlike Twitter, would welcome such scholarly activities.
-- rick
On May 5, 2011, at 08:24 , Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/ _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
It appears the Twitter API doesn't care if you're selling or giving it away, as I.4.a prohibits any attempt to "sell, rent, lease, sublicense, redistribute, or syndicate access to the Twitter API or Twitter Content to any third party without prior written approval from Twitter", as well as noting that "Exporting Twitter Content to a datastore as a service or other cloud based service, however, is not permitted" http://dev.twitter.com/pages/api_terms I'm not justifying their terms, but it does appear that you violated them. -mz -- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org On May 5, 2011, at 7:24 AM, Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
Hi Michael, We did not contest the violation warning and so we took the data down. The policy is another matter. These are tweets that yearn to be free, insofar as tweets can collectively yearn for anything: http://bit.ly/ixJVHu In this instance, I find myself liking the Facebook policy, which in an opposite manner sets data free. This may explain, in part, why the "Scraping Facebook" video seems to be getting more views than the one on harvesting Twitter tweets: http://www.screencast.com/t/iW3rvdYY ~Stu On Thu, May 5, 2011 at 10:25 AM, Michael Zimmer <zimmerm@uwm.edu> wrote:
It appears the Twitter API doesn't care if you're selling or giving it away, as I.4.a prohibits any attempt to "sell, rent, lease, sublicense, redistribute, or syndicate access to the Twitter API or Twitter Content to any third party without prior written approval from Twitter", as well as noting that "Exporting Twitter Content to a datastore as a service or other cloud based service, however, is not permitted"
http://dev.twitter.com/pages/api_terms
I'm not justifying their terms, but it does appear that you violated them.
-mz
-- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org
On May 5, 2011, at 7:24 AM, Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/> Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs*
Are there any implications from using RSS feeds on Twitter search or Twitter profiles to "redistribute" tweets? ________________________________________ From: Michael Zimmer [zimmerm@uwm.edu] Sent: Friday, 6 May 2011 02:25 To: stuart.shulman@gmail.com Cc: air-l@listserv.aoir.org Subject: Re: [Air-L] Twitter Data Sharing Update - Thou Shalt Not Share Collections of Tweets It appears the Twitter API doesn't care if you're selling or giving it away, as I.4.a prohibits any attempt to "sell, rent, lease, sublicense, redistribute, or syndicate access to the Twitter API or Twitter Content to any third party without prior written approval from Twitter", as well as noting that "Exporting Twitter Content to a datastore as a service or other cloud based service, however, is not permitted" http://dev.twitter.com/pages/api_terms I'm not justifying their terms, but it does appear that you violated them. -mz -- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org On May 5, 2011, at 7:24 AM, Stuart Shulman wrote:
Twitter closed down our efforts to share post-Osama bin Laden Twitter data (or any other collections) for research purposes, again citing their TOS & API TOS.
To be clear: we were giving the data away, not selling it. Also, it was not scraped of Twitter. Rather, it was gathered using a Twitter-authorized account and an API that lets us fetch 1500 items at a time.
It is a shame that the now 2 million tweets cannot, for example, be sampled and coded using a crowd source model. Or could they?
I am assuming the provision against sharing data does not extend to individuals who gather it and keep it to themselves or work with it in a research team.
~Stu
--
Stuart Shulman President & CEO Texifter, LLC <http://www.texifter.com/>
Have you tried DiscoverText? http://discovertext.com *Featuring the Facebook Graph & Twitter APIs* _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
participants (6)
-
Alejandro Tortolini -
Amanda Lenhart -
Andrew Long -
Michael Zimmer -
Richard Forno -
Stuart Shulman