Facebook Data of 1.2 Million Users from 2005 Released
Dear AoIR-ers: Some might be interested in this Facebook dataset release, and the issues I raise regarding its "anonymization". -mz Facebook Data of 1.2 Million Users from 2005 Released: Limited Exposure, but Very Problematic http://michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-... Recently, a Facebook dataset was released consisting of the complete set of users from the Facebook networks at 100 American institutions, and all of the in-network “friendship” links between those users as they existed at a single moment of time in September 2005. Surprisingly, it initially included each users unique Facebook ID, meaning the presumed “anonymous” dataset could be easily re-identified, potentially putting the personal information of 1.2 million Facebook users at risk. -- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org
Michael, I read through your blog post via a link from Twitter this morning. I agree that's it's a big misstep to include the User IDs. But in your write-up, as well as on the release page of the dataset, there doesn't seem to be any mention of what percentage of the users (at the time of collection) had privacy settings enabled versus were fully public. Any idea? Alex --- Alexander Leavitt Researcher Microsoft Research New England Microsoft Entertainment & Devices Convergence Culture Consortium (Comparative Media Studies, MIT) http://alexleavitt.com http://doalchemy.org Twitter: @alexleavitt On Tue, Feb 15, 2011 at 12:38 PM, Michael Zimmer <zimmerm@uwm.edu> wrote:
Dear AoIR-ers:
Some might be interested in this Facebook dataset release, and the issues I raise regarding its "anonymization".
-mz
Facebook Data of 1.2 Million Users from 2005 Released: Limited Exposure, but Very Problematic
http://michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-...
Recently, a Facebook dataset was released consisting of the complete set of users from the Facebook networks at 100 American institutions, and all of the in-network “friendship” links between those users as they existed at a single moment of time in September 2005. Surprisingly, it initially included each users unique Facebook ID, meaning the presumed “anonymous” dataset could be easily re-identified, potentially putting the personal information of 1.2 million Facebook users at risk.
-- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
At the time - in Facebook - there really was no conception of fully public. One could only be open to their friends or their networks. So "public" was a function of the size of the networks in which an individual participated. To the question of being "open to the network," in 2005 it is fairly safe to assume that 80-90% of profiles were open to the network. We saw numbers like that fairly consistently across some studies run at the time. Best, Fred On Feb 15, 2011, at 12:45 PM, Alex Leavitt wrote:
Michael,
I read through your blog post via a link from Twitter this morning. I agree that's it's a big misstep to include the User IDs. But in your write-up, as well as on the release page of the dataset, there doesn't seem to be any mention of what percentage of the users (at the time of collection) had privacy settings enabled versus were fully public. Any idea?
Alex
---
Alexander Leavitt Researcher Microsoft Research New England Microsoft Entertainment & Devices Convergence Culture Consortium (Comparative Media Studies, MIT) http://alexleavitt.com http://doalchemy.org Twitter: @alexleavitt
On Tue, Feb 15, 2011 at 12:38 PM, Michael Zimmer <zimmerm@uwm.edu> wrote:
Dear AoIR-ers:
Some might be interested in this Facebook dataset release, and the issues I raise regarding its "anonymization".
-mz
Facebook Data of 1.2 Million Users from 2005 Released: Limited Exposure, but Very Problematic
http://michaelzimmer.org/2011/02/15/facebook-data-of-1-2-million-users-from-...
Recently, a Facebook dataset was released consisting of the complete set of users from the Facebook networks at 100 American institutions, and all of the in-network “friendship” links between those users as they existed at a single moment of time in September 2005. Surprisingly, it initially included each users unique Facebook ID, meaning the presumed “anonymous” dataset could be easily re-identified, potentially putting the personal information of 1.2 million Facebook users at risk.
-- Michael Zimmer, PhD Assistant Professor, School of Information Studies Co-Director, Center for Information Policy Research University of Wisconsin-Milwaukee e: zimmerm@uwm.edu w: www.michaelzimmer.org
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- Fred Stutzman Postdoctoral Fellow H. John Heinz III College Carnegie Mellon University fred@fredstutzman.com
BOOK ANNOUNCEMENT: Digital Jesus Hello AoIRers: I apologize for the self-promotion, but I wanted all my friends and colleagues in the AoIR community to hear that (after a long haul on this research) my book _Digital Jesus_ is finally available from NYU! As may be obvious, its about online religion (or religion online . . . maybe); but largely vetted by a more Religious Studies sort of crowd. Nonetheless, many members of this list have been a huge influence on my thinking in it; and I am thrilled to let you know it finally exists! Thanks to you all for blazing the trail of internet research! Order it at Amazon: http://www.amazon.com/Digital-Jesus-Christian-Fundamentalist-Alternative/dp/... Or you can save a dime by downloading the intro. for free from NYU through a link on the front of my homepage. With just the intro., will miss all the fantastic examples! But it has the basic point of course. http://rghoward.com Thanks for taking a quick look at it if you can! Rob -------------------------------------------------------------------------------------- Digital Jesus: The Making of a New Christian Fundamentalist Community on the Internet "In the 1990s, Marilyn Agee developed one of the most well-known amateur evangelical websites focused on the “End Times”, The Bible Prophecy Corner. Around the same time, Lambert Dolphin, a retired Stanford physicist, started the website Lambert’s Library to discuss with others online how to experience the divine. While Marilyn and Lambert did not initially correspond directly, they have shared several correspondents in common. Even as early as 1999 it was clear that they were members of the same online network of Christians, a virtual church built around those who embraced a common ideology. Digital Jesus documents how such like-minded individuals created a large web of religious communication on the Internet, in essence developing a new type of new religious movement—one without a central leader or institution. Based on over a decade of interaction with figures both large and small within this community, Robert Glenn Howard offers the first sustained ethnographic account of the movement as well as a realistic and pragmatic view of how new communication technologies can both empower and disempower the individuals who use them. By tracing the group’s origins back to the email lists and “Usenet” groups of the 1980s up to the online forums of today, Digital Jesus also serves as a succinct history of the development of online group communications." Robert Glenn Howard is associate professor in the Department of Communication Arts and Director of Digital Studies at the University of Wisconsin-Madison. Currently he is also associate director of the Folklore Program at Wisconsin and editor of the journal Western Folklore." Robert Glenn Howard http://rghoward.com _______________________________ University of Wisconsin -- Madison Associate Professor, Department of Communication Arts Director, Digital Studies Associate Chair, Folklore Program Affiliated Faculty, Religious Studies and the Center for the Study of Upper Midwestern Culture Editor, Western Folklore
participants (4)
-
Alex Leavitt -
Fred Stutzman -
Michael Zimmer -
RGH