Re: [Air-L] Wikipedia article edit history extraction tools?
You're probably already aware of it but I guess it's worth highlighting to the list too that the Oxford Internet Institute (OII) does some good stuff (http://www.oii.ox.ac.uk/), and Mark Graham (there) did a great talk on the Digital Divide which included lots of stuff a) about representation of demographic groups on Wiki (including dominant language - dominant on wiki, males more dominant both in contributions and pages, etc. which sometimes reflects the historic artefact, e.g. the husband of the woman with most children has a wiki page, while she does (or did?) not). Talk can be found here http://www.zerogeography.net/2012/04/video-of-my-dfid-talk-is-now-online.htm.... Good luck with the research Simon -- Simon Knight Teaching Associate, ORBIT Project http://orbit.educ.cam.ac.uk/ Nominet Trust Associate Researcher/blogger http://www.nominettrust.org.uk/knowledge-centre/blogs/ <http://www.nominettrust.org.uk/knowledge-centre/blogs/?filters=uid%3A3929> MPhil candidate, University of Cambridge https://sites.google.com/site/sjgknight/ On 15/08/2012 11:00 PM, air-l-request@listserv.aoir.org wrote:
Send Air-L mailing list submissions to air-l@listserv.aoir.org
To subscribe or unsubscribe via the World Wide Web, visit http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org or, via email, send a message with subject or body 'help' to air-l-request@listserv.aoir.org
You can reach the person managing the list at air-l-owner@listserv.aoir.org
When replying, please edit your Subject line so it is more specific than "Re: Contents of Air-L digest..."
Today's Topics:
1. Wikipedia article edit history extraction tools? (Monika Sengul-Jones) 2. Re: Wikipedia article edit history extraction tools? (Peter Timusk) 3. Re: Wikipedia article edit history extraction tools? (Marj Kibby) 4. Fee waiver and Ignite-IR deadline today (Alexander Halavais) 5. Re: Public Sphere article? (Tessa Houghton) 6. Re: Public Sphere Article? (Taylor-Smith, Ella) 7. Curating Public Sphere readings discussion (David Brake) 8. Papers on Public Opinion (Patricia Rossini) 9. Re: Papers on Public Opinion (Peter Gloviczki) 10. Public Sphere Article? (John Monberg) 11. CfP Experiential Learning in Virtual Worlds Conference 2013 (Moira Hunter) 12. Social Informatics Blog (David Nemer) 13. Re: Wikipedia article edit history extraction tools? (Monika Sengul-Jones)
----------------------------------------------------------------------
Message: 1 Date: Tue, 14 Aug 2012 15:38:53 -0700 From: Monika Sengul-Jones <jones.monika@gmail.com> To: air-l@listserv.aoir.org Subject: [Air-L] Wikipedia article edit history extraction tools? Message-ID: <CAD_EfP+5jdaG8MDPfGMSwMO=vCifMU_bR8+7MPvDUF5Yb2AVyQ@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1
Hello Air-L list:
This summer I'm doing research on Wikipedia entries in the field of Science and Technology Studies. A central question I'm asking is the extent to which this field, as it is now on Wikipedia, includes/features/references contributions made by women, feminist theorists, and feminist theory.
To answer this, I'm gathering data on existing pages using a variety of mixed methods. I would like to ask for recommendations on tools for extracting the history of editing on a page. I want to see how many times a given article has been edited, by whom, and what types of edits and content contributions are made over time. So far, I've found the "history" tool on the Wikipedia page limited. I cannot see how many edits have been made on a particular article and understanding what kinds of edits are made (e.g. grammatical, content) requires going into each historical page view. I'd love to find a way to download the history of an article and extract the data into a spreadsheet -- perhaps this is a tall order.
So far, I've found tools for extracting data on Wikipedia from the Digital Methods Initiative website (which was first introduced to me by this list serve! :)). Specifically, the program History Flow is useful to an extent for visualizing types of content contributions and edits over time. But there is no way to translate these visualizations into a spreadsheet format -- as far as I can tell -- so I've been doing that manually, somehow piecing together the history of edits on an article. Meanwhile, I was recommended a tool called WikiChecker ( http://en.wikichecker.com/article/?a=science_studies) but the summary format is limited and, at times, contradictory to data I get elsewhere.
If anyone has any other tools or methods to suggest for ways to collect data on content contributions and edits on Wikipedia I would be most grateful.
I'd also be happy to be in conversation with anymore interested in the concept of the project. I'm working on it as a part of the FemTechNet Initiative, spearheaded by Anne Balsamo and Alexandra Juhasz. I'm not sure if information on the initiative has circulated here, so I'll paste in a copy of the "call" which took place last spring. * http://aljean.files.wordpress.com/2012/05/femtechnet-long-form-invite-may-20... *
Thank you, Monika
-- Monika Sengul-Jones Graduate Student Communication & Science Studies University of California, San Diego msengul@ucsd.edu
------------------------------
Message: 2 Date: Tue, 14 Aug 2012 19:02:44 -0400 From: Peter Timusk <ptimusk@sympatico.ca> To: "'Monika Sengul-Jones'" <jones.monika@gmail.com>, <air-l@listserv.aoir.org> Subject: Re: [Air-L] Wikipedia article edit history extraction tools? Message-ID: <BLU0-SMTP44661A367FC0C8478B6FAEDCB70@phx.gbl> Content-Type: text/plain; charset="us-ascii"
Monika
I am not sure how you will get the demography variables you obviously need. I use a handle to do my edits on Wikipedia. That's all you see in the edit history. Of course some like me may have a male first name in this handle or a female first name. In my legal studies BA we learned that we had to cite the first names of scholars because this allowed us to see the gender. Wikipedia do not know my gender. Unlike some paid web site that may have my credit card data and access to my gender which they could in turn share with a researcher I don't think Wikipedia have much real data about me they can share.
-----Original Message----- From: air-l-bounces@listserv.aoir.org [mailto:air-l-bounces@listserv.aoir.org] On Behalf Of Monika Sengul-Jones Sent: August-14-12 6:39 PM To: air-l@listserv.aoir.org Subject: [Air-L] Wikipedia article edit history extraction tools?
Hello Air-L list:
This summer I'm doing research on Wikipedia entries in the field of Science and Technology Studies. A central question I'm asking is the extent to which this field, as it is now on Wikipedia, includes/features/references contributions made by women, feminist theorists, and feminist theory.
To answer this, I'm gathering data on existing pages using a variety of mixed methods. I would like to ask for recommendations on tools for extracting the history of editing on a page. I want to see how many times a given article has been edited, by whom, and what types of edits and content contributions are made over time. So far, I've found the "history" tool on the Wikipedia page limited. I cannot see how many edits have been made on a particular article and understanding what kinds of edits are made (e.g. grammatical, content) requires going into each historical page view. I'd love to find a way to download the history of an article and extract the data into a spreadsheet -- perhaps this is a tall order.
So far, I've found tools for extracting data on Wikipedia from the Digital Methods Initiative website (which was first introduced to me by this list serve! :)). Specifically, the program History Flow is useful to an extent for visualizing types of content contributions and edits over time. But there is no way to translate these visualizations into a spreadsheet format -- as far as I can tell -- so I've been doing that manually, somehow piecing together the history of edits on an article. Meanwhile, I was recommended a tool called WikiChecker ( http://en.wikichecker.com/article/?a=science_studies) but the summary format is limited and, at times, contradictory to data I get elsewhere.
If anyone has any other tools or methods to suggest for ways to collect data on content contributions and edits on Wikipedia I would be most grateful.
I'd also be happy to be in conversation with anymore interested in the concept of the project. I'm working on it as a part of the FemTechNet Initiative, spearheaded by Anne Balsamo and Alexandra Juhasz. I'm not sure if information on the initiative has circulated here, so I'll paste in a copy of the "call" which took place last spring. * http://aljean.files.wordpress.com/2012/05/femtechnet-long-form-invite-may-20 12.pdf *
Thank you, Monika
-- Monika Sengul-Jones Graduate Student Communication & Science Studies University of California, San Diego msengul@ucsd.edu _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
------------------------------
Message: 3 Date: Wed, 15 Aug 2012 09:26:13 +1000 From: Marj Kibby <Marj.Kibby@newcastle.edu.au> To: air-l@listserv.aoir.org Subject: Re: [Air-L] Wikipedia article edit history extraction tools? Message-ID: <502B6B34020000CB000534A2@WINDOMPRD00.newcastle.edu.au> Content-Type: text/plain; charset=US-ASCII
Is is possible to get access to non-public data http://meta.wikimedia.org/wiki/Research:Index
Marj
Associate Professor Marjorie Kibby, B.Ed, M.A, Ph.D, FHERDSA Director, Student Experience FEDUA Head of Discipline: Film, Media and Cultural Studies School of Humanities and Social Science The University of Newcastle Callaghan NSW 2308 Australia Marj.Kibby@newcastle.edu.au +61 2 49216604
Peter Timusk <ptimusk@sympatico.ca> 15/08/12 9:04 AM >>> Monika
I am not sure how you will get the demography variables you obviously need. I use a handle to do my edits on Wikipedia. That's all you see in the edit history. Of course some like me may have a male first name in this handle or a female first name. In my legal studies BA we learned that we had to cite the first names of scholars because this allowed us to see the gender. Wikipedia do not know my gender. Unlike some paid web site that may have my credit card data and access to my gender which they could in turn share with a researcher I don't think Wikipedia have much real data about me they can share.
-----Original Message----- From: air-l-bounces@listserv.aoir.org [mailto:air-l-bounces@listserv.aoir.org] On Behalf Of Monika Sengul-Jones Sent: August-14-12 6:39 PM To: air-l@listserv.aoir.org Subject: [Air-L] Wikipedia article edit history extraction tools?
Hello Air-L list:
This summer I'm doing research on Wikipedia entries in the field of Science and Technology Studies. A central question I'm asking is the extent to which this field, as it is now on Wikipedia, includes/features/references contributions made by women, feminist theorists, and feminist theory.
To answer this, I'm gathering data on existing pages using a variety of mixed methods. I would like to ask for recommendations on tools for extracting the history of editing on a page. I want to see how many times a given article has been edited, by whom, and what types of edits and content contributions are made over time. So far, I've found the "history" tool on the Wikipedia page limited. I cannot see how many edits have been made on a particular article and understanding what kinds of edits are made (e.g. grammatical, content) requires going into each historical page view. I'd love to find a way to download the history of an article and extract the data into a spreadsheet -- perhaps this is a tall order.
So far, I've found tools for extracting data on Wikipedia from the Digital Methods Initiative website (which was first introduced to me by this list serve! :)). Specifically, the program History Flow is useful to an extent for visualizing types of content contributions and edits over time. But there is no way to translate these visualizations into a spreadsheet format -- as far as I can tell -- so I've been doing that manually, somehow piecing together the history of edits on an article. Meanwhile, I was recommended a tool called WikiChecker ( http://en.wikichecker.com/article/?a=science_studies) but the summary format is limited and, at times, contradictory to data I get elsewhere.
If anyone has any other tools or methods to suggest for ways to collect data on content contributions and edits on Wikipedia I would be most grateful.
I'd also be happy to be in conversation with anymore interested in the concept of the project. I'm working on it as a part of the FemTechNet Initiative, spearheaded by Anne Balsamo and Alexandra Juhasz. I'm not sure if information on the initiative has circulated here, so I'll paste in a copy of the "call" which took place last spring. * http://aljean.files.wordpress.com/2012/05/femtechnet-long-form-invite-may-20 12.pdf *
Thank you, Monika
-- Monika Sengul-Jones Graduate Student Communication & Science Studies University of California, San Diego msengul@ucsd.edu _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
------------------------------
Message: 4 Date: Wed, 15 Aug 2012 00:27:02 -0500 From: Alexander Halavais <halavais@gmail.com> To: aoir list <air-l@aoir.org> Subject: [Air-L] Fee waiver and Ignite-IR deadline today Message-ID: <CAKxz=uCLALdaXmPchw7xYW6TQqW=_RiLdosSFdqtoGXkmvB7qA@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1
Just a quick reminder that if you planned to submit a proposal for the Ignite series, today (August 15, or at least before i wake up on the 16th) is your last chance. The ignite talks were a great way to get out fresh ideas last year to a large audience, and a lot of fun to watch. More details: http://ir13.aoir.org/ignite-ir/
Likewise, if you want to apply for the IR fee waiver and travel stipend, be sure to email your application today. More details: http://aoir.org/fee-waivertravel-stipend-for-ir13-0/
For those of you starting out your semesters, best of luck for the fall and I look forward to seeing you in Salford for a mid-semester break. I am already transcribing a map of the local pubs onto a silk handkerchief in case I get lost.
Best,
Alex
participants (1)
-
Simon Knight