I've served on a couple of University IRBs and recently had a chapter published on getting online research approved by IRBs - so here's my take (and from a strictly US perspective) Getting permission from the site owners is helpful but not determinative. The two key issues that I'd say would be the emphasis of IRB concerns are: 1) Is the site and content public? 2) How are individual responses recorded? For 1)- it's a little tricky since one could argue that anything on the internet is inherently public, but the federal IRB guidelines define "Public" in an online sense in terms of whether the people providing the responses have any expectation of privacy for their responses and the site. If there's some expectation of privacy, the IRB's likely to ask you to get consent from participants before collecting responses. Otherwise the responses are public and fair game. For 2) - to qualify as "exempt" research the key is that the data/responses must be recorded anonymously- Basically that means that you don't record any personally identifying information (like usernames). If the raw data comes with identifiers there are two options- first, strip the identifying information from the raw data and create a research data set with no identifiers - then destroy the raw dataset (so you can't go back and peek); if you want to somehow link an individual's responses, you can also replace the personal identifier (username) with a generic identifier (Subject1) to create a new research dataset before destroying the raw data. It's also helpful (but not necessary) to have someone other than the researchers do the anonymising. The key to qualifying as exempt is that the data you actually use for the research is recorded anonymously - and that you don't keep raw data with personally identifying information long (or better, as Pis you don't even see the raw data (the anonymising is done by someone else)). I hope this helps, Ben Bates College of Comm & Info U of Tennessee, Knoxville
Message: 2 Date: Thu, 25 Oct 2012 16:11:06 -0700 From: "Dan L. Burk" <dburk@uci.edu> To: "Kathie Gossett" <gossettphd@gmail.com> Cc: Charles Tong <charles.t.tong@gmail.com>, air-l@listserv.aoir.org Subject: Re: [Air-L] IRB Exemption and Terms of Service - Reddit.com Message-ID: <8acaf68737c537202a2af478b7af8dae.squirrel@webmail.uci.edu> Content-Type: text/plain;charset=iso-8859-1
Er -- since we don't know what jurisdiction Charles is in, we can't say whether fair use applies to him or not (or, for that matter, how enforceable the Reddit ToS is against him).
It sounds like he might be located in the U.S. but that's a guess.
I also don't think we know enough about his research to say very much about copyright/fair use. There are a lot of moving parts there. Just as an example, it's not clear to me from his post what kind of data he is gathering -- but the copyright in the "influential posts" he wants to analyze may very well lie with the authors, not with Reddit.
(In which case, if his IRB wants informed consent from the participants, he might end up with their authorization to copy the material as well.)
DLB
Charles,
You've got two separate issues here: copyright and IRB.
According to my understanding you should be able to use the data you're collecting under the Fair Use clause. The TOS you copied below specifically mentions commercial or unauthorized uses under copyright. The Fair Use clause states: "...the fair use of a copyrighted work, including such use by reproduction in copies or phonorecords or by any other means specified by that section, for purposes such as criticism, comment, news reporting, teaching (including multiple copies for classroom use), scholarship, or research, is not an infringement of copyright." You need to understand though that fair use is not a right, it's a "defensible position", meaning that it is an acceptable defense in any legal action; however, you would have to prove that your use of the material falls under the fair use clause in a courtroom if the company decides to challenge your use of their data--it doesn't automatically negate the possibility of legal action.
There are a couple of things I'd suggest. First, you need to find out what your advisor will approve. They are directing your thesis and you need to make sure any data you use meets with their approval. Second, most university legal departments have some statement about what constitutes fair use on their campus, so I'd check with them (but I'd say only if your advisor requires it--many universities actually have much stricter policies on paper than are required under the Fair Use clause). Third, I'd look at the Ethics Guide from AOIR (http://aoir.org/documents/ethics-guide/) and make sure you're working within those guidelines. They've been developed by scholars who regularly work with the type of data you're using, so you should make sure your study works within those guidelines.
Regarding IRB, I agree with Charles. You should contact your university IRB and find out what requirements they may have for exempt or non-exempt status. Often departments or colleges within universities will have an IRB specialist that you can work with and who will help you fill out the appropriate applications.
Good luck, Kathie
On Oct 25, 2012, at 3:22 PM, Charles Tong wrote:
Hello AoIR,
My name is Charles Tong and I currently working on my undergraduate thesis in Sociology. Specifically, I am trying to construct a social network based on reddit.com by data-mining posts and comments from day to day. I am specifically looking to identify how consistent influence and power is created on the website. I hope to do this by constructing multiple networks from day to day of specific sub-forums and differentiating between those who are one-time stars and consistent stars. Then I will perform a content analysis of certain influential posts to determine what makes someone consistently powerful or not and in what ways they are influential.
Unfortunately, I am unsure of how this will pass under IRB exemption. I am hoping that anyone on the AoIR mailing list may have some insight that may help. I have attempted to e-mail and get in contact with the website numerous times, but each time, they have not responded. I understand that I will have to send reddit.com?s user agreement.
The user agreement states the following: ?You may not in any way make commercial or other unauthorized use, by publication, re-transmission, distribution, performance, caching, or otherwise, of material obtained through the Website, including without limitation the Assets or Website Content, except as permitted by the Copyright Act or other law or as expressly permitted in writing by this Agreement, Service Provider or the Website.?
I was wondering if anyone knows how to approach this specific issue. I understand that Fair Use might permit this research under the Copyright Act, but I am not sure of its nature.
Thank you very much in advance for your help. Let me know if any additional information should be provided.
Best Regards, Charles Tong _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
_______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/
-- School of Law University of California, Irvine 4500 Berkeley Place Irvine, CA 92697-8000 Voice: (949) 824-9325 Fax: (949)824-7336 bits: dburk@uci.edu
*************************************
Greetings, To build on what Ben has said. I too am an IRB member. I also do online research with a protected population (U.S.A.) so my work comes under additional IRB scrutiny. When I apply for approval my requests include a variety of wavers and requests. First, I do not anonymize for my public research...when I am working with their webpages only. I write that fact into my IRB requests. I take the position that the adolescents I study have already chosen to be known by their online monikers and that their published work is available to the entire internet....while mine is only available to a subset thereof. This logic has been consistently approved. Likewise I do not request approval from the participants to use their online words because they have already made them available to the entire internet. This logic would not prevail for protected sites...where more than a a cursory registration is necessary, if you can register and immediately receive registration approval then the registration is considered to be perfunctory so basically public. This logic is analogous to studying letters to the editor in newspapers since the writer chose to submit the letter for publication, and even though access to the newspaper is limited to subscribers....anyone who pays their subscription fee including public libraries, etc. can see their letters. The above changes when I communicate with the adolescent website producers directly. Then I ask for waivers so I don't have to get parental consent...this is only appropriate when the questions are not too personal or controversial, or when the teens would be in more danger from getting parental approval then if there were no approval. My work falls into the first category, I have not asked particularly personal information beyond basic demographic information. I also ask for waivers of documented assent from the adolescents themselves. That way I have nothing that ties their real names to the monikers they choices to use online...which may or may not be related to their real names. Even with this waiver I have been allowed to ask survey participants for contact information if they would like to be interviewed at a later date....their choice. In the states, local IRB's can approve waivers for most any of the requirements if the waiver request meets criteria. However not every college and university will entertain waiver requests....so you need to know if waivers are possible before you design you research project. Hope that helps Lois Ann Scheidt Doctoral Candidate - School of Library and Information Science, Indiana University, Bloomington IN USA Webpage: http://www.loisscheidt.com CV: http://www.loisscheidt.com/cv.html Blog: http://www.professional-lurker.com On Sat, Oct 27, 2012 at 7:35 AM, Bates, Benjamin J <bjbates@utk.edu> wrote:
I've served on a couple of University IRBs and recently had a chapter published on getting online research approved by IRBs - so here's my take (and from a strictly US perspective)
Getting permission from the site owners is helpful but not determinative. The two key issues that I'd say would be the emphasis of IRB concerns are: 1) Is the site and content public? 2) How are individual responses recorded?
For 1)- it's a little tricky since one could argue that anything on the internet is inherently public, but the federal IRB guidelines define "Public" in an online sense in terms of whether the people providing the responses have any expectation of privacy for their responses and the site. If there's some expectation of privacy, the IRB's likely to ask you to get consent from participants before collecting responses. Otherwise the responses are public and fair game.
For 2) - to qualify as "exempt" research the key is that the data/responses must be recorded anonymously- Basically that means that you don't record any personally identifying information (like usernames). If the raw data comes with identifiers there are two options- first, strip the identifying information from the raw data and create a research data set with no identifiers - then destroy the raw dataset (so you can't go back and peek); if you want to somehow link an individual's responses, you can also replace the personal identifier (username) with a generic identifier (Subject1) to create a new research dataset before destroying the raw data. It's also helpful (but not necessary) to have someone other than the researchers do the anonymising. The key to qualifying as exempt is that the data you actually use for the research is recorded anonymously - and that you don't keep raw data with personally identifying information long (or better, as Pis you don't even see the raw data (the anonymising is done by someone else)).
I hope this helps,
Ben Bates College of Comm & Info U of Tennessee, Knoxville
participants (2)
-
Bates, Benjamin J -
Lois Scheidt