Hi Kathleen, Apache Lucene is the best resource for something like this, in my opinion. Available here: http://lucene.apache.org/ Requires some programming knowledge though. Thanks, Wojciech On Mon, Feb 13, 2012 at 12:33 AM, Kathleen Stansberry <kpontius@uoregon.edu>wrote:
I¹m working on a project that involves conducting a cluster analysis (type of textual analysis based on Kenneth Burke¹s work) on the content of five different websites. I want to download the full content of these five sites so I have hard copies to work from during the rather arduous process of going through and categorizing the text.
Can anyone recommend a good program to download full websites (to a page depth of at least 3)? I¹ve been using SiteSucker but am finding it a bit buggy.
Thank you! Katie
Kathleen Stansberry Ph.D. Candidate University of Oregon School of Journalism and Communication http://katiestansberry.com kpontius@uoregon.edu (541) 228-5576 _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/