I don't aware of any ready to use data mining tools. Probably you need to develop the tool yourself. For one project (e.g. Weiboscope [1]), we need to gather data from the API first and then do the data analysis. The tricky part about the Chinese language is that it is not space delimited and therefore one cannot tokenize a sentence into words as in the case of English (or other space delimited languages such as French or German.) It can be solved partially using text segmenters such as Stanford NLP toolkits or Jieba. [1] Fu, Chan, Chau. Assessing Censorship on Microblogs in China. https://hub.hku.hk/bitstream/10722/183851/1/content.pdf?accept=1 On Tue, May 9, 2017 at 11:58 PM, Helen Kennedy <h.kennedy@sheffield.ac.uk> wrote:
Hello clever AOIR folks
Asking for postgrad students: any recommendations of social media data mining tools that work on Chinese social media platforms / with Chinese languages?
Thanks!
Helen
-- Professor Helen Kennedy, Chair in Digital Society Department of Sociological Studies / Faculty of Social Sciences Elmfield, Northumberland Road Sheffield S10 2TU T: 0114 2226488 E: h.kennedy@sheffield.ac.uk
LATEST ARTICLE: *'*The Feeling of Numbers: emotions in everyday engagements with data and their visualisation <http://journals.sagepub.com/doi/abs/10.1177/0038038516674675?journalCode=soca>', *Sociology*, 2017. _______________________________________________ The Air-L@listserv.aoir.org mailing list is provided by the Association of Internet Researchers http://aoir.org Subscribe, change options or unsubscribe at: http://listserv.aoir.org/listinfo.cgi/air-l-aoir.org
Join the Association of Internet Researchers: http://www.aoir.org/