Over the summer I've been working with a summer intern to sample and analyze data from Weibo. We've been collecting Weibo posts over the past couple months, sampling the public stream every few seconds, using multiple API keys coming from various IP addresses (yup, they heavily rate limit API access). Some interesting facts about Weibo's API: - The way we're currently sampling, we're seeing around 4000 posts per minute - Weibo doesn't easily provide the friendship/follower graph like Twitter. It only reveals the last 5k followers for any public account. - Weibo uses an explicit sentiment/emoticons mechanism which is very popular. They link an emoticon to an emotion (spelled out). When a user chooses an icon, it embeds the word that the icon represents, within the user's post. Its possible to start mining this sentiment by looking at Weibo posts (emotions are placed within square brackets). - The API has an up-and-coming feature (according to their docs) which will give us the ability to know how many of the account's followers are online at any given time (VERY COOL). We just published a first analysis from this data, looking at people's reaction to Olympic hurdler Liu Xiang's epic fail. http://blog.socialflow.com/post/7120245585/weibo-chinas-twitter-equivalent-a... The post shows some of the ways in which we can use Weibo data. We have this growing corpus of Weibo data. I'd love to collaborate with other folks (or their students!) who want to explore the data and help us figure out how we can use it to learn about public sentiment in China. I'm not ready to make a public call yet, as I don't want Weibo to ban us (or our IP addresses) from hitting their servers. Let me know if this sounds interesting, or if there's someone I should talk to! -- Gilad | @gilgul thoughts: http://giladlotan.com/blog activism: http://www.globalvoicesonline.org/author/gilad-lotan/
We have a slightly dormant working group on Google for Weibo, but there are some good posts in there and folks you might network with. http://blog.texifter.com/index.php/2012/05/05/sina-weibo-working-group/ People who join and participate in the group get free access to our Enterprise licenses. Our main problem so far has been that Weibo shut off our access to the API. However, if you have Weibo data you can upload it in .CSV format. We are working to restore access to the API via DiscoverText, for now, it is a suspended option: http://discovertext.com/aboutdt/import.aspx ~Stu On Fri, Aug 10, 2012 at 4:36 PM, Gilad Lotan <giladlotan@gmail.com> wrote:
Over the summer I've been working with a summer intern to sample and analyze data from Weibo. We've been collecting Weibo posts over the past couple months, sampling the public stream every few seconds, using multiple API keys coming from various IP addresses (yup, they heavily rate limit API access).
Some interesting facts about Weibo's API:
-- Dr. Stuart W. Shulmanhttp://people.umass.edu/stu Founder and CEO, Texifterhttp://texifter.com LinkedIn: http://www.linkedin.com/pub/stuart-shulman/10/351/899 Twitter: http://twitter.com/#!/StuartWShulman Director, QDAP-UMasshttp://www.umass.edu/qdap Editor Emeritus, JITPwww.jitp.net
participants (2)
-
Gilad Lotan -
Shulman, Stuart