About the Repository

The Social Computing Data Repository hosts data from a collection of social media sites, most of which have blogging capacity. Prominent social media sites in this repository include BlogCatalog, Twitter, MyBlogLog, Digg, StumbleUpon, del.icio.us, MySpace, LiveJournal, The Unofficial Apple Weblog (TUAW), and Reddit. The repository contains various facets of blog data: blog site metadata, such as user-defined tags, predefined categories, and blog site descriptions; blog post metadata, such as user-defined tags and the date and time of posting; blog posts; blog post mood (the blogger's emotions at the time of writing the post); blogger names; blog post comments; and blogger social networks.
The repository was designed in 2009 by Reza Zafarani and Huan Liu. Funding support from the Air Force Office of Scientific Research (AFOSR) and the Office of Naval Research (ONR) is gratefully acknowledged. Credit also goes to our dataset creators, who made assembling this repository possible.