Downloads

The full collection of Italian tweets is not available for download due to the Twitter terms and conditions, so we release the TWITA as a list of the IDs of the tweet we collected. This is enough to recreate the collection, minus the tweets that have been deleted. One inefficient way of downloading a tweet given its ID is with the wget command:

$ wget http://www.twitter.com/uid/statuses/TWEET_ID
and subsequently parse its output looking for the tag
<p class="js-tweet-text tweet-text">
A better way would be to use the Twitter Streaming API.

new! TWITA tweet ID lists

note: there is a "hole" in the collection from 12th to 30th June, due to a failure to switch to the new version of the Twitter API.

Hashtags frequency lists

Resources for Sentiment Analysis