Random sample of tweets
The first dataset is a random sample of 720,000 tweets captured at 5-minute intervals from the public timeline over the period 1/26/09-6/13/09 using the Twitter API. This sample includes tweets from 437,708 unique users, but does not include tweets from those with protected accounts. This data set provides valuable insight into the prevalence of a variety of Twitter practices. Using this data, we found that: