Skip to main content

Social Media Data

This guide describes social media datasets available to the GW community, the Social Feed Manager service, and tools for analyzing social media data.

Social Media Datasets

The following are examples of datasets available to GW faculty, students, and researchers. 

We have a number of additional datasets available that may meet your needs. Contact us at libdata@gwu.edu for more information and access to the data. To comply with social media platforms' terms of service, full datasets are only available if you are affiliated with GW. 

2016 United States Presidential Election Tweet Ids

Approximately 280 million tweets related to the 2016 United States presidential election. They were collected between July 13, 2016 and November 10, 2016 from the Twitter API using Social Feed Manager.

Each subset was collected either from the Twitter API's user timeline method or the Twitter Stream API's filter method. The subsets include:

  • Candidates and key election hashtags (Twitter filter)
  • Democratic candidates (Twitter user timeline)
  • Democratic Convention (Twitter filter)
  • Democratic Party (Twitter user timeline)
  • Election Day (Twitter filter)
  • First presidential debate (Twitter filter)
  • GOP Convention (Twitter filter)
  • Republican candidates (Twitter user timeline)
  • Republican Party (Twitter user timeline)
  • Second presidential debate (Twitter filter)
  • Third presidential debate (Twitter filter)
  • Vice Presidential debate (Twitter filter)

Womens March

Contains 7,275,228 tweets related to the Women's March on January 21, 2017. They were collected between December 19, 2016 and January 23, 2017 using Social Feed Manager. These tweets were collected using the POST statuses/filter method of the Twitter Stream API. 

Keywords tracked include:

WomensMarch, #WMW, #WMWArchiveProject, #WhyIMarch, #ActivistADay, #WomensMarchWednesday, #WMWArt, #MarchMusicMondays, #WMWYouth

Tweets, quotes, and replies by @WomensMarch, as well as replies to @WomensMarch and mentions of @WomensMarch are included. 

News Outlets

Tweets from 92 news outlets' Twitter accounts, including @nytimes, @washingtonpost and many U.S. and international accounts. Historical data from each account varies and ranges from 2012 to 2017. 

End of Term Archive (2016)

Tweets and Tumblr blog posts from federal agencies, collected at the end of the Obama presidential administration, through February 2017 and the start of the Trump administration. Collection includes tweets from 2,981 federal agency accounts and 72 Tumblr blogs. 

This collection was created as part of the collaborative End of Term Archive project, which collects websites from the federal web presence at the end of each presidential term. 

115th United State Congress

Tweets from members of Congress, official offices, and other related accounts. More than 1 million tweets from 603 accounts belonging to senators, representatives, congressional causes and committees of the 115th Congress. 

Trump Administration

Tweets from 60 Twitter accounts belonging to the President, Vice President, White House and administration officials, Cabinet members, and political appointments. 

GW Libraries • 2130 H Street NW • Washington DC 20052202.994.6558AskUs@gwu.edu