Skip to Main Content

Data Preservation and Data Sources

Overview

The Research Data Management Task Force is a collaboration between Libraries and Academic Innovation, Himmelfarb Health Sciences Library, GW Information Technology, and the Office of the Vice Provost for Research.

Data Preservation Tools

These data preservation organizations focus on preserving data:

  • ArchiveBox has archived datasets from data.gov, CIBP, USCIS, NOAA, NASA, NSIDC
  • Archive Team is focusing on archiving datasets from the U.S. Government
  • The Federal Environmental Web Tracker tracks website changes related to climate, energy, and the environment.
  • GitHub: Awesome Datahoarding provides lists of tools for web harvesting
  • GovDiff shows side-by-side comparisons of government website changes
  • MIT Libraries: Data Management Checklist provides a checklist for curating data rescue efforts
  • r/Data Hoarder is a subreddit of data preservation activists
  • Safeguarding Research Discourse Group is a preservation group hosted outside of the US
  • WebRecorder has archived 8TB+ of government sites
2130 H Street NWWashington DC 20052202.994.6558