This guide will point to resources that identify and track steps taken by the Trump administration and Congress to scale back or eliminate access to federal government information. It also provides links to groups performing data and website rescue.
Data & Website Rescue Efforts
- End of Term Web ArchiveCaptures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, and 2020.
- Wayback Machine - by the Internet ArchiveExplore more than 928 billion web pages saved over time. Install the Official Wayback Machine Extension to easily save websites, view missing 404 Not Found pages, or read archived books and papers. Internet Archive.
- Archive of Data.govFiles in this repository were collected intermittently between 2024-11-19 and 2025-02-06. Beginning on 2025-02-06, the repository will be updated daily. Announcing the Data.gov Archive
Harvard Law School Library Innovation Lab.
- Data Rescue ProjectThe Data Rescue Project is a coordinated effort among a group of data organizations, including IASSIST, RDAP, and members of the Data Curation Network. Serves as a clearinghouse for data rescue-related efforts and data access points for public US governmental data that are currently at risk. See what data is currently being saved with their new Data Rescue Tracker. Follow on Data Rescue Project - Bluesky Social.
- DataLumosDataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. They have an older version of many major datasets, including some from the CDC. This is the main repository for Data Rescue Project's data with added data from FEMA, the Department of Education, and IMLS.
- Find Lost DataThis search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- ICPSRMaintains a data archive of more than 350,000 files of research in the social and behavioral sciences. It hosts 23 specialized collections of data in education, aging, criminal justice, substance abuse, terrorism, and other fields. The Inter-university Consortium for Political and Social Research (ICPSR) is an American political science and social science research consortium, based at the University of Michigan.
- IPUMSThe IPUMS Center for Data Integration, based at the University of Minnesota, provides census and survey data from around the world. Find microdata from the U.S. Decennial Census, the American Community Survey (ACS), and the Current Population Survey. As part of their standard procedures, they download and preserve original data from U.S. statistical agencies that serve as the source data for IPUMS. Since Fri., Jan. 31, 2025, several organizations (and individuals) have downloaded many other public federal datasets. There are efforts underway to catalog and make these data available.
- GovTrackTracks the activities of the United States Congress.
- Archive-itThe Federal Depository Library Program (FDLP) Web Archive. Harvested by Library Services and Content Management (LSCM) and U.S. Government Publishing Office (GPO). The FDLP was established by Congress in 1895 to provide free, permanent public access to federal government information. GPO administers the FDLP on behalf of the participating libraries and the American people. To provide permanent public access to federal agency web content, the FDLP harvests selected U.S. Government web sites in their entirety. Access to these sites is made available through links in the online public access catalog, the Catalog of U.S. Government Publications.
- Climate & Economic Justice Screening ToolThe tool has an interactive map and uses datasets that are indicators of burdens communities experience in eight categories: climate change, energy, health, housing, legacy pollution, transportation, water and wastewater, and workforce development. This tool has been downloaded and rehosted in advance of the Trump administration's takedown of this invaluable website and data resource. Article
- DryadAn open data publishing platform and a community committed to the open availability and routine re-use of all research data. Generalist repository available to help with data publication, storage, and preservation.
- Roper CenterHas collected over 50,000 files (datasets and documentation) from 22 federal survey projects. Efforts to this point have been focused on acquiring the files and ensuring backup copies are preserved on multiple servers. Cornell University.
- Data Liberation ProjectAn initiative to identify, obtain, reformat, clean, document, publish, and disseminate government datasets of public interest.
- Big Local NewsGathers data, builds tools and collaborates with reporters to provide access to public records. Some projects have been archived by the Stanford Digital Repository, which ensures their long-term preservation.
- MuckRockProvides a repository of hundreds of thousands of pages of original government materials, information on how to file FOIA requests and tools to make the requesting process easier. Fill out their form to suggest FOIAs based on missing data sets.
- r/Data HoarderA reddit community that is coordinating efforts to rescue data.
- Safeguarding ResearchBased in the EU, USA, and globally - this initiative has access to 1-2 PB (and more on the way) of storage & people willing to seed. Have several large-scale efforts, including a 350GB web archive of CDC, including all 30.000 files from archive.cdc.gov and much more. There is a forum you can join.
- Archive TeamAn offloading point and information depot for a number of archiving projects, all related to saving websites or data that is in danger of being lost, including archiving the federal government. They maintain US Government data page.
- Data HoardingAn index of resources and archives related to data hoarding, web archival and self hosting. It was inspired by the recent purge of online information by government agencies, corporations and others, and aims to provide easier access to tools and information.
- Data Rescues 2025In response to political threats to social, environmental, health, and personal data, the University of Washington Center for Advances in Libraries, Museums, and Archives (CALMA) in collaboration with Seattle-based BKS Studio, is hosting a series of DATA RESCUE efforts.
- The Original DataRescue Workflow (2017)For reference; many of the tools no longer are supported.
- IMLSDirect link to Institute of Museum and Library Services (IMLS) archived data in DataLumos.
- Digital Government HubA dynamic, open-source reference library for anyone using design, data, and technology to improve and enhance government service delivery.
- Webrecorder US Government Web ArchiveHas archived a large number of government sites both independently and as one of the End of Term Archive partners. Webrecorder US Government Web Archive is web archiving with Browsertrix.
- ZenodoAn open dissemination research data repository for the preservation and making available of research, educational and informational content. Access to Zenodo’s content is open to all, for non-military purposes only. CERN.
Education
- Journals No Longer Being Indexed By ERICCrowdsourcing effort to figure out which journals are being removed from ERIC. From Michigan State University.
- Department of EducationDirect link to Department of Education archived data in DataLumos.
- Integrated Postsecondary Education Data System (IPEDS) SeriesComplete data files from 1980 to 2023. Includes data file, STATA data file, SPSS program, SAS program, STATA program, and dictionary. The IPEDS surveys most postsecondary institutions annually, including universities and colleges, as well as institutions offering technical and vocational education beyond the high school level. Allows users to compare the characteristics of different postsecondary institutions. Older IPEDS data are also available through the central ICPSR archive. Inter-university Consortium for Political and Social Research (ICPSR).
Environment & Climate
- Climate & Economic Justice Screening ToolThe tool has an interactive map and uses datasets that are indicators of burdens communities experience in eight categories: climate change, energy, health, housing, legacy pollution, transportation, water and wastewater, and workforce development. This tool has been downloaded and rehosted in advance of the Trump administration's takedown of this invaluable website and data resource. Article
- EPA's EJScreen 2.3An unofficial copy of EJScreen hosted by the Public Environmental Data Partners.
- Public Environmental Data ProjectCommitted to preserving and providing public access to federal environmental data. They have identified 57 high-priority databases, of which 37 have been archived thus far [February 2025].
- DataverseData uploaded by the Climate Change and Health Research Coordinating Center (CAFE)
Includes CDC's Social Vulnerability Index data.
Most of what's being placed here is data focusing on health and the environment. Harvard College. - DataRefuge DataVerseDataRefuge is also an initiative committed to identifying, assessing, prioritizing, securing, and distributing reliable copies of federal climate and environmental data so that it remains available to researchers. Harvard College.
- EDGIA research collaborative and network of diverse professionals promoting evidence-based policy-making and public interest science that advances the Environmental Right to Know (ERTK). EDGI's Federal Environmental Web Tracker. Environmental Data & Governance Initiative (EDGI).
- The Climate Mirror ProjectTrying to mirror and safely archive U.S. Govt. websites and datasets related to climate, climate change, and global warming.
- Open Energy Data Initiative (OEDI)A centralized repository of datasets aggregated from the U.S. Department of Energy’s Programs, Offices, and National Laboratories.
- PublicData - UC Santa Barbara Letters & Science ITMirrored and archived public data on locally hosted git server. Includes retrieved data sets from CDC, DoE, NIH, and NOAA.
- EPA Risk Management Program DatabaseA recently updated version of the EPA’s Risk Management Program, with submissions through December 2024. It includes risk management plans filed by facilities with extremely hazardous substances. Data Liberation Project (DLP).
- Healthy Regions & Policies (HeRoP) Lab - U of Illinois Urbana-ChampaignPreserved datasets and guidances include: The Center for Disease Control (CDC); The Environmental Protection Agency (EPA); The Health Resources and Services Administration (HRSA). Available via Box.
- NOAA Heat-Index FilesInternet Archive.
- Climate Change and Human Health Literature Portal (CCHHL) data dumpHere is gathered bibliographic information about 22,695 research items (journal articles etc.) as compiled by the National Institute of Environmental Health Sciences (NIEHS); part of the US federal government's National Institutes of Health (NIH) for its Climate Change and Human Health Literature Portal (CCHHL). Internet Archive.
- FEMA filesDirect link to Federal Emergency Management Agency (FEMA) archived data in DataLumos.
- Find Lost DataThis search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- Climate Program PortalTracks climate investments from the Infrastructure Investment and Jobs Act (IIJA) and the Inflation Reduction Act (IRA). Includes many of the latest publicly available datasets in the climate space, including emissions data, disadvantaged community designations, public investments, and climatic event information. Focused on preserving federal agency resources that would be removed with the new administration, archiving fact sheets, program guidance, toolkits, and more. Atlas Public Policy.
- Data Rescues 2025In response to political threats to social, environmental, health, and personal data, the University of Washington Center for Advances in Libraries, Museums, and Archives (CALMA) in collaboration with Seattle-based BKS Studio, is hosting a series of DATA RESCUE efforts.
Public Health
- CDC Datasets on Internet ArchiveCDC datasets uploaded before January 28th, 2025.
- RestoredCDC.org"We are developing code to pull CDC pages which were archived by prior to January 20, 2025. Similar archives have been created by the End of Term (https://eotarchive.org) project and are hosted by the Wayback Machine (https://web.archive.org). The individual pages are archived, but links between them are broken and the pages are not easy to locate through web searches. Therefore, we will re-build the links between the pages, to create a site that can be navigated the same way the pre-January 21, 2025 CDC site. The only changes we will make on these pages is to add a header that indicates that this site is not a CDC website. Because of the complex navigation between pages, we will also include a button to report problems in this header. Our goal is to provide a mirror site that provides the same information and user experience as the previous CDC website. Some functionality, such as videos, was not archived and therefore will not work on our site."
- FAKE - CDC Clone SiteA CDC clone site with false vaccine claims is hosted by an NGO once led by the current HHS Secretary. With CDC logos, real social media links, and a near-identical design, it may violate federal laws. Substack of InfoEPI Lab.
- STAT NewsMaintains an ongoing blog post that monitors and documents the changes in CDC data. STAT also has begun an effort to download and archive all available files from data.cdc.gov.
- DataverseData uploaded by the Climate Change and Health Research Coordinating Center (CAFE)
Includes CDC's Social Vulnerability Index data. Most of what's being placed here is data focusing on health and the environment. Harvard College. - SAMSHADirect link to Substance Abuse and Mental Health Administration (SAMSHA) archived data in DataLumos.
- Healthy Regions & Policies (HeRoP) Lab - U of Illinois Urbana-ChampaignPreserved datasets and guidances include: The Center for Disease Control (CDC); The Environmental Protection Agency (EPA); The Health Resources and Services Administration (HRSA). Available via Box.
- PublicData - UC Santa Barbara Letters & Science ITMirrored and archived public data on locally hosted git server. Includes retrieved data sets from CDC, DoE, NIH, and NOAA.
- ACASignups.netLinks to archived versions of every CDC government page (Parts 1 through 15).
- Find Lost DataThis search tool provides links for data that have been downloaded from federal, state, and local agencies. Boston University School of Public Health.
- Data Rescues 2025In response to political threats to social, environmental, health, and personal data, the University of Washington Center for Advances in Libraries, Museums, and Archives (CALMA) in collaboration with Seattle-based BKS Studio, is hosting a series of DATA RESCUE efforts.
- Climate Change and Human Health Literature Portal (CCHHL) data dumpHere is gathered bibliographic information about 22,695 research items (journal articles etc.) as compiled by the National Institute of Environmental Health Sciences (NIEHS); part of the US federal government's National Institutes of Health (NIH) for its Climate Change and Human Health Literature Portal (CCHHL). Internet Archive.
- American College of Obstetricians and Gynecologists (ACOG)Hosting copies of immunization schedules and contraceptive use guidance from the CDC.
- Reproductive Rights ArchiveArchived content from the U.S. Department of Justice website.
- National Center for Biotechnology Information (NCBI)The 1000 Genomes Project is mirrored in The International Genome Sample Resource (IGSR). European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI). Provides open access.
- Resources and LinksVarious individuals and organizations have worked to archive / save data from the NIH, CDC, and other websites. This page lists many of those entities.
Museums and Libraries
- IMLS data sets on DataLumosInter-university Consortium for Political and Social Research (ICPSR).
USAID
- DHS Spatial Data RepositoryData at the country and country sub-division levels that are part of USAID's Demographic Health Survey (DHS). This collection includes geographically-linked health and demographic data from the DHS Program and the U.S. Census Bureau for mapping in a geographic information system (GIS). The data includes indicators related to: fertility, family planning, maternal and child health, gender, HIV/AIDS, literacy, malaria, nutrition, and sanitation. DataLumos.
- USAID Documents Mirror3,000+ (possibly all) documents from the United States Agency for International Development (USAID). Internet Archive.
- US Foreign AssistanceState Department and USAID data retrieved from ForeignAssistance.gov. ICPSR.
- DHS Indicator DataSummary data for countries, country subdivisions, and demographic categories that were generated from USAID's Demographic Health Survey (DHS). The indicators are population-level estimates that were generated from sample surveys that were conducted in over 90 low and middle income countries at various points over several decades. Almost 2,000 indicators capture information related to: fertility, family planning, maternal and child health, gender, HIV/AIDS, literacy, malaria, nutrition, and sanitation. DataLumos.
- DHS API data rescueData were retrieved from the DHS Program indicator data API for years 1985-2023. openICPSR.
Last Updated: Apr 11, 2025 5:14 PM
URL: https://libguides.umn.edu/govpubs/admin