16TB Archive of US Federal Public Datasets Released

Harvard Law School researchers have released a 16TB archive of more than 311,000 datasets, a complete copy of data.gov from 2024 and 2025. The project aims to preserve the integrity and authenticity of the data by maintaining detailed metadata and digital signatures, which makes it easier for researchers and the public to cite and access the information over time. The team has also released open-source software and documentation so that others can replicate the work and build similar repositories. The project is supported by the Filecoin Foundation and the Rockefeller Brothers Fund.