
Overview
The End of Term Web Archive (EOT) captures and saves U.S. Government websites at the end of presidential administrations. The EOT has thus far preserved websites from administration changes in 2008, 2012, 2016, 2020 and 2024. Data from these web crawls have been made openly available in several formats in this dataset.
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
- Description: Web Archive Crawl Data (WARC and ARC formats)
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::eotarchive
- AWS region: us-east-1
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://eotarchive/
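The unsigned CLI listing above corresponds to an anonymous S3 ListObjectsV2 request, so the same top-level listing can be sketched without the AWS CLI using only the Python standard library. This is a minimal illustration, not an official client: the function names are hypothetical, the bucket and region are taken from the listing above, and it assumes the bucket accepts unsigned HTTPS list requests in the same way it accepts the `--no-sign-request` CLI call.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

BUCKET = "eotarchive"   # bucket name from arn:aws:s3:::eotarchive
REGION = "us-east-1"    # region stated in the listing
S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}  # S3 response XML namespace


def listing_url(prefix="", max_keys=1000):
    """Build an unsigned ListObjectsV2 URL for the bucket (hypothetical helper)."""
    query = urllib.parse.urlencode({
        "list-type": "2",      # selects the ListObjectsV2 API
        "delimiter": "/",      # group keys into folder-like prefixes
        "prefix": prefix,
        "max-keys": str(max_keys),
    })
    return f"https://{BUCKET}.s3.{REGION}.amazonaws.com/?{query}"


def parse_listing(xml_text):
    """Return (sub_prefixes, [(key, size_in_bytes), ...]) from a ListObjectsV2 response."""
    root = ET.fromstring(xml_text)
    prefixes = [
        p.findtext("s3:Prefix", namespaces=S3_NS)
        for p in root.findall("s3:CommonPrefixes", S3_NS)
    ]
    objects = [
        (c.findtext("s3:Key", namespaces=S3_NS),
         int(c.findtext("s3:Size", namespaces=S3_NS)))
        for c in root.findall("s3:Contents", S3_NS)
    ]
    return prefixes, objects


def list_bucket(prefix=""):
    """Fetch and parse one page of results; requires network access."""
    with urllib.request.urlopen(listing_url(prefix), timeout=30) as resp:
        return parse_listing(resp.read())


if __name__ == "__main__":
    sub_prefixes, objects = list_bucket()
    for name in sub_prefixes:
        print("PRE", name)
    for key, size in objects:
        print(size, key)
```

Only the first page of results (up to `max_keys` entries) is returned; a fuller client would follow the continuation token that S3 includes in truncated responses.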
Support
Contact
Mark Phillips (mark.phillips@unt.edu), Sawood Alam (sawood@archive.org)
Managed By
How to cite
End of Term Web Archive Dataset was accessed on DATE from https://registry.opendata.aws/eot-web-archive.
License
There are no restrictions on the use, access, and/or download of data from the End of Term Web Archive Dataset. We request that you cite the End of Term Web Archive project when using the data provided from this dataset.
Creative Commons Zero
