
Sold by: Amazon Web Services
Deployed on AWS
Use Common Crawl (AWS Data Exchange for AWS Lake Formation Test Product) to understand how to interact with data made available via AWS Lake Formation.
Overview
Open this product detail page to access the AWS Open Data version of this data set. This product contains a snapshot of as of November 2022 of Common Crawl, a corpus of web crawl data composed of over 50 billion web pages.
For instructions on how to use this product, please follow the steps in AWS's free, public Workshop: Query third-party data with AWS Lake Formation
Details
Sold by
Categories
Delivery method
Deployed on AWS
New
Introducing multi-product solutions
You can now purchase comprehensive solutions tailored to use cases and industries.
Features and programs
Financing for AWS Marketplace purchases
AWS Marketplace now accepts line of credit payments through the PNC Vendor Finance program. This program is available to select AWS customers in the US, excluding NV, NC, ND, TN, & VT.
Pricing
This product is available free of charge. Free subscriptions have no end date and may be canceled any time.
Additional AWS infrastructure costs may apply. Use the AWS Pricing Calculator to estimate your infrastructure costs.
Vendor refund policy
No refunds
How can we make this page better?
We'd like to hear your feedback and ideas on how to improve this page.
Legal
Vendor terms and conditions
Upon subscribing to this product, you must acknowledge and agree to the terms and conditions outlined in the vendor's End User License Agreement (EULA) .
Content disclaimer
Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.
Additional details
You will receive access to the following data sets.
Data set name | Type | Historical revisions | Future revisions | Sensitive information | Data dictionaries | Data samples |
|---|---|---|---|---|---|---|
Common Crawl | All historical revisions | All future revisions | Not included | Not included |
Similar products

Protect against common vulnerabilities and exposures (CVE). CVE Rules for AWS WAF provides protection for high profile CVEs targeting the following systems: Apache, Apache Struts, Bash, Elasticsearch, IIS, JBoss, JSP, Java, Joomla, MySQL, Node.js, PHP, PHPMyAdmin, Perl, Ruby On Rails, and WordPress.
Common Room helps teams capture and act on every buying signal in one place. It empowers go-to-market teams with AI-driven insights for effective outreach.
Common Fate secures cloud identities, enforces least privilege, and provides full visibility for proactive risk management.
The Deloitte HHS NextGen Common Noticing Solution is a cloud-based notice generation and management service designed for government and public sector organizations

This data package contains the National Drug Code to Healthcare Common Procedure Coding System Crosswalks with conversion factors and dates changes.