AWS News Blog

The AWS Report – Lisa Green of Common Crawl

In the latest episode of The AWS Report, I spoke with Lisa Green of Common Crawl to learn more about what they do and how they use AWS.

The Common Crawl data is available in the form of an AWS Public Data Set. If you are planning to process this large (81 TB) data set, you may also want to take a look at the Common Crawl Index and the Common Crawl Tutorial.

— Jeff;

Jeff Barr

Jeff Barr is Chief Evangelist for AWS. He started this blog in 2004 and has been writing posts just about non-stop ever since.