Open Data on AWS

Share any volume of data with as many people as you want

Learn more about Open Data on AWS at re:Invent 2024 | Register now

When data is shared on AWS, anyone can analyze it and build services on top of it using a broad range of compute and data analytics products, including Amazon EC2, Amazon Athena, AWS Lambda, and Amazon EMR. Sharing data in the cloud lets data users spend more time on data analysis rather than data acquisition.

AWS Data Exchange makes it easy to find datasets made publicly available through AWS services. Browse available data and learn how to register your own datasets.

AWS Public Datasets: Unlocking the Potential of Open Data in the Cloud

Putting data to work on AWS

Examples of how data shared on AWS is accelerating research and creation of new applications.

Allen Institute for Brain Science Shares Research Data on AWS

(1:47)

Element 84 Uses AWS to Process Large Datasets at Scale

(1:19)

SpaceNet Accelerates Geospatial Machine Learning using AWS

(1:20)

What's new

Showing results: 10-12
Total results: 145
  • Date (Newest—Oldest)
  • Date (Oldest—Newest)
  • Title (A—Z)
  • Title (Z—A)

No results found

  • Blog Post

    Alex’s Lemonade Stand Foundation uses AWS to advance cutting-edge pediatric cancer research worldwide

    In 2017, the Alex’s Lemonade Stand Foundation (ALSF) founded the Childhood Cancer Data Lab (Data Lab) to address an important gap in the pediatric cancer field: vast amounts of accumulated data were not being put to use at scale. To address this gap, the Data Lab used AWS to build refine.bio, an openly available collection of normalized bulk gene expression data, to make public datasets interoperable and reusable.
    July 2022
  • Blog Post

    Creating access control mechanisms for highly distributed datasets

    Security is priority number one at AWS. Data stored in Amazon Simple Storage Service (Amazon S3) is private by default. However, some datasets are made to be shared. In this blog post, we cover the no-cost mechanisms data providers can utilize to create access control policies for their highly distributed open datasets.
    June 2022
  • Blog Post

    AWS announces simpler access to sustainability data and launches hackathon to accelerate innovation for sustainability

    Artificial intelligence (AI) and machine learning (ML) are critical tools being used in healthcare research, autonomous applications, predictive maintenance, and also a key tool used to advance sustainability solutions. However, to use AI and ML to solve sustainability problems, innovators need specific datasets that are prepared for analysis and training of the models. To help create and accelerate sustainability solutions, the Amazon Sustainability Data Initiative (ASDI) today announced easier identification of sustainability datasets with integration in AWS Data Exchange and the launch of a sustainability hackathon.
    June 2022
1 49

Key initiatives

  • Featured

    Earth on AWS

    Visit Earth on AWS to learn about building planetary-scale applications in the cloud with open geospatial data.

Benefits of sharing data on AWS

Global community of users

Global community of users

When you share data on AWS, you make it available to a large and growing community of developers, startups, and enterprises around the world.

Reduced time to insight

Reduced time to insight

In AWS, tools to analyze data are only ever a click away, which means you reduce the time it takes for people to start working with your data.

New services and tools

New services and tools

The AWS Cloud expands daily, and data shared on AWS becomes more useful as new features and services are released.

Lower cost of research

Lower cost of research

Researchers can analyze data shared on AWS without needing to pay to store their own copy. They only pay for the compute they use, and do not need to purchase storage to start a project.

Open Data Sponsorship Program

The AWS Open Data Sponsorship Program covers the cost of storage for publicly available high-value cloud-optimized datasets. We work with data providers who seek to:

  • Democratize access to data by making it available for analysis on AWS
  • Develop new cloud-native techniques, formats, and tools that lower the cost of working with data
  • Encourage the development of communities that benefit from access to shared datasets
Learn how to propose your dataset to the Open Data Sponsorship Program