AWS Government, Education, & Nonprofits Blog

AWS Public Datasets: Unlocking the potential of open data in the cloud

Lydia Ng, Senior Director at the Allen Institute of Brain Science

Lydia Ng, Senior Director at the Allen Institute of Brain Science. Watch the video, “AWS Public Datasets: Unlocking the potential of open data in the cloud.

Sharing data publicly helps accelerate innovation by increasing the number of people who can perform research and derive insights from it. When data is shared in the cloud, researchers are able to work with data without needing to download or store their own copies. This allows users to start analyzing massive amounts of data in minutes, regardless of their location, local storage space availability, or computing capacity.

Through the AWS Public Dataset Program, we collaborate with AWS customers to explore the best ways to stage data for analysis in the cloud. With the Registry of Open Data on AWS, you can find data provided by our customers along with links to tutorials, demos, applications, and publications using the data.

Watch a video on how the Allen Institute for Brain Science, Element84, and Spacenet are supporting research communities by sharing data through the AWS Public Dataset Program.

“AWS has been an essential partner with SpaceNet simply due to the size and scale of the datasets we are open sourcing,” said Ryan Lewis, Senior Vice President, In-Q-Tel – provider SpaceNet dataset on AWS.

Lydia Ng, Senior Director at the Allen Institute of Brain Science explains how sharing their data on AWS makes it “super easy to run a [training] course anywhere in the world.” Learn more about how the Allen Institute uses AWS to provide students with an analytics environment and 100TB of data in minutes.

A guide to sharing data on AWS

"Sharing Data on AWS" guide

Download the “Sharing Data on AWS” guide.

We published a guide on “Sharing Data on AWS” that shares lessons learned about sharing data through the AWS Public Dataset Program. The guide contains practical guidance for anyone who needs to share data, whether publicly or within their own organization, with topics including:

  • Advantages of sharing data in the cloud
  • Preparing data to be shared on Amazon Simple Storage Service (Amazon S3)
  • Non-technical considerations for data sharing programs
  • Customer case studies

Download the guide “Sharing Data on AWS” to learn more.

Are you interested in collaborating with us to drive usage of your data? Check out the AWS Public Dataset Program.

Want to know what other datasets are available? Sign up to be notified of datasets as they’re added by subscribing to our RSS feed.

Learn more about open data on AWS.