AWS Public Sector Blog

Tag: registry of open data

Analyze terabyte-scale geospatial datasets with Dask and Jupyter on AWS

Terabytes of Earth Observation (EO) data are collected each day, quickly leading to petabyte-scale datasets. By bringing these datasets to the cloud, users can use the compute and analytics resources of the cloud to reliably scale with growing needs. In this post, we show you how to set up a Pangeo solution with Kubernetes, Dask, and Jupyter notebooks step-by-step on Amazon Web Services (AWS), to automatically scale cloud compute resources and parallelize workloads across multiple Dask worker nodes.

Read More

Celebrate Open Science Week with the Allen Institute and available open datasets

The Allen Institute seeks to understand how our brains, cells, and immune systems work when we are healthy and, ultimately, how they go wrong in disease. Allen researchers have generated and shared atlases that map the brain, gene-edited stem cell lines, and many more resources that have been used by millions of scientists around the world to accelerate their research. In collaboration with AWS and the Registry of Open Data on AWS, they make many of their datasets publicly available. In celebration of Open Science Week, check out some of these open datasets from the Allen Institute, and their impact on the research community.

Read More

How open data from weather radar helps scientists improve environmental understanding

Weather radars see more than just the weather: they see smoke from fires, meteors, birds, mayflies, and almost anything else in the atmosphere. This makes weather radars an invaluable tool for scientists seeking to further the understanding of atmospheric processes and anything else that happens to be flying through the radar’s field of view. The Amazon Sustainability Data Initiative (ASDI) seeks to accelerate sustainability-related innovation and research by helping to minimizing the cost and time required to store, acquire, and analyze large weather and climate datasets.

Read More

Satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings: The latest open data on AWS

The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. This quarter, we released 44 new or updated datasets including satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings. Learn how you can put these open datasets to work.

Read More

NYU Langone Center increases MRI accessibility through cooperative data sharing and research

About 40 million MRI scans are performed in the United States every year. MRIs are a valuable part of diagnostic plans, but as they exist today, they may not always be a part of a patient’s care plan. A research team at the New York University (NYU) Langone Center set out to make MRIs more accessible for more patients by using artificial intelligence (AI), machine learning (ML), and the power of cooperative open data sharing.

Read More
Lydia Ng, Senior Director at the Allen Institute of Brain Science

AWS Public Datasets: Unlocking the potential of open data in the cloud

Sharing data publicly helps accelerate innovation by increasing the number of people who can perform research and derive insights from it. When data is shared in the cloud, researchers are able to work with data without needing to download or store their own copies. This allows users to start analyzing massive amounts of data in minutes, regardless of their location, local storage space availability, or computing capacity. Through the AWS Public Dataset Program, we collaborate with AWS customers to explore the best ways to stage data for analysis in the cloud.

Read More