AWS Public Sector Blog
Tag: Registry of Open Data on AWS
Street-scale global maps, orca sounds, and COVID-19 detection data: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. We work with data providers to democratize access to data by making it available to the public for analysis on AWS; to develop new cloud-native techniques, formats, and tools that lower the cost of working with data; and to encourage the development of communities that benefit from access to shared datasets. This quarter, we released 19 new or updated datasets like validated OpenStreetMap data, bioacoustic data, COVID-19 detection data, and more.
Analyze terabyte-scale geospatial datasets with Dask and Jupyter on AWS
Terabytes of Earth Observation (EO) data are collected each day, quickly leading to petabyte-scale datasets. By bringing these datasets to the cloud, users can use the compute and analytics resources of the cloud to reliably scale with growing needs. In this post, we show you how to set up a Pangeo solution with Kubernetes, Dask, and Jupyter notebooks step-by-step on Amazon Web Services (AWS), to automatically scale cloud compute resources and parallelize workloads across multiple Dask worker nodes.
Climate data, koala genomes, analysis ready radar data, and highly-queryable genomic data: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. We work with data providers to democratize access to data by making it available to the public for analysis on AWS; develop new cloud-native techniques, formats, and tools that lower the cost of working with data; and encourage the development of communities that benefit from access to shared datasets. Our full list of publicly available datasets are on the Registry of Open Data on AWS. This quarter, we released 26 new or updated datasets including datasets on climate, koala genomes, analysis ready radar data, and highly-queryable genomic data. Check out some highlights.
Celebrate Open Science Week with the Allen Institute and available open datasets
The Allen Institute seeks to understand how our brains, cells, and immune systems work when we are healthy and, ultimately, how they go wrong in disease. Allen researchers have generated and shared atlases that map the brain, gene-edited stem cell lines, and many more resources that have been used by millions of scientists around the world to accelerate their research. In collaboration with AWS and the Registry of Open Data on AWS, they make many of their datasets publicly available. In celebration of Open Science Week, check out some of these open datasets from the Allen Institute, and their impact on the research community.
Satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. This quarter, we released 44 new or updated datasets including satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings. Learn how you can put these open datasets to work.
Open data on AWS supports sustainable agricultural practices and crop optimization
With a rapidly growing population, the world is increasingly dependent on the ability to develop and maintain sustainable agriculture and healthy environments. Learn how BASF Digital Farming is leveraging both National Oceanic and Atmospheric Administration (NOAA) weather data hosted on AWS, and commercial weather data, to develop digital solutions to help farmers effectively monitor and manage their fields, and help drive farm sustainability.
How the cloud is helping remove barriers to addressing climate change
What if we were to democratize access to data and compute so that anyone, anywhere in the world could contribute to climate science? The Amazon Sustainability Data Initiative (ASDI) seeks to accelerate sustainability research and innovation by minimizing the cost and time required to acquire and analyze large sustainability datasets. ASDI supports innovators and researchers with the data, tools, and technical expertise they need to advance sustainability initiatives. ASDI is committed to making climate-relevant data easier to access and analyze. ASDI’s growing data catalog comprises petabytes of open data.
Now available: CMIP6 dataset to foster climate innovation and study the impact of future climate conditions
Today, Amazon announced that it is now hosting petabytes of data from the largest and most updated climate simulation dataset in the world. Through two cloud grants from the Amazon Sustainability Data Initiative (ASDI) to the Earth System Grid Federation (ESGF), Amazon is enabling climate researchers worldwide to access and analyze the dataset used for the United Nation’s Intergovernmental Panel on Climate Change’s Sixth Assessment Report (IPCC-AR6) on the AWS Cloud. The report—scheduled to be published in May 2022—provides policymakers worldwide with the latest assessment of the scientific basis of climate change, its impacts and future risks, and options for adaptation and mitigation. The climate simulation dataset, also known as the Coupled Model Intercomparison Project Phase 6 (CMIP6) data archive, traditionally hosted and distributed through the ESGF servers, aggregates the climate models created across approximately 30 working groups and 1,000 researchers working on IPCC-AR6.
SARS-CoV-2 viral genomes, storm surge forecasts, cloud-free satellite imagery: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). We work with data providers to democratize access to data by making it available for analysis on AWS; develop new cloud-native techniques, formats, and tools that lower the cost of working with data; and encourage the development of communities that benefit from access to shared datasets. Our full list of publicly available datasets are on the Registry of Open Data on AWS. This quarter, we released 28 new datasets including data on SARS-CoV-2 viral genomes, storm surge forecasts, and US census data. Check out some highlights.
Open data helps recovery in the aftermath of devastating weather events
Severe and extreme weather events not only wreak havoc on lives, property and the economy, but the extent of the destruction and devastation left behind can be difficult to map and quantify. Having high resolution imagery of areas devastated by weather events (hurricanes, tornadoes, floods, and etc.) helps to characterize impacts, formulate needed recovery and response activities, support emergency managers in saving lives, and restart the flow of commerce. NOAA’s data plays a critical role in this process. As part of ASDI, we invited Jena Kent from NOAA’s Big Data Program to share how AWS is helping with disaster response by providing access to aerial data and imagery through open data initiatives.