AWS Public Sector Blog
Tag: registry of open data
OpenFold, OpenAlex catalog of scholarly publications, and Capella Space satellite data: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. Our full list of publicly available datasets are on the Registry of Open Data on AWS and are now also discoverable on AWS Data Exchange. This quarter, we released 15 new or updated datasets including OpenFold, OpenAlex, and radar data from Capella Space. Check out some highlights from the new or updated datasets.
Creating access control mechanisms for highly distributed datasets
Security is priority number one at AWS. Data stored in Amazon Simple Storage Service (Amazon S3) is private by default. However, some datasets are made to be shared. In this blog post, we cover the no-cost mechanisms data providers can utilize to create access control policies for their highly distributed open datasets.
Downscaled CMIP5, 1950 US Census, and open genomics data for Galaxy: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). Our full list of publicly available datasets are on the Registry of Open Data on AWS. This quarter, we released 13 new or updated datasets including CMIP5, 1950s US Decennial Census, and open genomics data for Galaxy. Read on for some highlights.
Bringing world-class satellite imagery to smallholder farmers with open data
As part of the Amazon Sustainability Data Initiative (ASDI), AWS invited Nils Helset, co-founder and chief executive officer (CEO) of DigiFarm, to share how AWS Cloud technology and open data support DigiFarm’s efforts in precision farming to make agricultural practices more sustainable and efficient.
Preventing the next pandemic: How researchers analyze millions of genomic datasets with AWS
How do we avoid the next global pandemic? For researchers collaborating with the University of British Columbia Cloud Innovation Center (UBC CIC), the answer to that question lies in a massive library of genetic sequencing data. But there is a problem: the data library is so massive that traditional computing can’t comprehensively analyze or process it. So the UBC CIC team collaborated with computational virologists to create Serratus, an open-science viral discovery platform to transform the field of genomics—built on the massive computational power of the Amazon Web Services (AWS) Cloud.
Street-scale global maps, orca sounds, and COVID-19 detection data: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. We work with data providers to democratize access to data by making it available to the public for analysis on AWS; to develop new cloud-native techniques, formats, and tools that lower the cost of working with data; and to encourage the development of communities that benefit from access to shared datasets. This quarter, we released 19 new or updated datasets like validated OpenStreetMap data, bioacoustic data, COVID-19 detection data, and more.
Analyze terabyte-scale geospatial datasets with Dask and Jupyter on AWS
Terabytes of Earth Observation (EO) data are collected each day, quickly leading to petabyte-scale datasets. By bringing these datasets to the cloud, users can use the compute and analytics resources of the cloud to reliably scale with growing needs. In this post, we show you how to set up a Pangeo solution with Kubernetes, Dask, and Jupyter notebooks step-by-step on Amazon Web Services (AWS), to automatically scale cloud compute resources and parallelize workloads across multiple Dask worker nodes.
Celebrate Open Science Week with the Allen Institute and available open datasets
The Allen Institute seeks to understand how our brains, cells, and immune systems work when we are healthy and, ultimately, how they go wrong in disease. Allen researchers have generated and shared atlases that map the brain, gene-edited stem cell lines, and many more resources that have been used by millions of scientists around the world to accelerate their research. In collaboration with AWS and the Registry of Open Data on AWS, they make many of their datasets publicly available. In celebration of Open Science Week, check out some of these open datasets from the Allen Institute, and their impact on the research community.
How open data from weather radar helps scientists improve environmental understanding
Weather radars see more than just the weather: they see smoke from fires, meteors, birds, mayflies, and almost anything else in the atmosphere. This makes weather radars an invaluable tool for scientists seeking to further the understanding of atmospheric processes and anything else that happens to be flying through the radar’s field of view. The Amazon Sustainability Data Initiative (ASDI) seeks to accelerate sustainability-related innovation and research by helping to minimizing the cost and time required to store, acquire, and analyze large weather and climate datasets.
Satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. This quarter, we released 44 new or updated datasets including satellite imagery over Africa, a large-scale climate ensemble, and product listings with 3D renderings. Learn how you can put these open datasets to work.