AWS Public Sector Blog
Tag: open data
Alex’s Lemonade Stand Foundation uses AWS to advance cutting-edge pediatric cancer research worldwide
In 2017, the Alex’s Lemonade Stand Foundation (ALSF) founded the Childhood Cancer Data Lab (Data Lab) to address an important gap in the pediatric cancer field: vast amounts of accumulated data were not being put to use at scale. To address this gap, the Data Lab used AWS to build refine.bio, an openly available collection of normalized bulk gene expression data, to make public datasets interoperable and reusable.
OpenFold, OpenAlex catalog of scholarly publications, and Capella Space satellite data: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. Our full list of publicly available datasets is on the Registry of Open Data on AWS, and the datasets are now also discoverable on AWS Data Exchange. This quarter, we released 15 new or updated datasets, including OpenFold, OpenAlex, and radar data from Capella Space. Check out some highlights from the new or updated datasets.
Creating access control mechanisms for highly distributed datasets
Security is priority number one at AWS. Data stored in Amazon Simple Storage Service (Amazon S3) is private by default. However, some datasets are made to be shared. In this blog post, we cover the no-cost mechanisms data providers can utilize to create access control policies for their highly distributed open datasets.
How Natural Resources Canada migrated petabytes of geospatial data to the cloud
Since 1971, the Canada Centre for Mapping and Earth Observation (CCMEO) at Natural Resources Canada (NRCan) has accumulated an Earth observation (EO) data archive in excess of two petabytes (PB). NRCan wanted to modernize its geospatial offerings at a faster pace, so it turned to the AWS Snow Family to migrate its large volume of data.
Climate Next: How sustainability champions around the world use cloud-powered tech to fight climate change
The theme of this year’s Earth Day is “Invest in our planet.” In celebration of this year’s theme, we want to highlight the important sustainability work featured in the AWS documentary series, Climate Next, which explores the ways organizations and communities in four distinct regions are investing in cloud-powered solutions to fight climate change.
Accelerating new materials design with open data on AWS
The Materials Project at Lawrence Berkeley National Laboratory (LBNL) is an open database that offers information about the properties of materials: the elements and substances that make up the products we use every day. By harnessing the Department of Energy’s (DOE) high-performance scientific computing and state-of-the-art electronic structure methods, the Materials Project provides open web-based access on AWS to computational datasets on both known and potential materials, along with powerful analysis tools to help discover, inspire, and design new materials.
Downscaled CMIP5, 1950 US Census, and open genomics data for Galaxy: The latest open data on AWS
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on Amazon Web Services (AWS). Our full list of publicly available datasets is on the Registry of Open Data on AWS. This quarter, we released 13 new or updated datasets, including CMIP5, the 1950 US Decennial Census, and open genomics data for Galaxy. Read on for some highlights.
Predicting global biodiversity patterns in Costa Rica with ecosystem modeling on AWS
As part of the Amazon Sustainability Data Initiative (ASDI), AWS invited Rafael Monge Vargas, director of the National Center of GeoEnvironmental Information (CENIGA) at Costa Rica’s Ministry of Environment and Energy (MINAE), to share how his team is helping advance conservation and economic development in Costa Rica and how they use ASDI and AWS to support these efforts.
From open data to machine learning, making 1950 Census data available with AWS
On April 1, the US National Archives and Records Administration (NARA) released the 1950 Census data to the general public. Census data is released 72 years after a census is conducted, and it has been 10 years since the 1940 Census data was publicly released. With the support of cloud technologies, this release marks a number of important firsts. AWS is honored to support the release of the 1950 Census and help make this data available to the public.
Bringing world-class satellite imagery to smallholder farmers with open data
As part of the Amazon Sustainability Data Initiative (ASDI), AWS invited Nils Helset, co-founder and chief executive officer (CEO) of DigiFarm, to share how AWS Cloud technology and open data support DigiFarm’s efforts in precision farming to make agricultural practices more sustainable and efficient.