AWS Public Sector Blog
Tag: open data on AWS
Bridging AI and biology: Inside the AWS and NVIDIA Open Data knowledge graph hackathon
Knowledge graphs (KGs) and large language models (LLMs) are transforming biomedical research, but ensuring artificial intelligence (AI) outputs are trustworthy and evidence-based remains challenging. At the recent AWS and NVIDIA Open Data knowledge graph hackathon, teams of researchers tackled this challenge by developing innovative solutions that combine knowledge graphs with graph-based retrieval-augmented generation (GraphRAG). Read this post to learn more.
Building AI-powered weather forecasting tools with Open Data on AWS
Although traditional weather forecasting methods have served us well, they often require substantial computational resources and time to deliver results. In this post, we explore how Brightband is revolutionizing this field by combining AI with Open Data on AWS to create faster, more accessible, and highly accurate weather forecasting solutions. When data is shared on AWS, anyone can analyze it and build services on top of it. Sharing data in the cloud lets data users such as Brightband spend more time on data analysis rather than data acquisition.
How the Imaging Data Commons migrated 40 million medical images using AWS DataSync
Learn how the National Cancer Institute Imaging Data Commons (IDC) team migrated the Imaging Data Commons data to AWS using AWS DataSync. Plus, learn how to get started with IDC data, which is accessible at no cost through the AWS Open Data Sponsorship Program.
36 new or updated datasets on the Registry of Open Data: AI analysis-ready datasets and more
This quarter, AWS released 36 new or updated datasets. As July 16 is Artificial Intelligence (AI) Appreciation Day, the AWS Open Data team is highlighting three unique datasets that are analysis-ready for AI. What will you build with these datasets?
Alzheimer’s disease research portal enables data sharing and scientific discovery at scale
The National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS DSS), powered by AWS, is a genomic database that provides access to publicly available datasets for Alzheimer’s disease and related neuropathologies. Created to make Alzheimers-genetics knowledge more accessible to researchers, NIAGADS has genomics data on 172,701 samples from 98 datasets and is now 1.3 petabytes (PB) in total size. NIAGADS is creating a system that promotes scientific discovery through data sharing with a large cadre of institutions.
Making weather forecasts more accessible using serverless infrastructure and open data on AWS
As part of the Registry of Open Data on AWS, AWS invited Alexander Rey, creator of Pirate Weather, to share how AWS technologies and open data are supporting his efforts to provide a no cost and open weather forecast API.
NASA and ASDI announce no-cost access to important climate dataset on the AWS Cloud
To assist the science community in conducting studies of climate change impacts at local to regional scales, NASA created the NASA Earth Exchange (NEX) Global Daily Downscaled Projections (GDDP) dataset, or NEX-GDDP-CMIP6. This dataset is expected to enhance public understanding of possible future climate patterns at the spatial scale of individual towns, cities, and watersheds. It provides a set of global, high resolution, bias-corrected climate change projections that can be used to evaluate climate change impacts on processes that are sensitive to finer-scale climate gradients and the effects of local topography on climate conditions. As part of the Amazon Sustainability Data Initiative (ASDI), this dataset is available at no cost on the Registry of Open Data.
22 new or updated open datasets on AWS: New polar satellite data, blockchain data, and more
The AWS Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. The full list of publicly available datasets are on the Registry of Open Data on AWS and are now also discoverable on AWS Data Exchange. This quarter, AWS released 22 new or updated datasets including Amazonia-1 imagery, Bitcoin and Ethereum data, and elevation data over the Arctic and Antarctica. Check out some highlights.
Building resilience: Using technology to prepare for, respond to, and recover from the unexpected
Every day, people around the world are impacted by the unexpected – from pandemics, to natural and human-wrought disasters, to economic crises. Technologies like the cloud can empower communities to prepare for and respond to the unexpected so that when a crisis hits, they can continue to advance. AWS works with customers and partners to build software solutions that improve government and nonprofits’ prediction, preparedness, response, and recovery capabilities—solutions that are being leveraged across Latin America and the Caribbean.
Creating access control mechanisms for highly distributed datasets
Security is priority number one at AWS. Data stored in Amazon Simple Storage Service (Amazon S3) is private by default. However, some datasets are made to be shared. In this blog post, we cover the no-cost mechanisms data providers can utilize to create access control policies for their highly distributed open datasets.









