AWS Public Sector Blog

Tag: datasets

34 new or updated datasets available on the Registry of Open Data on AWS

The Amazon Web Services (AWS) Open Data Sponsorship Program makes high-value, cloud-optimized datasets publicly available on AWS. Through this program, customers are making more than 100 petabytes (PB) of data available for public use. Read this blog post to learn about the 34 new or updated datasets that were released in the first quarter.

Generative AI: Understand the challenges to realize the opportunities

Generative artificial intelligence (AI) allows anyone to leverage machine learning (ML) capabilities using natural language, and it is extremely intuitive to use. When users can search, analyze, and draw conclusions in seconds from extensive information that exists across their organization or the internet, they can make more informed decisions at speed. This blog post takes a quick look at some of the generative AI considerations public sector organizations need to take into account.

Singapore Eye Research Institute categorizes retinal diseases using Amazon Rekognition

Amazon Rekognition, a code-free automated machine learning (AutoML) service from Amazon Web Services (AWS), showed impeccable diagnostic performance in categorizing various retinal diseases using optical coherence tomography (OCT) scans. This blog post details the steps to use Amazon Rekognition Custom Labels to train a model that categorizes retinal diseases and the process of training and fine-tuning convolutional neural networks (CNNs), the standard deep learning methodology.

Flexibility, cost-savings, and innovation: Kellogg School of Management chooses AWS

At the end of 2022, Northwestern University’s Kellogg School of Management had a decision to make. The on-premises SQL server used by faculty and students had reached the end of its life, and the school needed to identify a cost-effective way forward while ensuring that the datasets would remain highly available for researchers to use on demand. After weighing various options, Kellogg worked with Amazon Web Services (AWS) to create a data lake that fit its unique needs.

Improve road safety by analyzing traffic patterns with no-code ML using Amazon SageMaker Canvas

To improve safety and convenience, transportation agencies amass a substantial volume of data. However, poor data quality and occasional missing information make it difficult for these organizations to validate the accuracy of that data. With the incorporation of new artificial intelligence and machine learning capabilities from Amazon Web Services (AWS), they can take advantage of no-code solutions to identify and address data gaps.

34 new or updated datasets available on the Registry of Open Data on AWS

This quarter, AWS released 34 new or updated datasets on the Registry of Open Data. What will you build with these datasets? Read through this blog post for inspiration.

36 new or updated datasets on the Registry of Open Data: AI analysis-ready datasets and more

This quarter, AWS released 36 new or updated datasets. As July 16 is Artificial Intelligence (AI) Appreciation Day, the AWS Open Data team is highlighting three unique datasets that are analysis-ready for AI. What will you build with these datasets?

Alzheimer’s disease research portal enables data sharing and scientific discovery at scale

The National Institute on Aging Genetics of Alzheimer’s Disease Data Storage Site (NIAGADS DSS), powered by AWS, is a genomic database that provides access to publicly available datasets for Alzheimer’s disease and related neuropathologies. Created to make knowledge of Alzheimer’s genetics more accessible to researchers, NIAGADS has genomics data on 172,701 samples from 98 datasets and is now 1.3 petabytes (PB) in total size. NIAGADS is creating a system that promotes scientific discovery through data sharing with a large cadre of institutions.

Largest metastatic cancer dataset now available at no cost to researchers worldwide

The NYUMets team, led by Dr. Eric Oermann at NYU Langone Medical Center, is collaborating with AWS Open Data, NVIDIA, and Medical Open Network for Artificial Intelligence (MONAI) to develop an open science approach that supports researchers in helping as many patients with metastatic cancer as possible. With support from the AWS Open Data Sponsorship Program, the NYUMets: Brain dataset is now openly available at no cost to researchers around the world.

How JDRF uses AWS to power Type 1 diabetes research

Advances in technology are transforming the way health research can be conducted. It is now possible to integrate data from siloed sources into a data lake, a central repository where health data are aggregated and analyzed at scale. Now, more than ever, there are opportunities for collaborative research to accelerate life-saving medical innovation – and that’s exactly what JDRF International, the leading global Type 1 diabetes research and advocacy organization, is doing with AWS.