Posted On: Jan 14, 2021
Eighteen new or updated datasets from Illumina, the University of Alaska Fairbanks, IntelinAir, and others are available on the Registry of Open Data in the following categories.
Life sciences:
- 1000 Genomes Phase 3 Reanalysis with DRAGEN 3.5 from Illumina, Inc.
- BindingDB managed by Amazon Web Services (AWS)
- Sounds of Central African Landscapes from the Center for Conservation Bioacoustics, Cornell University
- Updated: Genome In A Bottle from the National Institute of Standards and Technology
Geospatial:
- AgricultureVision from IntelinAir
- Copernicus Digital Elevation Model managed by Sinergise
- High Resolution Downscaled Climate Data for Southeast Alaska from the University of Alaska Fairbanks
Climate and weather:
- IDEAM Colombian Radar Network from IDEAM
- Global Forecast System Warm Start Initial Conditions from the National Oceanic and Atmospheric Administration (NOAA)
- Unified Forecast System Subseasonal to Seasonal prototype 5 from NOAA
- WRF Downscaled Coupled Model Intercomparison Project 6 (CMIP6) from UCLA
- Updated: High-Resolution Rapid Refresh Archive from NOAA and in Zarr format managed by the University of Utah
- Updated: Coupled Model Intercomparison Project 6 (CMIP6) in NetCDF format managed by the Earth System Grid Federation (ESGF)
- Updated: National Water Model Reanalysis in Zarr format from NOAA
Machine learning:
- DialoGLUE from AWS
- Natural Scenes Dataset from the University of Minnesota
- Sophos/ReversingLabs 20 Million Malware Detection Dataset from Sophos AI
- CoversBR - A Large Dataset for Cover Song Identification from Dirceu G Silva
The AWS Open Data Sponsorship Program covers the cost of storage for publicly available, high-value, cloud-optimized datasets. We work with data providers who seek to:
- Democratize access to data by making it available for analysis on AWS
- Develop new cloud-native techniques, formats, and tools that lower the cost of working with data
- Encourage the development of communities that benefit from access to shared datasets
Learn how to propose your dataset to the AWS Open Data Sponsorship Program.