
    NOAA Cloud Optimized Zarr Reference Files (Kerchunk)

    Open data | Deployed on AWS

    Overview

    This repository contains references to datasets published to the NOAA Open Data Dissemination Program. These reference datasets serve as index files to the original data by mapping to the [Zarr V2 specification](https://zarr-specs.readthedocs.io/en/latest/v2/v2.0.html). When multidimensional model output is read through Zarr, data can be lazily loaded (i.e., only the data chunks needed for processing are retrieved), and data reads can be scaled horizontally to optimize object storage read performance.

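    The process used to optimize the data is called [kerchunk](https://fsspec.github.io/kerchunk/). RPS runs [the workflow](https://asascience-open.github.io/nextgen-dmac/ingest/ingest-prototype.html) in their AWS cloud environment every time a new data notification is received from a relevant source data bucket.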
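
    As a rough illustration of the lazy loading described above, the sketch below opens a single reference file with fsspec's "reference" filesystem and xarray's zarr engine. It assumes that fsspec, s3fs, xarray, dask, and a Zarr V2 compatible release of zarr are installed, that each reference is a single kerchunk JSON file, and that both the reference and the original data are readable anonymously from S3. The function names are hypothetical, and the caller must supply a real object key, since the bucket's directory layout is not described on this page.

    ```python
    def reference_fs_kwargs(reference_url: str) -> dict:
        """Build keyword arguments for fsspec's "reference" filesystem.

        Assumption: the reference file and the data it points to are both
        readable anonymously from S3.
        """
        return {
            "fo": reference_url,                # the kerchunk reference file
            "target_protocol": "s3",
            "target_options": {"anon": True},   # how to read the reference itself
            "remote_protocol": "s3",
            "remote_options": {"anon": True},   # how to read the referenced chunks
        }


    def open_reference(reference_url: str):
        """Open a kerchunk reference as a lazily loaded xarray Dataset."""
        # Imported here so the helper above works without the optional packages.
        import fsspec
        import xarray as xr

        fs = fsspec.filesystem("reference", **reference_fs_kwargs(reference_url))
        return xr.open_dataset(
            fs.get_mapper(""),
            engine="zarr",
            backend_kwargs={"consolidated": False},
            chunks={},  # dask-backed arrays: chunks are fetched only on compute
        )
    ```

    Calling open_reference with the s3:// URL of a reference file should return a dataset whose variables are read only when a selection is actually computed, for example after slicing a single time step.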

    The following datasets are currently being cloud-optimized. Refer to each dataset's own page for file naming conventions and other information regarding the specific model implementation:
    • NOAA Operational Forecast System (OFS)
    • NOAA Global Real-Time Ocean Forecast System (Global RTOFS)
    • NOAA National Water Model Short-Range Forecast

    Filenames follow the source dataset’s conventions. For example, if the source file is:
    nos.dbofs.fields.f024.20240527.t00z.nc

    then the cloud-optimized filename is the same, with “.zarr” appended:
    nos.dbofs.fields.f024.20240527.t00z.nc.zarr
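
    The naming rule above can be sketched in a few lines of Python. The helper names are hypothetical and only illustrate the “append .zarr” convention; they do not check that a given file actually exists in the bucket.

    ```python
    REFERENCE_SUFFIX = ".zarr"


    def reference_filename(source_filename: str) -> str:
        """Return the cloud-optimized reference name for a source file."""
        return source_filename + REFERENCE_SUFFIX


    def original_filename(reference_name: str) -> str:
        """Recover the source filename from a reference filename."""
        if not reference_name.endswith(REFERENCE_SUFFIX):
            raise ValueError(f"not a reference filename: {reference_name!r}")
        return reference_name[: -len(REFERENCE_SUFFIX)]


    print(reference_filename("nos.dbofs.fields.f024.20240527.t00z.nc"))
    # prints: nos.dbofs.fields.f024.20240527.t00z.nc.zarr
    ```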

    Data Aggregations
    We also produce virtual aggregations that group an entire forecast model run, as well as the “best” available forecast:
    • Best Forecast (continuously updated): nos.dbofs.fields.best.nc.zarr
    • Full Model Run: nos.dbofs.fields.forecast.[YYYYMMDD].t[CC]z.nc.zarr

    where:
    • CC is the model run cycle: 00, 06, 12, 18 or 03, 09, 15, 21 for nowcast and forecast runs
    • YYYY = year, MM = month, DD = day
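
    As a minimal sketch of the aggregation naming pattern above, the following Python builds both names from a run date and cycle. The function names are hypothetical, and treating “dbofs” as a replaceable model identifier is an assumption; the page only shows the pattern for dbofs and does not say which models use which set of cycles, so either listed set is accepted.

    ```python
    from datetime import date

    # Cycles listed above; which set applies to a given model is not stated here.
    VALID_CYCLES = {0, 6, 12, 18, 3, 9, 15, 21}


    def best_forecast_name(model: str = "dbofs") -> str:
        """Name of the continuously updated "best" forecast aggregation."""
        return f"nos.{model}.fields.best.nc.zarr"


    def full_run_name(run_date: date, cycle: int, model: str = "dbofs") -> str:
        """Name of the aggregation covering one full model run."""
        if cycle not in VALID_CYCLES:
            raise ValueError(f"unsupported model run cycle: {cycle}")
        return f"nos.{model}.fields.forecast.{run_date:%Y%m%d}.t{cycle:02d}z.nc.zarr"


    print(best_forecast_name())
    # prints: nos.dbofs.fields.best.nc.zarr
    print(full_run_name(date(2024, 5, 27), 0))
    # prints: nos.dbofs.fields.forecast.20240527.t00z.nc.zarr
    ```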

    Cloud optimization workflows supported by RPS Group, a Tetra Tech Company

    Features and programs

    Open Data Sponsorship Program

    This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.

    Pricing

    This is a publicly available dataset. No subscription is required.


    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information


    Delivery details

    AWS Data Exchange (ADX)

    AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.

    Open data resources

    Available with or without an AWS account.

    How to use
    To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).

    Description: Cloud-optimized Zarr Reference Files
    Resource type: S3 bucket
    Amazon Resource Name (ARN): arn:aws:s3:::noaa-nodd-kerchunk-pds
    AWS region: us-east-1
    AWS CLI access (no AWS account required): aws s3 ls --no-sign-request s3://noaa-nodd-kerchunk-pds/

    Description: New data notifications for Cloud-optimized Zarr Reference Files
    Resource type: SNS topic
    Amazon Resource Name (ARN): arn:aws:sns:us-east-1:123901341784:NewNODDKerchunkObject
    AWS region: us-east-1
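
    For readers who prefer Python to the AWS CLI, the sketch below lists the bucket anonymously using only the standard library and the S3 ListObjectsV2 REST interface. It is an illustration rather than an official client: the function names are hypothetical, and it assumes the bucket allows unsigned list requests over HTTPS in the same way it does for the CLI command above.

    ```python
    import urllib.parse
    import urllib.request
    import xml.etree.ElementTree as ET

    BUCKET = "noaa-nodd-kerchunk-pds"
    REGION = "us-east-1"
    S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}


    def list_url(prefix: str = "", max_keys: int = 20) -> str:
        """Build an unsigned ListObjectsV2 URL for the bucket."""
        query = urllib.parse.urlencode({
            "list-type": "2",
            "prefix": prefix,
            "delimiter": "/",          # group keys into "directories"
            "max-keys": str(max_keys),
        })
        return f"https://{BUCKET}.s3.{REGION}.amazonaws.com/?{query}"


    def parse_listing(xml_text: str) -> tuple[list[str], list[str]]:
        """Return (sub-prefixes, object keys) from a ListObjectsV2 response."""
        root = ET.fromstring(xml_text)
        prefixes = [e.text for e in root.findall("s3:CommonPrefixes/s3:Prefix", S3_NS)]
        keys = [e.text for e in root.findall("s3:Contents/s3:Key", S3_NS)]
        return prefixes, keys


    def list_bucket(prefix: str = "", max_keys: int = 20) -> tuple[list[str], list[str]]:
        """Fetch and parse one page of the listing (requires network access)."""
        with urllib.request.urlopen(list_url(prefix, max_keys), timeout=30) as resp:
            return parse_listing(resp.read().decode("utf-8"))
    ```

    Calling list_bucket() with no arguments corresponds roughly to the top-level CLI listing; passing one of the returned prefixes walks one level deeper. Only the first page of results is returned, so pagination would need to be added for large prefixes.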

    Resources

    Support

    Contact

    For questions regarding data content or quality, email the Tetra Tech team.
    For questions regarding data delivery, or general questions regarding the NOAA Open Data Dissemination (NODD) Program, email the NODD team at nodd@noaa.gov.
    We also seek to identify case studies on how NOAA data is being used and will be featuring those stories in joint publications and at upcoming events. If you are interested in seeing your story highlighted, please share it with the NODD team by emailing nodd@noaa.gov.

    How to cite

    NOAA Cloud Optimized Zarr Reference Files (Kerchunk) was accessed on DATE from https://registry.opendata.aws/noaa-nodd-kerchunk.

    License

    NOAA data disseminated through NODD are open to the public and can be used as desired.

    NOAA makes data openly available to ensure maximum use of our data, and to spur and encourage exploration and innovation throughout the industry. NOAA requests attribution for the use or dissemination of unaltered NOAA data. However, it is not permissible to state or imply endorsement by or affiliation with NOAA. If you modify NOAA data, you may not state or imply that it is original, unaltered NOAA data.
