Listing Thumbnail

    REDASA COVID-19 Open Data

     Info
    Open data
    |
    Deployed on AWS
    The REaltime DAta Synthesis and Analysis (REDASA) COVID-19 snapshot contains the output of the curation protocol produced by our curator community. A detailed description can be found in [our paper](https://www.jmir.org/2021/5/e25714). The first S3 bucket listed in Resources contains a large collection of medical documents in text format extracted from the [CORD-19 dataset](https://registry.opendata.aws/cord-19/), plus other sources deemed relevant by the REDASA consortium. The second S3 bucket contains a series of documents surfaced by [Amazon Kendra](https://aws.amazon.com/kendra/) that were considered relevant for each medical question asked. The final S3 bucket contains the GroundTruth annotations created by our curator community.

    Overview

    The REaltime DAta Synthesis and Analysis (REDASA) COVID-19 snapshot contains the output of the curation protocol produced by our curator community. A detailed description can be found in our paper . The first S3 bucket listed in Resources contains a large collection of medical documents in text format extracted from the CORD-19 dataset , plus other sources deemed relevant by the REDASA consortium. The second S3 bucket contains a series of documents surfaced by Amazon Kendra  that were considered relevant for each medical question asked. The final S3 bucket contains the GroundTruth annotations created by our curator community.

    Features and programs

    Open Data Sponsorship Program

    This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.

    Pricing

    This is a publicly available data set. No subscription is required.

    How can we make this page better?

    We'd like to hear your feedback and ideas on how to improve this page.
    We'd like to hear your feedback and ideas on how to improve this page.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

     Info

    Delivery details

    AWS Data Exchange (ADX)

    AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

    Open data resources

    Available with or without an AWS account.

    How to use
    To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more 
    Description
    This is the raw data repository containing a common crawl of CORD-19 papers and other sources identified by the REDASA Project.
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::pansurg-curation-raw-open-data
    AWS region
    eu-west-2
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://pansurg-curation-raw-open-data/
    Description
    For all the questions curated during the REDASA project, we created a Kendra index. The documents available in this S3 bucket were surfaced by the Kendra index as being relevant to the research medical question.
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::pansurg-curation-workflo-kendraqueryresults50d0eb-open-data
    AWS region
    eu-west-2
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://pansurg-curation-workflo-kendraqueryresults50d0eb-open-data/
    Description
    An S3 bucket that contains the final curation data in GroundTruth format
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::pansurg-curation-final-curations-open-data
    AWS region
    eu-west-2
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://pansurg-curation-final-curations-open-data/

    Resources

    Support

    Managed By

    REDASA Consortium, Imperial College London, UK

    How to cite

    REDASA COVID-19 Open Data was accessed on DATE from https://registry.opendata.aws/redasa-covid-data .

    License

    CC-BY-4.0