
Overview
The REaltime DAta Synthesis and Analysis (REDASA) COVID-19 snapshot contains the output of the curation protocol produced by our curator community. A detailed description can be found in our paper. The first S3 bucket listed under Open data resources contains a large collection of medical documents in text format, extracted from the CORD-19 dataset and from other sources deemed relevant by the REDASA consortium. The second S3 bucket contains the documents that Amazon Kendra surfaced as relevant to each medical question asked. The final S3 bucket contains the GroundTruth annotations created by our curator community.
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).

- Description: This is the raw data repository containing a common crawl of CORD-19 papers and other sources identified by the REDASA Project.
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::pansurg-curation-raw-open-data
- AWS region: eu-west-2
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://pansurg-curation-raw-open-data/

- Description: For all the questions curated during the REDASA project, we created a Kendra index. The documents available in this S3 bucket were surfaced by the Kendra index as being relevant to the research medical question.
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::pansurg-curation-workflo-kendraqueryresults50d0eb-open-data
- AWS region: eu-west-2
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://pansurg-curation-workflo-kendraqueryresults50d0eb-open-data/

- Description: An S3 bucket that contains the final curation data in GroundTruth format.
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::pansurg-curation-final-curations-open-data
- AWS region: eu-west-2
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://pansurg-curation-final-curations-open-data/
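
The three listing commands above follow one pattern: drop the arn:aws:s3::: prefix from the ARN to get the bucket name, then list it anonymously with --no-sign-request. The sketch below wraps that pattern in a few small shell functions. The function and variable names (arn_to_bucket, list_command, list_all_buckets, REDASA_ARNS, REDASA_REGION) are hypothetical and introduced only for illustration, and passing --region explicitly is an assumption rather than something the listing requires.

```shell
# ARNs and region copied from the resource listing above.
REDASA_REGION="eu-west-2"
REDASA_ARNS="arn:aws:s3:::pansurg-curation-raw-open-data
arn:aws:s3:::pansurg-curation-workflo-kendraqueryresults50d0eb-open-data
arn:aws:s3:::pansurg-curation-final-curations-open-data"

# Hypothetical helper: strip the fixed S3 ARN prefix to obtain the bucket name.
arn_to_bucket() {
  printf '%s\n' "${1#arn:aws:s3:::}"
}

# Hypothetical helper: print (without running) the anonymous listing command for one ARN.
list_command() {
  printf 'aws s3 ls --no-sign-request --region %s s3://%s/\n' \
    "$REDASA_REGION" "$(arn_to_bucket "$1")"
}

# Hypothetical helper: list every bucket in turn.
# Requires the AWS CLI and network access; no AWS account is needed.
list_all_buckets() {
  for arn in $REDASA_ARNS; do
    echo "== $(arn_to_bucket "$arn") =="
    aws s3 ls --no-sign-request --region "$REDASA_REGION" "s3://$(arn_to_bucket "$arn")/"
  done
}
```

Once these definitions are loaded into a shell, calling list_all_buckets should run the three listings in sequence. The same --no-sign-request flag also works with aws s3 cp and aws s3 sync if you want a local copy of a bucket's contents.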
Support
Managed By
REDASA Consortium, Imperial College London, UK
How to cite
REDASA COVID-19 Open Data was accessed on DATE from https://registry.opendata.aws/redasa-covid-data.
License
CC-BY-4.0