COVID-19 Open Research Dataset (CORD-19) | Allen Institute for AI

Provided By: Rearc

In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19), a free resource of over 44,000 scholarly articles, including over 29,000 with full text, about COVID-19 and the coronavirus family of viruses for use by the global research community.

Product offers

The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that your request is approved by the provider. Additional taxes or fees might apply.

Public offer

Payment schedule: Upfront payment | Offer auto-renewal: Supported
$0 for 12 months

Overview

The source code outlining how this product gathers, transforms, revises and publishes its datasets is available at https://github.com/rearc-data/covid-19-open-research .

Product Description

In response to the COVID-19 pandemic, the Allen Institute for AI has partnered with leading research groups to prepare and distribute the COVID-19 Open Research Dataset (CORD-19), a free resource of over 44,000 scholarly articles, including over 29,000 with full text, about COVID-19 and the coronavirus family of viruses for use by the global research community.

This dataset is intended to mobilize researchers to apply recent advances in natural language processing to generate new insights in support of the fight against this infectious disease. The corpus will be updated weekly as new research is published in peer-reviewed publications and archival services like bioRxiv, medRxiv, and others.

Data Sources

This resource includes the metadata.csv file released weekly by the Allen Institute for AI, which documents COVID-19 updates and new research published in peer-reviewed publications. The columns of the dataset are:

cord_uid, sha, source_x, title, doi, pmcid, pubmed_id, license, abstract, publish_time, authors, journal, microsoft_academic_paper_id, who_covidence, has_pdf_parse, has_pmc_xml_parse, full_text_file, url
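
As a minimal sketch of how the column list above might be used, the following Python snippet reads a metadata.csv export with the standard library, checks that the listed columns are present, and flags rows that have a full-text parse. The function names `read_metadata` and `has_full_text` are hypothetical helpers introduced for illustration, and the assumption that the two parse flags are stored as the strings "True" and "False" is not confirmed by this listing.

```python
import csv
import io

# Column names as listed in this product's description of metadata.csv.
EXPECTED_COLUMNS = [
    "cord_uid", "sha", "source_x", "title", "doi", "pmcid", "pubmed_id",
    "license", "abstract", "publish_time", "authors", "journal",
    "microsoft_academic_paper_id", "who_covidence", "has_pdf_parse",
    "has_pmc_xml_parse", "full_text_file", "url",
]


def read_metadata(handle):
    """Read all rows from an open metadata.csv handle as dictionaries.

    Raises ValueError if any of the documented columns is missing.
    """
    reader = csv.DictReader(handle)
    present = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in present]
    if missing:
        raise ValueError(f"metadata file is missing columns: {missing}")
    return list(reader)


def has_full_text(row):
    """Return True if either parse flag is set for this row.

    Assumption: the flags are stored as the strings "True" / "False".
    """
    flags = ("has_pdf_parse", "has_pmc_xml_parse")
    return any(row[flag].strip().lower() == "true" for flag in flags)
```

Assuming the file has been exported to a local path such as `metadata.csv` (a placeholder), it could be opened with `open("metadata.csv", newline="", encoding="utf-8")` and passed to `read_metadata`; filtering the result with `has_full_text` then gives the subset of articles that have a parsed full text.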

To explore additional COVID-19 resources distributed by the Allen Institute for AI, please click here.

More Information

Contact Details

  • If you find any issues with or have enhancement ideas for this product, open up a GitHub issue and we will gladly take a look at it. Better yet, submit a pull request. Any contributions you make are greatly appreciated ❤️.
  • If you are looking for specific open datasets currently not available on ADX, please submit a request on our project board here.
  • If you have questions about the source data, please contact feedback@semanticscholar.org.
  • If you have any other questions or feedback, send us an email at data@rearc.io.

About Rearc

Rearc is a cloud, software and services company. We believe that empowering engineers drives innovation. Cloud-native architectures, modern software and data practices, and the ability to safely experiment can enable engineers to realize their full potential. We have partnered with several enterprises and startups to help them achieve agility. Our approach is simple — empower engineers with the best tools possible to make an impact within their industry.

Provided By: Rearc
Fulfillment Method: AWS Data Exchange

Data sets (1)

You will receive access to the following data sets

Revision access rules: All historical revisions | All future revisions
Name: COVID-19 Open Research Dataset (CORD-19) | Allen Institute for AI
Data dictionary: Not included
AWS Region: US East (N. Virginia)

Usage information

By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement. Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.

Support information

Support contact email address
Support contact URL
Refund policy: Refunds Not Applicable
General AWS Data Exchange support