    Overview

    The International Cancer Genome Consortium (ICGC) coordinates projects with the common aim of accelerating research into the causes and control of cancer. The PanCancer Analysis of Whole Genomes (PCAWG) study is an international collaboration to identify common patterns of mutation in whole genomes from ICGC. More than 2,400 consistently analyzed genomes corresponding to over 1,100 unique ICGC donors are now freely available on Amazon S3 to credentialed researchers subject to ICGC data sharing policies.

    Features and programs

    Open Data Sponsorship Program

    This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available, high-value, cloud-optimized datasets.

    Pricing

    This is a publicly available dataset. No subscription is required.

    Legal

    Content disclaimer

    Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

    Usage information

    Delivery details

    AWS Data Exchange (ADX)

    AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.

    Open data resources

    Available with or without an AWS account.

    How to use
    To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
    Description
    BAM and VCF files from the PanCancer Analysis of Whole Genomes (PCAWG) study.
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::icgc
    AWS region
    us-east-1
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://icgc/
    Description
    This public Amazon S3 bucket contains analysis metadata in XML format for genome analysis results. More information at http://oicr.icgc.meta.s3.amazonaws.com/metadata/README
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::oicr.icgc.meta/metadata
    AWS region
    us-east-1
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://oicr.icgc.meta/metadata/
    Description
    Raw sequencing and other primary data from non-TCGA ICGC projects
    Resource type
    S3 bucket
    Amazon Resource Name (ARN)
    arn:aws:s3:::oicr.icgc
    AWS region
    us-east-1
    AWS CLI access (No AWS account required)
    aws s3 ls --no-sign-request s3://oicr.icgc/
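
Each resource above pairs an ARN with the equivalent `s3://` URI used on the command line. As a minimal sketch of that mapping, the following shell snippet derives the listing command from each ARN. The helper name `arn_to_s3_uri` is hypothetical and not part of the AWS CLI; the snippet only prints the commands and does not contact AWS.

```shell
#!/bin/sh
# Hypothetical helper (not part of the AWS CLI): turn an S3 ARN of the form
# arn:aws:s3:::bucket[/prefix] into an s3:// URI with a trailing slash.
arn_to_s3_uri() {
    s3_arn=$1
    case $s3_arn in
        arn:aws:s3:::?*) ;;                           # looks like an S3 ARN
        *) echo "not an S3 ARN: $s3_arn" >&2; return 1 ;;
    esac
    s3_path=${s3_arn#arn:aws:s3:::}                   # strip the ARN prefix
    printf 's3://%s/\n' "${s3_path%/}"                # avoid doubling a trailing slash
}

# Print the anonymous listing command for each ARN listed on this page.
for resource_arn in \
    arn:aws:s3:::icgc \
    arn:aws:s3:::oicr.icgc.meta/metadata \
    arn:aws:s3:::oicr.icgc
do
    printf 'aws s3 ls --no-sign-request %s\n' "$(arn_to_s3_uri "$resource_arn")"
done
```

To run a listing directly instead of printing it, the same helper can be used inline, for example `aws s3 ls --no-sign-request "$(arn_to_s3_uri arn:aws:s3:::icgc)"`. The `--no-sign-request` option sends the request without AWS credentials, which matches the "No AWS account required" note above.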

    Resources

    Support

    How to cite

    ICGC on AWS was accessed on DATE from https://registry.opendata.aws/icgc.

    License

    Data use is subject to the access and publication policies of the source. Distribution of the data is subject to ICGC Trusted Partner Approval. More information on terms of use is available at https://icgc.org/daco