Posted On: Nov 18, 2015

Qualified researchers can now access two of the world’s largest collections of cancer genome data as AWS Public Data Sets.

The Cancer Genome Atlas (TCGA) corpus of raw and processed genomic, transcriptomic, and epigenomic data from thousands of cancer patients is now freely available on Amazon S3 for registered users of the Cancer Genomics Cloud, one of the funded cancer cloud pilots of the National Cancer Institute.

Over 2,400 whole genomes from 1,100 unique donors from the International Cancer Genome Consortium (ICGC) PanCancer Analysis of Whole Genomes (PCAWG) data set is also freely available on Amazon S3 for credentialed researchers subject to the ICGC data sharing policies.

Access to the TCGA and ICGC data sets on AWS will be administered by third-party Trusted Partners who will also curate a set of cloud-based tools to accelerate use of the data. Making these data and tools available in the cloud to qualified researchers drastically lowers the barrier to entry in working with petabyte scale cancer genomic data, enhance collaboration across research groups, and potentially accelerate the development of new treatments for cancer patients.

Learn more about these genomics resources on our blog:
Visit the TCGA on AWS page for information on accessing the data:
Visit the ICGC on AWS page for information on accessing the data: