YRI Trio Dataset

Public Data Sets>Biology>YRI Trio Dataset
Complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria

Details

Submitted By: Santiago@AWS
US Snapshot ID (Linux/Unix): snap-9637b3ff
Size: 700GB
Source: Illumina and NCBI
Created On: October 17, 2009 2:48 AM GMT
Last Updated: October 19, 2009 4:57 PM GMT

The YRI Trio Dataset provides complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria, which represent the first human genomes sequenced using Illumina's next generation Sequence-by-Synthesis technology. For each genome, the dataset contains >30x average depth of paired 35-base reads.

This data set can be used for the following applications:

  • The development of alignment algorithms
  • The development of de novo assembly algorithms
  • The development of algorithms that define genetic regions of interest, sequence motifs, structural variants, copy number variations, and site-specific polymorphisms
  • To test the viability of annotation engines that start with raw sequence data
©2014, Amazon Web Services, Inc. or its affiliates. All rights reserved.