
Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge
Provided by: Langmead Lab at Johns Hopkins University & Kim Lab at University of Texas Southwestern, part of the AWS Open Data Sponsorship Program
Provided by: Langmead Lab at Johns Hopkins University & Kim Lab at University of Texas Southwestern, part of the AWS Open Data Sponsorship Program

Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge
Provided by: Langmead Lab at Johns Hopkins University & Kim Lab at University of Texas Southwestern, part of the AWS Open Data Sponsorship Program
Provided by: Langmead Lab at Johns Hopkins University & Kim Lab at University of Texas Southwestern, part of the AWS Open Data Sponsorship Program
This product is part of the AWS Open Data Sponsorship Program and contains data sets that are publicly available for anyone to access and use. No subscription is required. Unless specifically stated in the applicable data set documentation, data sets available through the AWS Open Data Sponsorship Program are not provided and maintained by AWS.
Description
Genomic tools use reference databases as indexes to operate quickly and efficiently, analogous to how web search engines use indexes for fast querying. Here, we aggregate genomic, pan-genomic and metagenomic indexes for analysis of sequencing data.
License
Public Domain
Documentation
How to cite
Cloud Indexes for Bowtie, Kraken, HISAT, and Centrifuge was accessed on DATE
from https://registry.opendata.aws/jhu-indexes .
Update frequency
As new data becomes available; roughly quarterly
Support information
Managed by: Langmead Lab at Johns Hopkins University & Kim Lab at University of Texas Southwestern
General AWS Data Exchange support
Resources on AWS
Description
This bucket contains genomic indexes for Bowtie, Kraken, HISAT, and Centrifuge.
Resource type
S3 Bucket
Amazon Resource Name (ARN)
arn:aws:s3:::genome-idx
AWS Region
us-east-1
AWS CLI Access (No AWS account required)
aws s3 ls --no-sign-request s3://genome-idx/
Usage examples
Publications
- Reducing reference bias using multiple population reference genomes by Chen et al (2020)