
Overview
The Open Human Genome Library (OpenHGL) is a collection of high-quality de novo human assemblies that are publicly available in genomic databases (e.g. NCBI and CNCB) or from individual research papers. It provides consistent naming and uniform formats across datasets, supporting efficient subsequence retrieval and approximate string search.
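As a rough illustration of subsequence retrieval and string search, the sketch below wraps the `agc` and `ropebwt3` command-line tools, which match the AGC and ropebwt3 formats named in the bucket description on this page. It assumes both tools are installed and that an archive and an index have already been downloaded. The helper names, file names, contig and sample names are all hypothetical. The `contig@sample:from-to` query form and the `mem` and `sw` subcommands are given as best understood from those tools' own documentation, so check them (including the coordinate convention) against the installed versions.

```shell
# Sketch only: helper names are illustrative, not part of OpenHGL.
# Archive and index paths are placeholders for files you have downloaded.

# Compose an AGC contig query of the form contig@sample:from-to.
agc_query() {
  printf '%s@%s:%s-%s\n' "$1" "$2" "$3" "$4"
}

# Extract a subsequence from an AGC archive as FASTA on stdout.
# usage: fetch_subseq <archive.agc> <contig> <sample> <from> <to>
fetch_subseq() {
  agc getctg "$1" "$(agc_query "$2" "$3" "$4" "$5")"
}

# Report maximal exact matches of at least 31 bp against a ropebwt3 index.
# usage: find_exact <index> <query.fa>
find_exact() {
  ropebwt3 mem -l31 "$1" "$2"
}

# Local alignment against the index with default settings (assumed to be
# the closest fit to "approximate string search").
# usage: find_approx <index> <query.fa>
find_approx() {
  ropebwt3 sw "$1" "$2"
}
```

After sourcing the file, a call such as `fetch_subseq <archive.agc> <contig> <sample> 100 200` would print the requested interval, with the archive, contig, and sample names taken from the actual data.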
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use
- To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
- Description
- This bucket contains genomic sequences in the AGC format and the corresponding FM-index in the ropebwt3 format.
- Resource type
- S3 bucket
- Amazon Resource Name (ARN)
- arn:aws:s3:::openhgl
- AWS region
- us-east-1
- AWS CLI access (No AWS account required)
- aws s3 ls --no-sign-request s3://openhgl/
- Description
- Notifications for OpenHGL updates
- Resource type
- SNS topic
- Amazon Resource Name (ARN)
- arn:aws:sns:us-east-1:104240442756:openhgl-object_created
- AWS region
- us-east-1
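The two resources above can be used from the AWS CLI roughly as sketched here. The bucket name, region, and topic ARN come from this page. The `openhgl_*` helper names are illustrative, and object keys are not listed here, so they must be taken from an actual bucket listing. Subscribing to the SNS topic, unlike reading the bucket, requires an AWS account, and the sketch assumes the topic accepts SQS subscriptions.

```shell
# Sketch only: the openhgl_* helpers are hypothetical conveniences.

BUCKET="openhgl"
REGION="us-east-1"
TOPIC_ARN="arn:aws:sns:us-east-1:104240442756:openhgl-object_created"

# Build the s3:// URI for a key (no argument means the bucket root).
openhgl_uri() {
  printf 's3://%s/%s\n' "$BUCKET" "${1:-}"
}

# List objects under a prefix. --no-sign-request skips credentials.
openhgl_ls() {
  aws s3 ls --no-sign-request --region "$REGION" "$(openhgl_uri "${1:-}")"
}

# Download one object to a local path (default: current directory).
# usage: openhgl_get <key> [destination]
openhgl_get() {
  aws s3 cp --no-sign-request --region "$REGION" "$(openhgl_uri "$1")" "${2:-.}"
}

# Subscribe an SQS queue you own to update notifications.
# Needs AWS credentials; the queue ARN is supplied by the caller.
# usage: openhgl_subscribe_sqs <queue-arn>
openhgl_subscribe_sqs() {
  aws sns subscribe --region "$REGION" --topic-arn "$TOPIC_ARN" \
    --protocol sqs --notification-endpoint "$1"
}
```

After sourcing the file, `openhgl_ls` lists the bucket root and `openhgl_get <key>` fetches a key copied from that listing. The subscribe call is made in `us-east-1` because that is the topic's region, and the queue's access policy must allow the topic to send messages to it before notifications arrive.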
Support
Managed by
Heng Li lab at Dana-Farber Cancer Institute and Harvard Medical School
How to cite
Open Human Genome Library was accessed on DATE from https://registry.opendata.aws/openhgl.
License
Creative Commons Zero (CC0)
