
LongBench - cross-platform reference dataset profiling cancer cell lines with bulk and single-cell approaches
Overview
LongBench is a comprehensive benchmark dataset of the latest long-read transcriptomics technologies from Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio), alongside a comparison with next-generation sequencing from Illumina. We generated bulk and single-cell libraries from lung cancer cell lines that span different cancer subtypes, so the data capture real biological variation. To further compare and assess sequencing platform performance, Sequins and SIRVs (Set 4) synthetic spike-ins are included.
Features and programs
Open Data Sponsorship Program
Pricing
This is a publicly available data set. No subscription is required.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
- Description: Bulk, single-cell, and single-nucleus RNA-seq data from the LongBench project, covering eight human lung cancer cell lines. Bulk sequencing (FASTQ) was performed on ONT PCR-cDNA, ONT direct RNA (including pod5 files for RNA modification analysis), PacBio Kinnex, and Illumina platforms. Single-cell and single-nucleus sequencing (FASTQ) was performed on ONT PCR-cDNA, PacBio Kinnex, and Illumina platforms. Aligned reads (BAM), variant calls (VCF), and processed gene expression data are also provided, along with reference genome annotations (GTF and FASTA).
- Resource type: S3 bucket
- Amazon Resource Name (ARN): arn:aws:s3:::longbench-data
- AWS region: ap-southeast-2
- AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://longbench-data/
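For readers without the AWS CLI installed, the same anonymous listing can be approximated with Python's standard library. This is a minimal sketch, not an official access method. It assumes the bucket also answers unsigned S3 REST `ListObjectsV2` requests (which is what `--no-sign-request` relies on) and uses the standard virtual-hosted S3 endpoint built from the bucket name and region given above. The function names `list_url`, `parse_listing`, and `list_bucket` are illustrative, and no object keys or prefixes inside the bucket are assumed.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

BUCKET = "longbench-data"   # from the ARN arn:aws:s3:::longbench-data
REGION = "ap-southeast-2"   # AWS region stated in the listing
S3_NS = {"s3": "http://s3.amazonaws.com/doc/2006-03-01/"}  # S3 XML namespace


def list_url(prefix="", delimiter="/"):
    """Build an unsigned ListObjectsV2 URL (assumes anonymous listing is allowed)."""
    query = urllib.parse.urlencode(
        {"list-type": "2", "prefix": prefix, "delimiter": delimiter}
    )
    return f"https://{BUCKET}.s3.{REGION}.amazonaws.com/?{query}"


def parse_listing(xml_data):
    """Return (sub_prefixes, [(key, size_in_bytes), ...]) from a ListObjectsV2 XML body."""
    root = ET.fromstring(xml_data)
    prefixes = [
        node.findtext("s3:Prefix", namespaces=S3_NS)
        for node in root.findall("s3:CommonPrefixes", S3_NS)
    ]
    objects = [
        (
            node.findtext("s3:Key", namespaces=S3_NS),
            int(node.findtext("s3:Size", namespaces=S3_NS)),
        )
        for node in root.findall("s3:Contents", S3_NS)
    ]
    return prefixes, objects


def list_bucket(prefix=""):
    """Fetch one page of the listing (S3 returns at most 1000 entries per page).

    Requires network access; pagination via continuation tokens is omitted.
    """
    with urllib.request.urlopen(list_url(prefix), timeout=30) as resp:
        return parse_listing(resp.read())


# Example usage (needs network access):
#   prefixes, objects = list_bucket()
#   for p in prefixes:
#       print("PRE", p)
```

The sketch lists a single level of the bucket, mirroring what `aws s3 ls` shows. For bulk downloads of FASTQ, BAM, or pod5 files, the AWS CLI is likely the more practical choice.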
Support
Contact
Managed By
Richie Lab, Walter and Eliza Hall Institute of Medical Research
How to cite
LongBench - cross-platform reference dataset profiling cancer cell lines with bulk and single-cell approaches was accessed on DATE from https://registry.opendata.aws/longbench.
License
CC BY 4.0