CZ Grand Challenges - Transcriptomic MIT Licensed data and models

Sold by: Chan Zuckerberg Initiative Foundation

This dataset contains a transcriptomics biological data and models. The models embed transcriptomic data and facilitate transcriptomic analysis. The data is sourced and curated by a team of experts at CZI and is made available as part of these datasets only when it is not publicly accessible or requires transformations to support model training.

Overview

Features and programs

Open Data Sponsorship Program

This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.

Learn more

Pricing

This is a publicly available data set. No subscription is required.

How can we make this page better?

We'd like to hear your feedback and ideas on how to improve this page.

Legal

Content disclaimer

Vendors are responsible for their product descriptions and other product content. AWS does not warrant that vendors' product descriptions or other product content are accurate, complete, reliable, current, or error-free.

Usage information

Info

Delivery details

AWS Data Exchange (ADX)

AWS Data Exchange is a service that helps AWS easily share and manage data entitlements from other organizations at scale.

Open data resources

Available with or without an AWS account.

How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI). Learn more

Description: scGenePT model & training data
Resource type: S3 bucket
Amazon Resource Name (ARN): arn:aws:s3:::czi-scgenept-public
AWS region: us-west-2
AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://czi-scgenept-public/

Description: TranscriptFormer model
Resource type: S3 bucket
Amazon Resource Name (ARN): arn:aws:s3:::czi-transcriptformer
AWS region: us-west-2
AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://czi-transcriptformer/

Resources

Vendor resources

View this dataset on Github

Support

Contact

data@chanzuckerberg.com

Managed By

Chan Zuckerberg Initiative Foundation

How to cite

CZ Grand Challenges - Transcriptomic MIT Licensed data and models was accessed on DATE from https://registry.opendata.aws/czi-transcriptomics-mit .

License

MIT License

Similar products

Windows Server 2025 with AWS CLI | Support by Belinda CZ s.r.o.

By Belinda CZ s.r.o.

This product has charges associated with it for seller support. Lightweight, secure, and agile server solution designed for small businesses, branch offices, and edge deployments. While optimized for smaller environments, it also provides value for larger organizations that require efficient, scalable server solutions. Featuring the latest enhancements in hybrid cloud integration, AI-driven management, advanced container support, and robust security, this image streamlines operations while reducing resource overhead. With simplified connectivity to Azure and improved performance focused on essential functionality, it is the ideal choice for those who need modern server capabilities without the bulk of larger editions.

View product

Historical Job Postings for Czechia (CZ)

By Techmap

This dataset includes historical and upcoming job postings for Czechia (CZ) collected by Techmap since January 2020, with an average of 46.8k new postings added monthly. Use it to identify leads, track hiring trends, analyze markets, spot company signals, or enhance job boards. Gain actionable insights into emerging technologies and potential prospects to stay competitive.

View product

3 months job postings data feed for Czechia (CZ)

By Techmap

This data feed provides access to the last 3 months of job postings from Czechia (CZ). On average, we add 480 new job postings daily. Old files with job postings are removed after 100 days. Utilize this data to gain actionable insights into companies, markets, services, or technologies, or to backfill a job board. Identify company signals, analyze hiring trends, spot emerging technologies, and discover potential leads to stay ahead of the competition.

View product

CZ CELLxGENE Discover Census

By Chan Zuckerberg Initiative Foundation part of the AWS Open Data Sponsorship Program

CZ CELLxGENE Discover (cellxgene.cziscience.com) is a free-to-use platform for the exploration, analysis, and retrieval of single-cell data. CZ CELLxGENE Discover hosts the largest aggregation of standardized single-cell data from the major human and mouse tissues, with modalities that include gene expression, chromatin accessibility, DNA methylation, and spatial transcriptomics. This year, CZ CELLxGENE Discover has made available all of its human and mouse RNA single-cell data through Census (https://chanzuckerberg.github.io/cellxgene-census/) – a free-to-use service with an API and data that allows for querying its single-cell data corpus directly from Python or R. The API uses a new technology, TileDB-SOMA, that allows for efficient and low-latency querying. The data are fully standardized and hosted publicly for free access, and they are composed by a count matrix of tens of millions of cells (observations) by >60 k genes (features) accomp[...]

View product

CZ Grand Challenges - Model Benchmarking

By Chan Zuckerberg Initiative Foundation

This dataset includes data and models relevant to benchmarking multimodal biological models. The data has been sourced and curated by a team of experts at CZI and is provided as part of these datasets only when it is not publicly available or requires transformation to support effective model benchmarking.

View product