
Overview
This dataset contains transcriptomic data and models. The models embed transcriptomic data and facilitate transcriptomic analysis. The data is sourced and curated by a team of experts at CZI and is included in these datasets only when it is not publicly accessible or requires transformation to support model training.
Features and programs
Open Data Sponsorship Program
This dataset is part of the Open Data Sponsorship Program, an AWS program that covers the cost of storage for publicly available high-value cloud-optimized datasets.
Pricing
This is a publicly available dataset. No subscription is required.
Delivery details
AWS Data Exchange (ADX)
AWS Data Exchange is a service that helps AWS customers easily share and manage data entitlements from other organizations at scale.
Open data resources
Available with or without an AWS account.
- How to use: To access these resources, reference the Amazon Resource Name (ARN) using the AWS Command Line Interface (CLI).
- Description: scGenePT model & training data
  - Resource type: S3 bucket
  - Amazon Resource Name (ARN): arn:aws:s3:::czi-scgenept-public
  - AWS region: us-west-2
  - AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://czi-scgenept-public/
- Description: TranscriptFormer model
  - Resource type: S3 bucket
  - Amazon Resource Name (ARN): arn:aws:s3:::czi-transcriptformer
  - AWS region: us-west-2
  - AWS CLI access (No AWS account required): aws s3 ls --no-sign-request s3://czi-transcriptformer/
How to cite
CZ Grand Challenges - Transcriptomic MIT Licensed data and models was accessed on DATE from https://registry.opendata.aws/czi-transcriptomics-mit.
Similar products
CZ CELLxGENE Discover (cellxgene.cziscience.com) is a free-to-use platform for the exploration, analysis, and retrieval of single-cell data. CZ CELLxGENE Discover hosts the largest aggregation of standardized single-cell data from the major human and mouse tissues, with modalities that include gene expression, chromatin accessibility, DNA methylation, and spatial transcriptomics.
This year, CZ CELLxGENE Discover has made available all of its human and mouse RNA single-cell data through Census (https://chanzuckerberg.github.io/cellxgene-census/) – a free-to-use service with an API and data that allows for querying its single-cell data corpus directly from Python or R.
The API uses a new technology, TileDB-SOMA, that allows for efficient, low-latency querying. The data are fully standardized and hosted publicly for free access, and they consist of a count matrix of tens of millions of cells (observations) by more than 60k genes (features) accomp[...]

This dataset includes data and models relevant to benchmarking multimodal biological models. The data has been sourced and curated by a team of experts at CZI and is provided as part of these datasets only when it is not publicly available or requires transformation to support effective model benchmarking.