AWS HealthOmics

Why AWS HealthOmics

AWS HealthOmics is a HIPAA-eligible service that accelerates clinical diagnostic testing, drug discovery, and agricultural research by fully managing the infrastructure behind your bioinformatics workflows. HealthOmics supports industry-standard workflow languages (WDL, Nextflow, CWL) and scales bioinformatics infrastructure to handle data from tens of thousands of tests per day, all at a predictable per-sample cost. HealthOmics takes on technical work such as managing compute resources and maintaining workflow engines so you can focus on scientific breakthroughs.

AWS HealthOmics features

Automated Workflow Orchestration

HealthOmics automatically provisions and orchestrates all infrastructure components including workflow engines, compute resources, and filesystems. It coordinates the execution of bioinformatics workflows across distributed computing resources, managing job dependencies, data movement, and resource allocation without manual intervention. The orchestration layer handles failure recovery, retry logic, and resource optimization to ensure reliable workflow completion at any scale.
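To make "managing job dependencies" and "retry logic" concrete, the following minimal Python sketch shows the general technique: tasks run in dependency order, and a transient failure is retried a bounded number of times. It illustrates the concept only and does not describe how HealthOmics is implemented. The task names, the `run_with_retries` and `run_workflow` helpers, and the retry limit are all hypothetical.

```python
from graphlib import TopologicalSorter


def run_with_retries(task, action, max_attempts=3):
    """Call action(task) until it succeeds or the attempts run out."""
    for attempt in range(1, max_attempts + 1):
        try:
            return action(task)
        except RuntimeError:
            # Re-raise only when the final attempt has also failed.
            if attempt == max_attempts:
                raise


def run_workflow(dependencies, action, max_attempts=3):
    """Run each task only after the tasks it depends on have finished.

    `dependencies` maps a task name to the set of tasks it depends on.
    Returns the task names in the order they completed.
    """
    completed = []
    for task in TopologicalSorter(dependencies).static_order():
        run_with_retries(task, action, max_attempts)
        completed.append(task)
    return completed


# Hypothetical three-step pipeline: each step depends on the previous one.
PIPELINE = {
    "align": set(),
    "call_variants": {"align"},
    "annotate": {"call_variants"},
}


def make_flaky_action():
    """Return an action that fails once for 'align', then succeeds."""
    attempts = {}

    def action(task):
        attempts[task] = attempts.get(task, 0) + 1
        if task == "align" and attempts[task] == 1:
            raise RuntimeError("transient failure")
        return task

    return action, attempts
```

Calling `run_workflow(PIPELINE, action)` with the flaky action completes all three steps in order, with the first step attempted twice. A managed service takes over this bookkeeping so that workflow authors do not have to write it.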

Multiple Workflow Language Support

HealthOmics provides native support for domain-specific workflow languages including WDL, Nextflow, and CWL, enabling teams to use existing workflow definitions without modification. The workflow engine architecture allows seamless integration of custom workflows while maintaining consistent execution and monitoring capabilities.

Domain-Specific API Architecture

HealthOmics offers purpose-built APIs designed specifically for bioinformatics and life sciences workloads, providing high-level abstractions for complex operations. These APIs enable programmatic access to workflow submission, monitoring, and data management functions while supporting interactive processing modes. The architecture simplifies integration with existing laboratory information management systems and processing pipelines through standardized interfaces.
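As a rough illustration of programmatic workflow submission, the sketch below assembles a request for the HealthOmics `StartRun` operation and submits it through a client object. With the AWS SDK for Python, that client would be `boto3.client("omics")`. The helper names, the `StubClient` stand-in (used so the sketch runs without AWS credentials), and every ID, ARN, and S3 path are placeholders assumed for the example.

```python
def build_start_run_request(workflow_id, role_arn, output_uri, parameters, name=None):
    """Assemble keyword arguments for a StartRun call."""
    if not output_uri.startswith("s3://"):
        raise ValueError("output_uri must be an S3 URI")
    request = {
        "workflowId": workflow_id,   # ID of a workflow already registered
        "roleArn": role_arn,         # IAM role the run assumes
        "outputUri": output_uri,     # S3 location for run outputs
        "parameters": parameters,    # workflow-specific inputs
    }
    if name is not None:
        request["name"] = name
    return request


def submit_run(client, request):
    """Submit the run via client.start_run and return the run ID."""
    response = client.start_run(**request)
    return response["id"]


class StubClient:
    """Offline stand-in for boto3.client("omics"); records the last request."""

    def __init__(self):
        self.last_request = None

    def start_run(self, **kwargs):
        self.last_request = kwargs
        return {"id": "0000000", "status": "PENDING"}


# Placeholder values; replace with real identifiers in practice.
EXAMPLE_REQUEST = build_start_run_request(
    workflow_id="1234567",
    role_arn="arn:aws:iam::123456789012:role/ExampleOmicsRole",
    output_uri="s3://example-bucket/outputs/",
    parameters={"sample_name": "NA12878"},
    name="example-run",
)
```

Passing a real client in place of `StubClient()` to `submit_run` would start an actual run, so check the required IAM permissions and request fields against the current API reference first.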

Sequence Store

HealthOmics includes a purpose-built Sequence Store repository that efficiently stores, indexes, and retrieves genomic sequence data at petabyte scale. It automatically compresses and optimizes storage for genomic file formats, reducing storage costs while maintaining rapid access performance. The Sequence Store provides built-in version control, metadata management through customizable tags, and supports secure data sharing with fine-grained access controls.

Scalability

HealthOmics scales elastically to process tens of thousands of samples daily across 100,000+ concurrent vCPUs, automatically adjusting resources based on workload demands. Its transparent pricing avoids the unpredictability of fixed licensing models, which matters most for workloads that range from small research projects to large-scale production environments. The infrastructure dynamically allocates compute, memory, and storage resources to optimize performance while controlling costs through efficient resource utilization.

Compliance and Audit Trails

HealthOmics maintains comprehensive audit logs that track every operation performed on data, capturing who accessed what information and when. It provides detailed provenance tracking throughout the workflow lifecycle, documenting all transformations and processing steps for regulatory compliance. The audit trail capabilities support clinical diagnostic requirements and research reproducibility standards through immutable logging and chain-of-custody documentation.
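The idea behind immutable, chain-of-custody logging can be sketched with a hash chain: each entry records who did what, to which resource, and when, plus the hash of the previous entry, so any later alteration is detectable. This is a generic illustration of the technique, not a description of how HealthOmics stores its audit logs. The field names and the `append_entry` and `verify_chain` helpers are assumptions made for the example.

```python
import hashlib
import json

GENESIS_HASH = "0" * 64  # placeholder hash that precedes the first entry


def _digest(body):
    """Hash an entry body deterministically (sorted keys, UTF-8)."""
    return hashlib.sha256(json.dumps(body, sort_keys=True).encode("utf-8")).hexdigest()


def append_entry(log, actor, action, resource, timestamp):
    """Append an entry that is linked to the previous one by its hash."""
    body = {
        "actor": actor,
        "action": action,
        "resource": resource,
        "timestamp": timestamp,
        "prev_hash": log[-1]["hash"] if log else GENESIS_HASH,
    }
    entry = dict(body, hash=_digest(body))
    log.append(entry)
    return entry


def verify_chain(log):
    """Return True only if no entry has been altered, removed, or reordered."""
    prev_hash = GENESIS_HASH
    for entry in log:
        body = {key: value for key, value in entry.items() if key != "hash"}
        if body["prev_hash"] != prev_hash or _digest(body) != entry["hash"]:
            return False
        prev_hash = entry["hash"]
    return True
```

After appending a few entries, `verify_chain` returns `True`. Editing any field of an earlier entry makes it return `False`, which is the property an auditor relies on.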

HIPAA Eligibility

HealthOmics is HIPAA-eligible, enabling organizations to process protected health information (PHI) within a compliant infrastructure.