reference deployment

Illumina DRAGEN on AWS

Analysis of next-generation sequencing data

This Partner Solution deploys the Illumina DRAGEN (Dynamic Read Analysis for GENomics) Bio-IT Platform to the Amazon Web Services (AWS) Cloud.

The DRAGEN Bio-IT Platform enables ultra-rapid analysis of next-generation sequencing (NGS) data, significantly reduces the time required to analyze genomic data, and improves accuracy. It includes bioinformatics pipelines that provide optimized algorithms for mapping, aligning, sorting, duplicate marking, and haplotype variant calling. These pipelines include Germline, Somatic (tumor and tumor/normal), RNA, Single Cell RNA, Methylation, Joint Genotyping, and DRAGEN-GATK.

The Partner Solution builds an AWS environment that spans two Availability Zones for high availability and provisions two AWS Batch compute environments: one for Spot Instances and one for On-Demand Instances. These environments include DRAGEN F1 instances connected to field-programmable gate arrays (FPGAs) for hardware acceleration.


This Partner Solution was developed by Illumina in collaboration with AWS. Illumina is an AWS Partner.


AWS Service Catalog administrators can add this architecture to their own catalog.  

  •  What you'll build
  • Use this Partner Solution to set up the following configurable environment on AWS:

    • A highly available architecture that spans two Availability Zones.*
    • A virtual private cloud (VPC) configured with public and private subnets according to AWS best practices. This provides the network infrastructure for your deployment.*
    • An internet gateway to provide access to the internet.*
    • In the public subnets, managed NAT gateways to allow outbound internet access for resources in the private subnets.*
    • An AWS CodePipeline pipeline that builds a Docker image and uploads it into an Amazon Elastic Container Registry (Amazon ECR) repository.
    • Two AWS Batch compute environments: one for Amazon Elastic Compute Cloud (Amazon EC2) Spot Instances and the other for On-Demand Instances.
    • An AWS Batch job queue that prioritizes submission to the compute environment for Spot Instances to optimize for cost.
    • An AWS Batch job definition to run DRAGEN.
    • AWS Identity and Access Management (IAM) roles and policies for the AWS Batch jobs to run.

    * The template that deploys the Partner Solution into an existing VPC skips the tasks marked by asterisks and prompts you for your existing VPC configuration.
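The Spot-first job queue described above can be sketched as a request payload. This is a minimal illustration, not the template's actual resource definition: it follows the shape of the AWS Batch CreateJobQueue request, where the compute environment with the lowest `order` value is tried first. The queue and compute environment names (`dragen-queue`, `dragen-spot`, `dragen-ondemand`) are hypothetical placeholders.

```python
from typing import Any, Dict


def build_job_queue_request(
    queue_name: str,
    spot_env: str,
    on_demand_env: str,
    priority: int = 1,
) -> Dict[str, Any]:
    """Build a CreateJobQueue-style payload that prefers Spot capacity.

    AWS Batch tries compute environments in ascending `order`, so the
    Spot environment gets order 1 and On-Demand acts as the fallback.
    """
    return {
        "jobQueueName": queue_name,
        "state": "ENABLED",
        "priority": priority,
        "computeEnvironmentOrder": [
            {"order": 1, "computeEnvironment": spot_env},
            {"order": 2, "computeEnvironment": on_demand_env},
        ],
    }


# Hypothetical names, for illustration only.
request = build_job_queue_request("dragen-queue", "dragen-spot", "dragen-ondemand")

# The environment with the lowest order value is the one tried first.
first = min(request["computeEnvironmentOrder"], key=lambda e: e["order"])
print(first["computeEnvironment"])
```

With an AWS SDK such as boto3, a payload of this shape could be passed to the Batch client's `create_job_queue` call. In this Partner Solution the queue is created for you by the template, so the sketch only illustrates the ordering.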

  •  How to deploy
  • To deploy Illumina DRAGEN, follow the instructions in the deployment guide. The deployment process includes these steps:

    1. If you don't already have an AWS account, sign up at https://aws.amazon.com.
    2. Subscribe to DRAGEN Bio-IT Platform in AWS Marketplace.
    3. Launch the Partner Solution. Each deployment takes about 15 minutes. You can choose from two options: deploy into a new VPC or deploy into an existing VPC.
    4. Test the deployment by running a DRAGEN job.
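As a rough sketch of the final step, a test job can be described as an AWS Batch SubmitJob-style request. The job name, queue name, and job definition name below are hypothetical, and the DRAGEN arguments are left as an opaque placeholder because the exact command line comes from the deployment guide, not from this page.

```python
from typing import Any, Dict, List


def build_submit_job_request(
    job_name: str,
    job_queue: str,
    job_definition: str,
    dragen_args: List[str],
    attempts: int = 1,
) -> Dict[str, Any]:
    """Build a SubmitJob-style payload for a DRAGEN test run.

    AWS Batch accepts between 1 and 10 retry attempts per job.
    """
    if not 1 <= attempts <= 10:
        raise ValueError("attempts must be between 1 and 10")
    return {
        "jobName": job_name,
        "jobQueue": job_queue,
        "jobDefinition": job_definition,
        # The container command is passed through unchanged to the job.
        "containerOverrides": {"command": list(dragen_args)},
        "retryStrategy": {"attempts": attempts},
    }


# All names and arguments here are placeholders, not values from the template.
job = build_submit_job_request(
    job_name="dragen-test",
    job_queue="dragen-queue",
    job_definition="dragen",
    dragen_args=["<DRAGEN arguments from the deployment guide>"],
    attempts=2,
)
print(job["retryStrategy"])
```

A payload like this could be passed to the boto3 Batch client's `submit_job` call once the stack outputs give you the real queue and job definition names.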

    To customize your deployment, configure the network architecture, set the desired number of virtual CPUs for the AWS Batch compute environment, specify a bid percentage for Spot Instances, and set the number of AWS Batch job retries.
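The customization settings above could be collected and checked before launching the stack, along the following lines. The parameter keys (`DesiredvCpus`, `BidPercentage`, `RetryNumber`) are hypothetical stand-ins; check the template for the actual names. The 1 to 100 range for the bid percentage is an assumption of this sketch. The output uses the `ParameterKey`/`ParameterValue` list shape that AWS CloudFormation expects for stack parameters.

```python
from typing import Dict, List


def build_stack_parameters(
    desired_vcpus: int,
    bid_percentage: int,
    retries: int,
) -> List[Dict[str, str]]:
    """Validate settings and format them as CloudFormation stack parameters."""
    if desired_vcpus < 0:
        raise ValueError("desired_vcpus must not be negative")
    # Assumed range: a percentage of the On-Demand price.
    if not 1 <= bid_percentage <= 100:
        raise ValueError("bid_percentage must be between 1 and 100")
    # AWS Batch allows between 1 and 10 attempts per job.
    if not 1 <= retries <= 10:
        raise ValueError("retries must be between 1 and 10")

    # Hypothetical parameter keys; the real template may use other names.
    settings = {
        "DesiredvCpus": desired_vcpus,
        "BidPercentage": bid_percentage,
        "RetryNumber": retries,
    }
    # CloudFormation parameter values are always passed as strings.
    return [
        {"ParameterKey": key, "ParameterValue": str(value)}
        for key, value in settings.items()
    ]


params = build_stack_parameters(desired_vcpus=16, bid_percentage=50, retries=3)
for entry in params:
    print(entry["ParameterKey"], "=", entry["ParameterValue"])
```

A lower bid percentage caps what you pay for Spot capacity but may mean jobs wait longer or fall back to the On-Demand environment more often.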

    Amazon may share user-deployment information with the AWS Partner that collaborated with AWS on this solution.  

  •  Costs and licenses
  • This Partner Solution requires a subscription to the Amazon Machine Image (AMI) for DRAGEN Bio-IT Platform, which is available with per-hour pricing from AWS Marketplace.

    You are responsible for the cost of the AWS services and any third-party licenses used while running this solution. There is no additional cost for using the solution.

    This solution includes configuration parameters that you can customize. Some of these settings, such as instance type, affect the cost of deployment. For cost estimates, refer to the pricing pages for each AWS service you use. Prices are subject to change.

    Tip: After you deploy a solution, create AWS Cost and Usage Reports to track associated costs. These reports deliver billing metrics to an Amazon Simple Storage Service (Amazon S3) bucket in your account. They provide cost estimates based on usage throughout each month and aggregate the data at the end of the month. For more information, refer to What are AWS Cost and Usage Reports?