AWS HPC Blog

Tag: Genomics

Dataset of protein-ligand complexes now available in the Registry of Open Data on AWS

by Deva Priyakumar, Beryl Rabindran, Alex Iankoulski, Prathit Chatterjee, Rakesh Srivastava, Ramanathan Sethuraman, Vladimir Aladinskiy, and Yusong Wang, in High Performance Computing

This post was contributed by U. Deva Priyakumar, Rakesh Srivatsava, Prathit Chatterjee, Vladimir Aladinskiy, Ramanathan Sethuraman, Yusong Wang, Alex Iankoulski, and Beryl Rabindran. Today, we’re excited to announce the release of a comprehensive dataset featuring molecular dynamics (MD) trajectories for over 16,000 protein-ligand complexes (PLCs). This dataset, now available on AWS as part of the […]

Enabling Rapid Genomic and Multiomic Data Analysis with Illumina DRAGEN™ v4.4 on Amazon EC2 F2 Instances

Streamline your genomic and multiomic data analysis with DRAGEN on Amazon EC2 F2 instances. Our latest blog post explores the performance benefits of this hardware-accelerated solution, helping you unlock insights faster.

How Caris Life Sciences processed 400,000 RNAseq samples in 2.5 days with AWS Batch

In the race to deliver precision medicine, time is of the essence. Caris Life Sciences, a pioneer in this field, leveraged AWS Batch to build a highly scalable solution that processed hundreds of thousands of genomic samples in record time. Discover how they achieved this remarkable feat and the key services that powered their breakthrough.

Leveraging Seqera Platform on AWS Batch for machine learning workflows - Part 1 of 2

Nextflow is a popular workflow framework for genomics pipelines, but did you know you can also use it for machine learning? ML is already being used for medical imaging, protein folding, drug discovery, and gene editing. In this post, we explain how to build an example Nextflow pipeline that performs ML model training and inference for image analysis.

Running accurate, comprehensive, and efficient genomics workflows on AWS using Illumina DRAGEN v4.0

In this blog, we provide a walkthrough of running Illumina DRAGEN v4.0 genomic analysis pipelines on AWS, showing accuracy and efficiency, copy number analysis, structural variants, SMN callers, repeat expansion detection, and pharmacogenomics insights for complex genes. We also highlight some benchmarking results for runtime, cost, and concordance from the Illumina DRAGEN DNA sequencing pipeline.