2024
The Institut Pasteur and AWS are analysing the world's DNA, using a public database
Institut Pasteur, a leading French virology research centre, processed 20 petabytes of DNA data in a record 30 hours, using AWS Batch on a cluster of 2.18 million AWS Graviton cores.
Benefits
30 hours
Completed a workload requiring 30 million vCPU hours in 30 hours of elapsed time, with 2.18 million vCPUs mobilised at peak
20 petabytes
First exhaustive use of a 20-petabyte DNA database
Overview
To date, less than 0.01% of existing viruses have been identified, and among the countless species still unknown may lie the culprit of a future pandemic. Following the Covid-19 crisis, and to make future threats easier to identify, a research project at the Institut Pasteur called "IndexThePlanet" set about analysing and mapping the DNA of the entire living world, using a public database. Processing some 20 petabytes of data, however, required infrastructure on a matching scale. This is the purpose of the partnership with Amazon Web Services (AWS), which provided the researchers with a cluster of more than 2 million vCPUs to carry out this massive task.
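
To put the headline figures in perspective, the short Python sketch below works through the arithmetic behind them. It uses only the numbers quoted in this case study (30 million vCPU hours, 30 hours, 2.18 million vCPUs at peak, 20 petabytes). The throughput estimate is an illustration only: it assumes, hypothetically, that every byte is read exactly once and that a petabyte means 10^15 bytes, neither of which is stated in the source.

```python
# Figures quoted in the case study.
TOTAL_VCPU_HOURS = 30_000_000  # compute required by the workload
ELAPSED_HOURS = 30             # wall-clock duration of the run
PEAK_VCPUS = 2_180_000         # vCPUs mobilised at peak
DATASET_PETABYTES = 20         # size of the public DNA database

# Average number of vCPUs that must run concurrently to deliver
# the total compute within the elapsed time.
avg_vcpus = TOTAL_VCPU_HOURS / ELAPSED_HOURS

# Average concurrency as a fraction of the quoted peak.
avg_to_peak_ratio = avg_vcpus / PEAK_VCPUS

# How long the same work would take on a single vCPU, in years.
single_vcpu_years = TOTAL_VCPU_HOURS / (24 * 365)

# ASSUMPTION (not from the source): each byte is read once,
# and 1 PB = 10**15 bytes.
bytes_total = DATASET_PETABYTES * 10**15
throughput_gb_s = bytes_total / (ELAPSED_HOURS * 3600) / 10**9

print(f"Average concurrent vCPUs: {avg_vcpus:,.0f}")
print(f"Average as share of peak: {avg_to_peak_ratio:.1%}")
print(f"Single-vCPU equivalent:   {single_vcpu_years:,.0f} years")
print(f"Implied read throughput:  {throughput_gb_s:,.1f} GB/s (assumed)")
```

The average concurrency comes out well below the quoted peak, which is consistent with a cluster that ramps up and down over the course of the run rather than holding 2.18 million vCPUs for the full 30 hours.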

About Institut Pasteur
Founded by Louis Pasteur in 1887, the Institut Pasteur is a world-renowned French biomedical research centre conducting cutting-edge scientific research on infectious diseases and public health.