AWS Open Source Blog
Category: Storage
fMRI data preprocessing on AWS using fMRIprep
A typical fMRI study often produces imaging data of terabytes or more. Storing and preprocessing this data can be challenging on a single computer because it often has neither enough disk space to store the data nor enough computing power to preprocess it. Traditionally, researchers use a combination of cloud-based storage and on-premises high-performance clusters […]
EKS support for the EBS CSI driver
Today, we are announcing EKS support for the EBS Container Storage Interface driver, an initiative to create unified storage interfaces between container orchestrators such as Kubernetes and storage vendors like AWS. A History of Storage in Kubernetes As originally conceived, containers were a great fit for stateless applications. However, there was no provision for persistent […]
Announcing PartiQL: One query language for all your data
Data is being gathered and created at rates unprecedented in history. Much of this data is intended to drive business outcomes but, according to the Harvard Business Review, “…on average, less than half of an organization’s structured data is actively used in making decisions…” The root of the problem is that data is typically spread […]
Best Practices for Running Ansys Fluent Using AWS ParallelCluster
Using HPC (high performance computing) to solve Computational Fluid Dynamics (CFD) challenges has become common practice. As the growth from HPC workstation to supercomputer has slowed over the last decade or two, compute clusters have increasingly taken the place of single, big SMP (shared memory processing) supercomputers, and have become the ‘new normal’. Another, more […]
Centralized Container Logging with Fluent Bit
September 8, 2021: Amazon Elasticsearch Service has been renamed to Amazon OpenSearch Service. Visit the website to learn more. by Wesley Pettit and Michael Hausenblas AWS is built for builders. Builders are always looking for ways to optimize, and this applies to application logging. Not all logs are of equal importance. Some require real-time analytics, […]
Open Source News Roundup: April 22, 2019
Upcoming Events RailsConf (April 30-May 2 in Minneapolis, Minnesota) – Lounge & Lanyard Sponsor. Workshop on Going Serverless with Ruby on AWS Lambda by Alex Wood and Jingyi Chen. PyCon (May 1-9 in Cleveland, Ohio) – Platinum Sponsor. Come find us at Booth #439 to see how AWS <3 Python. Percona Live (May 28-30 in […]
Build and Run the Open Distro For Elasticsearch SQL Plugin with Elasticsearch OSS
Note: These tutorial instructions are not current and will not be updated. OpenSearch was announced on April 12, 2021. Visit opensearch.org to learn more. Open Distro for Elasticsearch comprises four plugins: Security — supports node-to-node encryption, five types of authentication, role-based access controls, audit logging, and cross-cluster search. Alerting — notifies you when data from […]
Parallelizing S3 Workloads with s5cmd
This open source project comes from our customer community. It was developed by Peak Games to assist with their own S3 workflow, and includes features such as tab completion and built-in wild card support for files in S3 commands. Enjoy! – Deirdré Background Up until now, working on multiple objects on Amazon S3 from the […]