AWS Storage Blog

Category: Technical How-to

Automate Amazon S3 File Gateway on Amazon EC2 with Terraform by HashiCorp

Infrastructure as Code (IaC) means managing IT infrastructure through code and automation tools, replacing manual management that is prone to errors, slow scaling, and overhead. For organizations implementing a hybrid cloud infrastructure, automation can ensure uniformity, scalability, and cost reduction while getting cloud resources provisioned efficiently. Automated provisioning and configuration enable organizations to adapt, innovate, and […]

Transferring data from Google Cloud Filestore to Amazon EFS using AWS DataSync

Organizations may need to transfer large numbers of files from one cloud provider to another for reasons such as workload migration, disaster recovery, or a requirement to process data in other clouds. Data transfers typically require end-to-end encryption, the ability to detect changes, object validation, network throttling, monitoring, and cost optimization. Building a […]

Retaining Amazon EC2 AMI snapshots for compliance using Amazon EBS Snapshots Archive

Many organizations need to retain data for a number of years to comply with regulations or IT requirements. They move cold data to archive storage in the cloud to optimize storage costs while staying compliant. For example, an Amazon Machine Image (AMI) is a critical data resource that many customers want to retain long term to meet compliance requirements. Until […]

Getting visibility into storage usage in multi-tenant Amazon S3 buckets

SaaS providers with multi-tenant environments use cloud solutions to dynamically scale their workloads as customer demand increases. As their cloud footprint grows, having visibility into each end-customer’s storage consumption becomes important to distribute resources accordingly. An organization can use storage usage data per customer (tenant) to adjust its pricing model or better plan its budget. […]

Consolidate and query Amazon S3 Inventory reports for Region-wide object-level visibility

Organizations around the world store billions of objects and files representing terabytes to petabytes of data. Data is often owned by different teams, departments, or business units, spanning multiple locations. As the number of datastores, locations, and owners grows, you need a way to cost-effectively maintain visibility on important characteristics of your data, including based […]

Migrate compute from Google Cloud Platform (GCP) to AWS using AWS Application Migration Service

Customers using Google Cloud Platform (GCP) might explore the option of spreading or transitioning their cloud usage away from GCP to alternative providers for various reasons, including cost evaluations, data centralization, or changes in business requirements. Regardless of the motivating factors, adopting effective migration solutions can lead to time and cost savings while reducing downtime. […]

Identify cold objects for archiving to Amazon S3 Glacier storage classes

Update (02/13/2024): Consider Amazon S3 Lifecycle transition fees that are charged based on the total number of objects being transitioned, the destination storage class (listed on the Amazon S3 pricing page), as well as the additional metadata charges applied. You can use the S3 pricing calculator to estimate the total upfront and monthly costs by […]

Derive insights from AWS DataSync task reports using AWS Glue, Amazon Athena, and Amazon QuickSight

Update (9/22/2023): Step 6b has been updated to automatically detect and update the Amazon Athena table schema when the crawler detects large data transfer values, reported in bytes, that would exceed the table’s maximum integer value when stored. As customers scale their migration of large datasets with millions of files across multiple data transfers, they are faced […]

How Continental uses Mountpoint for Amazon S3 in autonomous driving development – accelerating simulation performance by 20%

Continental and AWS have been collaborating to create the Continental Automotive Edge (CAEdge) framework – a modular hardware and software environment that connects the vehicle to the cloud and features virtual workbenches. The platform includes a virtual workbench offering numerous options to develop, supply, and maintain software-intensive system functions. It supports a wide range of automotive […]

Optimizing performance of Apache Spark workloads on Amazon S3

This blog covers performance metrics, optimizations, and configuration tuning specific to OSS Spark running on Amazon EKS. For customers using or considering Amazon EMR on EKS, refer to the service documentation to get started and this blog post for the latest performance benchmark. Performance is top of mind for customers running streaming, extract transform load […]