AWS Storage Blog
Category: Expert (400)
Customization and field maintenance with AWS Snowball Edge – sideloading AMIs
When the AWS Snowball Edge launched in 2016, if you wanted to run Amazon Elastic Compute Cloud (Amazon EC2) instances on the device, you were required to specify an Amazon Machine Image (AMI) to be installed on it at time of ordering. You were then able to launch Amazon EC2 instances on the device based […]
Enabling SMB access for serverless workloads with Amazon FSx
Many customers adopting a cloud-first strategy prioritize serverless technologies and cloud file storage that offer automatic scaling, built-in high availability, and a pay-for-use billing model to increase agility and optimize costs. As customers adopt serverless architectures, they may need shared data access on a persistent storage layer. While AWS Lambda includes a 512-MB temporary file […]
Copying objects greater than 5 GB with Amazon S3 Batch Operations
Update (3/4/2022): Added support for Glacier Instant Retrieval storage class. Update (4/19/2022): Included the copy destination prefix parameter in the Amazon CloudFormation template. Update (10/26/2022):Added performance guidance and best practices, and included template optimized for copying objects restored from archive to a different storage class. A large number of customers store their data in Amazon […]
Monitoring CloudEndure Disaster Recovery with AWS Lambda
Many organizations must monitor and track their disaster recovery (DR) initiatives to meet business and audit objectives. CloudEndure Disaster Recovery provides fast and reliable recovery of physical, virtual, and cloud-based servers into AWS. It uses email alerts and has a high-level dashboard for monitoring DR jobs. However, many organizations require more robust reporting mechanisms. This […]
Validate your disaster recovery solution and simplify compliance reporting on AWS
Data protection is a key element of compliancy, and organizations must deploy controls to manage the protection of their data and handle operational disruptions. With ongoing configuration and resource changes within IT infrastructure, it can be challenging to continuously and efficiently validate, maintain, and report on compliance to ensure that internal policies and regulatory standards […]
Automate data recovery validation with AWS Backup
Your data may be your most valuable asset. Disaster events that affect your workloads can result in a loss of data. A disaster is an event that causes a serious negative impact on your business. Having backups of your data helps minimize the impact of these disaster events by giving you the ability to recover […]
Event-driven data transfer to container-shared storage on AWS
Businesses using data lake solutions built on Amazon S3 often want their data science teams to have access to that same data for machine learning or analytics projects deployed on tools like RStudio Server and Shiny. To do so, they can easily deploy these tools in the cloud using Amazon ECS or Amazon EKS serverless containers with AWS Fargate, and can access […]
Optimizing storage management with Amazon FSx for Lustre storage quotas
Data-intensive use cases across industries are getting larger each year. Use cases that require multi-user file systems, such as user shares for data science or computational engineering, can benefit from cloud solutions to help manage file storage consumption. In this blog post, I walk you through the Amazon FSx for Lustre storage quota feature, discussing […]
Implementing a backup solution with AWS Storage Gateway
Backups are the insurance policy for our data. We hope to never use them, but if the time comes when we need them, they had better be there for us. At a high level, there are two different variations: local and offsite. A local backup can be as simple as copying data to another physical […]
Clustered storage simplified: GFS2 on Amazon EBS Multi-Attach enabled volumes
One of the design patterns for high availability of mission critical applications is the use of shared storage. This architectural pattern enables you to access storage from multiple hosts simultaneously, making your applications resilient to node failures. Customers with demanding transaction processing systems, analytics workloads, or high performance computing clusters need highly available, high-performance storage […]