AWS Storage Blog
The inside story on Mountpoint for Amazon S3, a high-performance open source file client
Amazon S3 is the best place to build data lakes because of its durability, availability, scalability, and security. Hundreds of thousands of data lakes are built on S3, storing diverse sets of unstructured data for use in analytics pipelines, machine learning training, business intelligence, and more. Often, these tasks build on top of open-source analytics […]
Simplify and scale access management to shared datasets with cross-account Amazon S3 Access Points
In today’s interconnected and data centric world, businesses must have access to the right data for data-driven decision-making, ultimately driving better business results. Collecting all the relevant data takes time and capital as it requires setting up data ingestion pipelines, hiring analysts to validate and interpret the data, and incorporating data insights that influence important […]
Data migration and cost saving at scale with Amazon S3 File Gateway
Migrating data to the cloud requires experience with different data types and the ability to preserve source data structure and metadata attributes. Customers often have on-premises file data stored on traditional file servers, retaining original timestamp of data creation for varying reasons including data lifecycle management. Customers find it challenging to identify a path to […]
On-demand archival and retrieval of documents from Amazon WorkDocs to Amazon S3
Cloud storage of documents has seen rapid growth over the years as more and more customers and businesses move away from traditional physical storage. As the size and number of documents continue to grow, customers want to manage their documents and retain them using long term, durable, cost-effective document archives. Businesses such as medical research […]
Automatically compress and archive satellite imagery for Amazon S3
Satellite imagery often comes as large, high-resolution files, and organizations that work with this data typically have high storage costs. Additionally, large imagery files can take time and resources when downloaded for use with machine learning (ML), data analytics tools, or manual analyst review. Using standard compression techniques lets us achieve reductions in file size […]
Large scale migration of encrypted objects in Amazon S3 using S3 Batch Operations
Many organizations have data governance strategies or compliance requirements that mandate their data be replicated and redundant across different management accounts and global regions. Moving encrypted data at scale can often take a few additional steps due to the need to decrypt and re-encrypt objects as part of the replication process. Amazon Simple Storage Service […]
Deploying self-managed MariaDB high availability using Amazon EBS Multi-Attach
For most organizations, the availability of workloads is a key performance indicator affecting operations ensuring goods, services, and critical business transactions. Availability needs vary from workload to workload and are aligned with an organization’s business requirements and the criticality of their services. To learn more about how to architect in AWS to meet your availability […]
Modern data protection architecture on Amazon S3: Part 2
Keeping data secure and usable in unforeseen circumstances like accidental breaches, human error, and hacking is critical to business continuity and success. To effectively mitigate the impact of these events on business-critical assets, one of the recommended strategies is creating immutable, unchangeable copies of those assets and storing them in isolated, secondary accounts with restricted […]
Modern data protection architecture on Amazon S3: Part 1
Keeping data secure and usable in unforeseen circumstances like accidental breaches, human error, and hacking is critical to business continuity and success. To effectively mitigate the impact of these events on business-critical assets, one of the recommended strategies is creating immutable, unchangeable copies of those assets and storing them in isolated, secondary accounts with restricted […]
Protecting domain-joined workloads with AWS Elastic Disaster Recovery
Disaster recovery (DR) solutions for workloads that are domain-joined to Microsoft Active Directory (AD) must take into account the AD requirements of those workloads. A domain-joined workload will expect to find an AD controller to provide keys services like DNS and security related services including user and machine-based authentication. If the AD requirements are not […]