AWS Storage Blog
Category: Advanced (300)
How Amazon S3 Tables use compaction to improve query performance by up to 3 times
Today businesses managing petabytes of data must optimize storage and processing to drive timely insights while being cost-effective. Customers often choose Apache Parquet for improved storage and query performance. Additionally, customers use Apache Iceberg to organize Parquet datasets to take advantage of its database-like features such as schema evolution, time travel, and ACID transactions. Customers […]
Manage costs for replicated delete markers in a disaster recovery setup on Amazon S3
Many businesses recognize the critical importance of safeguarding their essential data from potential disasters such as fires, floods, or ransomware events. Designing an effective disaster recovery (DR) strategy includes thoughtfully evaluating and selecting cost-effective solutions that fulfill compliance requirements. By using Amazon S3 features such as S3 object tags, S3 Versioning, and S3 Lifecycle, you can […]
Migrating data access and Microsoft Active Directory with Amazon FSx for NetApp ONTAP
In today’s digital era, enterprises face significant challenges in data center modernization during their digital transformation journey. Traditional on-premises solutions struggle with high costs, complex management, and data growth. Organizations with intricate file-sharing systems and user permissions face difficulties in preserving user experiences and security. The tight integration of enterprise IDCs with complex Microsoft Active […]
How Amazon Ads uses Iceberg optimizations to accelerate their Spark workload on Amazon S3
In today’s data-driven business landscape, organizations are increasingly relying on massive data lakes to store, process, and analyze vast amounts of information. However, as these data repositories grow to petabyte scale, a key challenge for businesses is implementing transactional capabilities on their data lakes efficiently. The sheer volume of data requires immense computational power and […]
Derive insights from AWS DataSync task reports using AWS Glue, Amazon Athena, and Amazon QuickSight
Update (10/30/2024): On October 30, 2024, AWS DataSync launched Enhanced mode tasks, prompting updates to this blog. Updates include a new step in the “Step 2: Populate Glue catalog with task reports data using a Glue crawler” section and detailed information on the new capabilities in “Updated steps for working with task reports of new […]
Migrate data from Dropbox to Amazon S3 using Rclone
Whether you choose to operate entirely on AWS or in multicloud and hybrid environments, one of the primary reasons to adopt AWS is the broad choice of services we offer, enabling you to explore, build, deploy, and monitor your workloads. Amazon S3 is a great option for Dropbox users seeking a comprehensive storage solution. Amazon […]
Access a point in time with Amazon S3 Object Lambda
Point-in-time ‘snapshots’ enable administrators, developers, testers, and end users to quickly access a storage volume or share how it was at an earlier point-in-time. They are a longstanding approach to data protection and recovery, tracking changes within a storage system to reduce both Recovery Point Objective (RTO) and Recovery Time Objective (RTO). However, traditional snapshots […]
Enhance PostgreSQL resiliency and recoverability using Amazon FSx for OpenZFS
Organizations today often depend on mission-critical database workloads. When these databases become unavailable, it can significantly impact revenue and erode user trust. To mitigate this risk, these critical environments need the fastest possible recovery capabilities if there is a failure. This means having the lowest recovery time objective (RTO) and recovery point objective (RPO). Achieving […]
Migration to Amazon FSx for Windows File Server: cost optimization strategies and automation
Businesses today rely heavily on their file server infrastructure to store, manage, and share mission-critical data. However, maintaining and scaling these environments can be a complex and resource-intensive task. As customer needs and data volumes continue to grow, organizations often face the need to modernize their file server infrastructure. When migrating to Amazon FSx for […]
Streamline petabyte-scale data migrations with Cloud Write mode on Amazon FSx for NetApp ONTAP
International Data Corporation (IDC)1 predicts that global data creation and consumption will surge to 175 zettabytes (ZB) by 2025. As a result, organizations are looking for swift, dependable, and scalable cloud migration solutions to lift-and-shift their growing on-premises datasets into the cloud. Whether prompted by an imminent lease renewal, termination of a data center, or […]