AWS Storage Blog
Tag: Amazon S3 Data Lake
Best practices for data lake protection with AWS Backup
Data lakes, powered by Amazon Simple Storage Service (Amazon S3), provide organizations with the availability, agility, and flexibility required for modern analytics approaches to gain deeper insights. Protecting sensitive or business-critical information stored in these S3 buckets is a high priority for organizations. AWS Backup for Amazon S3 makes it easier to centrally automate the […]
How Arc XP lowered data transfer costs by $500k per year with Amazon CloudFront and Lambda@Edge on AWS
The Washington Post, an American daily newspaper company, delivers digital news content using Arc XP’s digital experience platform. Arc XP originated in The Post and has grown into a Software-as-a-Service (SaaS) business used by publishers, broadcasters, and brands to create, host, and monetize engaging content for over 1,500 websites globally. Photo Center is an Arc […]
Automatically archive and restore data with Amazon S3 Intelligent-Tiering
Customers of all sizes, in all industries, are using data lakes to transform data from a cost that must be managed, to a business asset. From time to time, data scientists and business analysts need to restore subsets of historical datasets for longitudinal studies, machine learning retraining, and more. However, users commonly write queries that don’t […]
See what’s in store for Amazon S3 at AWS re:Invent 2020-2021
UPDATE 9/8/2021: Amazon Elasticsearch Service has been renamed to Amazon OpenSearch Service. See details. This time last year, the AWS Storage services and product marketing teams were entrenched in Las Vegas feverishly putting the final touches on content for re:Invent 2019 launches, sessions, workshops, and building makeshift workstations in a hotel ballroom for the biggest […]
How Zalando built its data lake on Amazon S3
Founded in 2008, Zalando is Europe’s leading online platform for fashion and lifestyle with over 32 million active customers. I am a lead data engineer at Zalando and a steady contributor to the company’s cloud journey. In this blog post, I cover how Amazon Simple Storage Service (Amazon S3) became a cornerstone of the data […]
Migrate HDFS files to an Amazon S3 data lake with AWS Snowball Edge
The need to store newly connected data grows as the sources of data increase. Enterprise customers use Hadoop Distributed File System (HDFS) as their data lake storage repository for on-premises Hadoop applications. Customers are migrating their data lakes to AWS for a more secure, scalable, agile, and cost-effective solution. For HDFS migrations where high-speed transfer […]
Free AWS Loft Events: Attend “AWS Storage Days” in San Francisco or NYC
As we gear up for AWS re:Invent 2019 December 2 – 6, we want to ensure you are up to speed on the full portfolio of AWS storage services. In San Francisco September 10 – 11 and in NYC September 24- 25, we will be conducting ‘AWS Storage Days’ at the AWS Loft locations. These […]
Build a data lake on Amazon S3: Recent customer case studies
Amazon Simple Storage Service (S3) is the largest and most performant object storage service for structured and unstructured data and the storage service of choice to build a data lake. With Amazon S3, you can cost-effectively build and scale a data lake of any size in a secure environment where data is protected by 99.999999999% […]
New on the APN Blog: Building a Data Lake Foundation for Salesforce in AWS
Over on the AWS Partner Network Blog, a recent blog post caught my eye and I thought it was worth sharing with our growing storage audience. The post, Building a Data Lake Foundation for Salesforce in AWS, is written by Simon Ejsing, Director of Analytics at FinancialForce. Simon’s post outlines their approach to unlocking the potential of […]