Amazon Web Services
This video from AWS re:Invent 2023 explores building and optimizing data lakes on Amazon S3. The speakers discuss S3's architecture and how it scales to support massive data volumes and workloads. They cover best practices for performance, cost optimization, and data governance using S3 features like Intelligent-Tiering and Access Grants. The session also introduces S3 Express One Zone for low-latency workloads. Finally, Ryan Blue discusses Apache Iceberg, an open table format that enables transactional data lakes and universal analytics storage on S3. The presenters highlight how S3's scale and innovations enable modern data architectures that combine the best of data warehouses and data lakes.