Amazon Web Services
This workshop introduces Apache Iceberg, an open table format for large analytical datasets in data lakes. Christopher Sharkey, an AWS Solutions Architect, demonstrates how to set up and use Iceberg tables on AWS with Amazon EMR and Amazon Athena. The session covers what Iceberg is, its benefits, and its technical architecture, followed by hands-on demonstrations of common operations such as creating tables, inserting data, and running time travel queries. It highlights Iceberg's data management features, including ACID transactions, schema evolution, and hidden partitioning, which make it useful for organizations working with large-scale analytical data.
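
As a rough illustration of the operations mentioned above (creating a table, inserting data, evolving the schema, and running a time travel query), the following sketch assembles Athena-style SQL statements as Python strings. It is not taken from the workshop itself: the database `workshop_db`, the table `orders`, its columns, and the bucket `example-bucket` are all hypothetical, and the statements would still need to be submitted through the Athena console or an API client to run.

```python
# Minimal sketch: build Athena SQL for a basic Iceberg workflow.
# All names below (database, table, columns, S3 bucket) are hypothetical
# placeholders, not values from the workshop.

def iceberg_statements(database: str, table: str, location: str) -> dict:
    """Return Athena-style SQL statements for an Iceberg table, keyed by step."""
    fq_name = f"{database}.{table}"
    return {
        # 'table_type' = 'ICEBERG' marks the table as Iceberg in Athena.
        # day(order_ts) is a partition transform: queries filter on order_ts
        # directly, without a separate partition column (hidden partitioning).
        "create": (
            f"CREATE TABLE {fq_name} "
            "(order_id bigint, amount double, order_ts timestamp) "
            "PARTITIONED BY (day(order_ts)) "
            f"LOCATION '{location}' "
            "TBLPROPERTIES ('table_type' = 'ICEBERG')"
        ),
        # Each insert is committed atomically as a new table snapshot.
        "insert": (
            f"INSERT INTO {fq_name} "
            "VALUES (1, 19.99, TIMESTAMP '2024-01-01 10:00:00')"
        ),
        # Schema evolution: add a column without rewriting existing data files.
        "evolve": f"ALTER TABLE {fq_name} ADD COLUMNS (customer_id bigint)",
        # Time travel: read the table as it was one hour ago.
        "time_travel": (
            f"SELECT * FROM {fq_name} "
            "FOR TIMESTAMP AS OF (current_timestamp - interval '1' hour)"
        ),
    }


statements = iceberg_statements(
    database="workshop_db",
    table="orders",
    location="s3://example-bucket/iceberg/orders/",
)

for step, sql in statements.items():
    print(f"-- {step}\n{sql};\n")
```

Keeping the statements in one function makes it easy to reuse the same table name and location across steps; the exact column types and partition transform would depend on the dataset used in a real session.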