Posted On: Dec 15, 2022

With Amazon Athena, you can run SQL queries on data stored in relational, non-relational, object, and custom data sources without the need to pre-process or move data to another storage solution. Starting today, you can use Athena to query real-time streaming data held in Amazon Managed Streaming for Apache Kafka (MSK) and self-managed Apache Kafka.

Federated queries in Athena allow you to use your SQL expertise to extract insights from multiple data sources and for use cases spanning interactive analysis, business intelligence dashboards, and more. Today’s release further expands the number and type of data sources that you can query with Athena’s standard SQL interface. For example, you can now run analytical queries on real-time streaming data held in a Kafka topic and join it with data in additional Kafka topics or data in your Amazon S3 data lake. This reduces the friction of having to first configure MSK to write data to Amazon S3 before it can be analyzed with Athena.