Guidance for Live Meeting Assistant on AWS

Automated meeting notes, translation, and transcription with generative AI

Overview

This Guidance shows how you can add generative artificial intelligence (generative AI) to your virtual meetings to translate languages, summarize conversations, and capture live insights. As virtual meetings become more prevalent, retaining all the information presented can be challenging for participants. The Live Meeting Assistant (LMA) on AWS collects speaker audio and metadata from a browser-based meeting app, softphone app, or audio source. It then uses large language models (LLMs) to generate customizable transcriptions, insights, and summaries, helping participants derive more value from every meeting.

How it works

These technical details feature an architecture diagram to illustrate how to effectively use this solution. The architecture diagram shows the key components and their interactions, providing an overview of the architecture's structure and functionality step-by-step.

Well-Architected Pillars

The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.

CloudFormation creates a stack in the AWS Management Console, providing an infrastructure as code mechanism that enables updates to the deployed solution. QnABot on AWS, nested within the stack, can quickly create a meeting assistant interface, and you can version it to enable updates as new features become available. Lambda divides functions into smaller, event-driven components so that you can more efficiently track the flow of data and isolate potential issues in call processing.
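As a minimal sketch of the event-driven style described above, the Python handler below decodes a batch of records in the shape Lambda receives from a Kinesis Data Streams event source, where each payload arrives base64-encoded under `Records[].kinesis.data`. The payload fields (`CallId`, `Transcript`) and all function names are hypothetical and are not taken from the LMA source code.

```python
import base64
import json


def decode_record(record: dict) -> dict:
    """Decode one Kinesis record; Lambda delivers the payload base64-encoded."""
    raw = base64.b64decode(record["kinesis"]["data"])
    return json.loads(raw)


def handler(event: dict, context: object = None) -> dict:
    """Group transcript segments by call so each call can be traced on its own."""
    segments_by_call: dict = {}
    for record in event.get("Records", []):
        payload = decode_record(record)
        # "CallId" and "Transcript" are hypothetical field names (assumption).
        call_id = payload["CallId"]
        segments_by_call.setdefault(call_id, []).append(payload["Transcript"])
    return segments_by_call


def make_record(payload: dict) -> dict:
    """Build a record in the Kinesis event shape for local experimentation."""
    data = base64.b64encode(json.dumps(payload).encode("utf-8")).decode("ascii")
    return {"kinesis": {"data": data}}
```

Keeping the decoding step in its own small function is one way to make a malformed record easy to isolate, since a failure points at a single record rather than the whole batch.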

Read the Operational Excellence whitepaper 

Amazon Cognito is tightly integrated with the AWS services that comprise the LMA outlined in this Guidance. The service provides authentication and authorization capabilities so that only authorized users can access and interact with the LMA web application and the supporting Lambda functions. Through this integration, the system controls access to its components in line with the principle of least privilege, enhancing the overall confidentiality and reliability of the meeting data processed by the LMA application.
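To illustrate the least-privilege idea, the sketch below checks group membership in a set of Amazon Cognito token claims. It assumes the token's signature and expiry have already been verified upstream and that the claims are available as a Python dictionary with `cognito:groups` as a list. The group names are hypothetical, and this Guidance does not state that LMA authorizes by group.

```python
def is_authorized(claims: dict, required_group: str) -> bool:
    """Return True only if the verified claims list the required group.

    Assumes `claims` comes from a token that was already validated;
    this function performs no signature or expiry checks itself.
    """
    groups = claims.get("cognito:groups", [])
    if isinstance(groups, str):
        # Treat a bare string as a single group name (assumption).
        groups = [groups]
    # Deny by default: a missing claim yields an empty list.
    return required_group in groups
```

For example, `is_authorized({"cognito:groups": ["MeetingAdmins"]}, "MeetingAdmins")` returns `True`, while claims without the `cognito:groups` entry are rejected.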

Read the Security whitepaper 

Application Load Balancer routes traffic requests to the call processing functions and prevents the overload of any one container. Fargate scales audio processing as needed during call events, and its default health monitoring functionality quickly alerts you to any failure of the audio processing component. AWS AppSync is a managed GraphQL service that gives the frontend a serverless, automatically scaling API. Kinesis Data Streams synchronously replicates data across three Availability Zones, providing high availability and data durability.

Read the Reliability whitepaper 

CloudFront caches assets that serve the web application on the edge so that they can quickly respond to user requests. It also supports using WebSocket, a TCP-based protocol that is useful when you need long-lived bidirectional connections between clients and servers. As a result, it can support real-time audio processing.

Read the Performance Efficiency whitepaper 

Unlike monolithic applications, which might require you to run and pay for unnecessary resources, this Guidance breaks call processing into smaller components, using discrete Lambda functions. Additionally, the DynamoDB on-demand throughput mode conserves read and write capacity and provides pay-per-request pricing so that you only pay for what you use. Finally, Fargate scales audio processing functions up and down with demand, conserving resources and optimizing costs.
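As a hedged illustration of the on-demand mode mentioned above, the sketch below builds the request parameters for a DynamoDB table that uses pay-per-request billing. The table name and key attribute names are hypothetical, and the actual LMA tables may be defined differently (for example, in the CloudFormation templates).

```python
def on_demand_table_params(table_name: str, partition_key: str, sort_key: str) -> dict:
    """Build CreateTable parameters for a table billed per request.

    With BillingMode set to PAY_PER_REQUEST, no ProvisionedThroughput
    block is supplied, so there is no read or write capacity to size.
    """
    return {
        "TableName": table_name,
        "BillingMode": "PAY_PER_REQUEST",
        "AttributeDefinitions": [
            {"AttributeName": partition_key, "AttributeType": "S"},
            {"AttributeName": sort_key, "AttributeType": "S"},
        ],
        "KeySchema": [
            {"AttributeName": partition_key, "KeyType": "HASH"},
            {"AttributeName": sort_key, "KeyType": "RANGE"},
        ],
    }


# Hypothetical names, chosen only for illustration.
params = on_demand_table_params("MeetingEvents", "PK", "SK")
```

If boto3 is installed and credentials are configured, these parameters could be passed as `boto3.client("dynamodb").create_table(**params)`. That call creates a billable resource, so it is left out of the sketch itself.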

Read the Cost Optimization whitepaper 

By using on-demand services so that you only use the resources you need, this Guidance conserves computational resources and minimizes your workload’s environmental impact. For example, CloudFront caches web assets on the edge, reducing the need to serve requests from the origin. Kinesis Data Streams is fully managed and can stream massive amounts of data on demand. And DynamoDB scales database capacity on demand so that it matches your application’s needs.

Read the Sustainability whitepaper 

Disclaimer

The sample code; software libraries; command line tools; proofs of concept; templates; or other related technology (including any of the foregoing that are provided by our personnel) is provided to you as AWS Content under the AWS Customer Agreement, or the relevant written agreement between you and AWS (whichever applies). You should not use this AWS Content in your production accounts, or on production or other critical data. You are responsible for testing, securing, and optimizing the AWS Content, such as sample code, as appropriate for production grade use based on your specific quality control practices and standards. Deploying AWS Content may incur AWS charges for creating or using AWS chargeable resources, such as running Amazon EC2 instances or using Amazon S3 storage.