Archana Inapudi | Artificial Intelligence

Talk to your slide deck using multimodal foundation models on Amazon Bedrock – Part 3

In Parts 1 and 2 of this series, we explored ways to use the power of multimodal FMs such as Amazon Titan Multimodal Embeddings, Amazon Titan Text Embeddings, and Anthropic’s Claude 3 Sonnet. In this post, we compare the approaches from an accuracy and pricing perspective.

Talk to your slide deck using multimodal foundation models hosted on Amazon Bedrock – Part 2

In Part 1 of this series, we presented a solution that used the Amazon Titan Multimodal Embeddings model to convert individual slides from a slide deck into embeddings. We stored the embeddings in a vector database and then used the Large Language-and-Vision Assistant (LLaVA 1.5-7b) model to generate text responses to user questions based on […]

Author: Archana Inapudi
