Amazon Q Business introduces support for scanned PDFs and embedded images in PDF documents

Posted on: Jul 12, 2024

Amazon Q Business is a fully managed, generative AI-powered assistant that enhances employee productivity by answering questions, providing summaries, generating content, and completing tasks based on customers' enterprise data. Across industries, users want to derive insights from documents such as invoices and tax statements, which are frequently in scanned PDF format. Starting today, Amazon Q Business users can get answers from text content in scanned PDFs and from images embedded in PDF documents.

Prior to today, customers who wanted to derive insights from scanned PDFs and images in PDF documents first had to preprocess these documents with Optical Character Recognition (OCR) to extract the text, and then ingest that text into Amazon Q Business. Now, customers can feed these documents directly into Amazon Q Business, then search and act on them without any preprocessing. With this launch, customers can simplify the process of building their own generative AI assistants using Amazon Q Business APIs or web applications.
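
As a rough illustration of ingesting a scanned PDF through the Amazon Q Business APIs without a separate OCR step, the following is a minimal sketch that uses the boto3 `qbusiness` client and its `batch_put_document` operation. The helper names, the document ID, and the file name are hypothetical, and the sketch assumes an existing application and index, working AWS credentials, and a file small enough to send inline. This announcement does not say whether any additional configuration is needed, so check the documentation before relying on it.

```python
from pathlib import Path


def build_pdf_document(doc_id: str, pdf_bytes: bytes, title: str) -> dict:
    """Build one entry for the `documents` list of a BatchPutDocument request.

    The raw PDF bytes are passed as-is; no OCR or text extraction is done here.
    """
    return {
        "id": doc_id,
        "title": title,
        "contentType": "PDF",
        "content": {"blob": pdf_bytes},
    }


def ingest_scanned_pdf(application_id: str, index_id: str, pdf_path: str) -> dict:
    """Send a single PDF file to an Amazon Q Business index (hypothetical helper)."""
    # Imported here so build_pdf_document can be used without the AWS SDK installed.
    import boto3

    path = Path(pdf_path)
    document = build_pdf_document(path.stem, path.read_bytes(), path.name)
    client = boto3.client("qbusiness")
    return client.batch_put_document(
        applicationId=application_id,
        indexId=index_id,
        documents=[document],
    )
```

A call might look like `ingest_scanned_pdf(app_id, index_id, "invoice-001.pdf")`, where `app_id` and `index_id` are placeholders for the identifiers of your own application and index.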

This feature is available in all AWS Regions where Amazon Q Business is available. To learn more about the support for scanned PDFs and embedded images, visit the documentation page or refer to the blog post "Improve productivity when processing scanned PDFs using Amazon Q Business". To explore Amazon Q, visit the Amazon Q website.