Document Understanding Solution

Search for information across multiple scanned documents, PDFs, and images with compliance capabilities to redact information from documents

Overview

The Document Understanding Solution delivers an easy-to-use web application that ingests and analyzes files, extracts text from documents, identifies structural data (tables, key value pairs), extracts critical information (entities), and creates smart search indexes from the data. Additionally, files can be uploaded directly to and analyzed files can be accessed from an Amazon Simple Storage Service (Amazon S3) bucket in your AWS account.

You can upload and process documents in bulk and, optionally, enable Amazon Kendra support for machine learning-based enterprise search.

 

Benefits

Search and discovery

Search for information across multiple scanned documents, PDFs, and images.

Compliance

Redact information from documents.

Workflow automation

Easily plugs into your existing upstream and downstream applications.

Leverage AWS AI services

Use Amazon Textract to extract text and structural information from the files and then pass to Amazon Comprehend and Amazon Comprehend Medical for deeper analysis.

Technical details

The diagram below presents the architecture you can automatically deploy using the solution's implementation guide and accompanying AWS CloudFormation template.

The AWS CloudFormation template deploys a static web application hosted in an Amazon S3 bucket and served by an Amazon CloudFront distribution.

Training
Introduction to Artificial Intelligence

In the course, we discuss what AI is and why it is important, and take a brief look at machine learning and deep learning—which are subsets of AI—and describe how Amazon uses AI in its products.

Enroll now 
Training
Introduction to AWS Machine Learning Services

This course introduces Amazon Machine Learning and Artificial Intelligence tools that enable capabilities across frameworks and infrastructure, machine learning platforms, and API-driven services.

Enroll now 
About this deployment
Version
1.0.10
Released
03/2023
Author
AWS
Est. deployment time
30-60 mins
Estimated cost
Download implementation guide  Source code  CloudFormation template  Subscribe to RSS feed 
Deployment options
Ready to get started?
Deploy this solution by launching it in your AWS Console
Did this AWS Solution help you?
Provide feedback