Amazon Textract

Automatically extract printed text, handwriting, layout elements, and data from any document

Amazon Intelligent document processing delivers 73% ROI.

Drive higher business efficiency and faster decision making while reducing costs.

Extract key insights with high accuracy from virtually any document.

Scale up or scale down the document processing pipeline to quickly adapt to market demands.

Automate data processing securely with data privacy, encryption, and compliance standards.

How it works

Amazon Textract is a machine learning (ML) service that automatically extracts text, handwriting, layout elements, and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify, understand, and extract specific data from documents. Today, many companies extract data from scanned documents such as PDFs, images, tables, and forms either manually or with simple OCR software that requires manual configuration (which often must be updated when the form changes). To replace these manual and expensive processes, Textract uses ML to read and process any type of document, accurately extracting text, handwriting, tables, and other data with no manual effort. You can use one of our pretrained or custom features to quickly automate document processing, whether you’re automating loan processing or extracting information from invoices and receipts. Textract also lets you customize our pretrained features to meet the document processing needs specific to your business. Textract can extract the data in minutes instead of hours or days.
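
As a rough illustration of the text extraction described above, here is a minimal Python sketch that uses the AWS SDK for Python (boto3). It assumes boto3 is installed and AWS credentials are configured. The function names `extract_lines` and `detect_lines`, the default region, and the file path in the usage note are placeholders chosen for this example, not part of Textract itself.

```python
def extract_lines(response):
    """Return the text of every LINE block in a Textract response dict."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == "LINE"
    ]


def detect_lines(image_path, region_name="us-east-1"):
    """Send a local image to Textract and return its detected lines.

    The region and the local path are assumptions for this sketch.
    """
    # Imported here so extract_lines can be used without boto3 installed.
    import boto3

    client = boto3.client("textract", region_name=region_name)
    with open(image_path, "rb") as f:
        # Synchronous text detection on the raw document bytes.
        response = client.detect_document_text(Document={"Bytes": f.read()})
    return extract_lines(response)
```

Calling `detect_lines("scanned-invoice.png")` with a hypothetical local file would return the detected lines as a list of strings. Keeping the parsing in `extract_lines` separate from the service call makes it possible to test the parsing against a saved response.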
What is Amazon Textract?
Introduction to Amazon Textract
Automate document processing with Amazon Textract

Use cases

Financial services

Accurately extract critical business data such as mortgage rates, applicant names, and invoice totals across a variety of financial forms to process loan and mortgage applications in minutes.
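
To make this concrete, the sketch below shows one way to turn a form-analysis response into field name and value pairs, such as an applicant name. It assumes boto3 with configured AWS credentials. The helper names (`block_text`, `extract_key_values`, `analyze_form`) and the default region are hypothetical choices for this example.

```python
def block_text(block, blocks_by_id):
    """Join the WORD children of a block into a single string."""
    words = []
    for rel in block.get("Relationships", []):
        if rel["Type"] == "CHILD":
            for child_id in rel["Ids"]:
                child = blocks_by_id[child_id]
                if child["BlockType"] == "WORD":
                    words.append(child["Text"])
    return " ".join(words)


def extract_key_values(response):
    """Map each form key to its value text in a Textract response dict."""
    blocks_by_id = {b["Id"]: b for b in response["Blocks"]}
    pairs = {}
    for block in response["Blocks"]:
        is_key = (
            block["BlockType"] == "KEY_VALUE_SET"
            and "KEY" in block.get("EntityTypes", [])
        )
        if not is_key:
            continue
        value_parts = []
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                # Follow the link from the key block to its value block(s).
                for value_id in rel["Ids"]:
                    value_parts.append(
                        block_text(blocks_by_id[value_id], blocks_by_id)
                    )
        pairs[block_text(block, blocks_by_id)] = " ".join(value_parts)
    return pairs


def analyze_form(image_path, region_name="us-east-1"):
    """Analyze a local form image; path and region are assumptions."""
    import boto3  # imported lazily so the parsing helpers work offline

    client = boto3.client("textract", region_name=region_name)
    with open(image_path, "rb") as f:
        response = client.analyze_document(
            Document={"Bytes": f.read()}, FeatureTypes=["FORMS"]
        )
    return extract_key_values(response)
```

The parsing step is kept independent of the service call so that it can be checked against a stored response before running it on real applications.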

Healthcare and life sciences

Better serve your patients and insurers by extracting important patient data from health intake forms, insurance claims, and pre-authorization forms. Keep data organized and in its original context, and eliminate manual review of output.

Public sector

Easily extract relevant data from government-related forms such as small business loans, federal tax forms, and business applications with a high degree of accuracy.

How to get started

Find out how Amazon Textract works

Read about OCR, form extraction, table extraction, and more.

Explore Amazon Textract features »

Try the AWS Free Tier


Start using Amazon Textract for free today.

Sign up for a free account »

Explore Amazon Textract


Get started building with Amazon Textract in the AWS Management Console.

Get started in the console »
