Amazon Textract

Automatically extract printed text, handwriting, and data from any document

Analyze up to 1,000 pages per month free for 3 months with the AWS Free Tier

Extract text and structured data such as tables and forms from documents using artificial intelligence (AI), with no configuration or templates necessary.

Go beyond simple Optical Character Recognition (OCR) by extracting relationships, structure, and text from documents.

Improve security and compliance through robust data privacy, encryption, security controls, and support for compliance standards such as HIPAA, GDPR, and more.

Easily implement human reviews with Amazon Augmented AI (Amazon A2I) to manage nuanced or sensitive workflows and audit predictions on an ongoing basis.

How it works

Amazon Textract is a machine learning service that automatically extracts text, handwriting, and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify and extract data from forms and tables. Today, many companies extract data from scanned documents such as PDFs, images, tables, and forms either manually or with simple OCR software that requires manual configuration, which often has to be redone when a form changes. To replace these manual and expensive processes, Textract uses machine learning to read and process any type of document, accurately extracting text, handwriting, tables, and other data without manual effort. You can quickly automate document processing and act on the extracted information, whether you are automating loan processing or extracting information from invoices and receipts. Textract can extract the data in minutes rather than hours or days. Additionally, you can add human reviews with Amazon Augmented AI to provide oversight of your models and to check sensitive data.
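As a rough illustration of what "extracting data from forms" can look like in practice, the sketch below pairs form keys with their values from an AnalyzeDocument-style response. The helper names (`form_fields`, `analyze_file`) and the `SAMPLE` data are hypothetical and written by hand for this example. The block layout (`KEY_VALUE_SET` blocks linked to `WORD` blocks through `VALUE` and `CHILD` relationships) follows the documented response shape as we understand it, so treat this as a starting point, not a reference implementation.

```python
from typing import Any, Dict

Block = Dict[str, Any]


def _text_of(block: Block, by_id: Dict[str, Block]) -> str:
    """Join the text of a block's CHILD blocks into one string."""
    parts = []
    for rel in block.get("Relationships", []):
        if rel["Type"] != "CHILD":
            continue
        for child_id in rel["Ids"]:
            child = by_id[child_id]
            if child["BlockType"] == "WORD":
                parts.append(child["Text"])
            elif child["BlockType"] == "SELECTION_ELEMENT":
                # Checkboxes carry a status instead of text.
                parts.append(child["SelectionStatus"])
    return " ".join(parts)


def form_fields(response: Dict[str, Any]) -> Dict[str, str]:
    """Map each form key found in the response to its value text."""
    by_id = {b["Id"]: b for b in response["Blocks"]}
    fields: Dict[str, str] = {}
    for block in response["Blocks"]:
        is_key = (block["BlockType"] == "KEY_VALUE_SET"
                  and "KEY" in block.get("EntityTypes", []))
        if not is_key:
            continue
        values = []
        for rel in block.get("Relationships", []):
            if rel["Type"] == "VALUE":
                values.extend(_text_of(by_id[i], by_id) for i in rel["Ids"])
        fields[_text_of(block, by_id)] = " ".join(values)
    return fields


def analyze_file(path: str) -> Dict[str, Any]:
    """Send a local image to Textract (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the parsing helpers work offline

    client = boto3.client("textract")
    with open(path, "rb") as f:
        return client.analyze_document(
            Document={"Bytes": f.read()},
            FeatureTypes=["FORMS", "TABLES"],
        )


# Hand-written sample in the response shape; not real service output.
SAMPLE = {"Blocks": [
    {"Id": "k1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["KEY"],
     "Relationships": [{"Type": "VALUE", "Ids": ["v1"]},
                       {"Type": "CHILD", "Ids": ["w1", "w2"]}]},
    {"Id": "v1", "BlockType": "KEY_VALUE_SET", "EntityTypes": ["VALUE"],
     "Relationships": [{"Type": "CHILD", "Ids": ["w3"]}]},
    {"Id": "w1", "BlockType": "WORD", "Text": "Invoice"},
    {"Id": "w2", "BlockType": "WORD", "Text": "total:"},
    {"Id": "w3", "BlockType": "WORD", "Text": "$120.00"},
]}

if __name__ == "__main__":
    print(form_fields(SAMPLE))
```

With credentials configured, `form_fields(analyze_file("invoice.png"))` would run the same parsing on a live response; the file name here is only a placeholder.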

Use cases

Financial Services

Accurately extract critical business data such as mortgage rates, applicant names, and invoice totals from a variety of financial forms, including mortgage applications and invoices, so that you can process loan and mortgage applications in minutes.

Healthcare and Life Sciences

Better serve your patients and insurers by extracting important patient data from health intake forms, insurance claims, and pre-authorization forms. Keep data organized and in its original context, and eliminate manual review of output.

Public Sector

Easily extract relevant data from government-related forms like small business loans, federal tax forms or business applications with a high degree of accuracy. 

How to get started

Find out how Amazon Textract works

Read about OCR, form extraction, table extraction, and more.

Explore Amazon Textract features »

Try the AWS Free Tier


Start using Amazon Textract for free today.

Sign up for a free account »

Explore Amazon Textract


Get started building with Amazon Textract in the AWS Management Console.

Get started in the console »
