Introducing Amazon Textract: Now in Preview—easily extract text and data from virtually any document

Posted on: Nov 28, 2018

Amazon Textract is a service that automatically extracts text and data from scanned documents. Amazon Textract goes beyond simple optical character recognition (OCR) to also identify the contents of fields in forms and information stored in tables.

Many companies today extract data from documents and forms through manual data entry which is slow and expensive, or using simple OCR software, which is often inaccurate and typically produces output that requires extensive post-processing to put the extracted content in a format that is usable by a developer's application. Amazon Textract uses machine learning to instantly read virtually any type of document to accurately extract text and data without the need for any manual review or custom code. Amazon Textract allows developers to quickly automate document workflows, processing millions of document pages in a few hours.

To get started with Amazon Textract, sign up here for preview access and register here for the upcoming webinar.

Introducing Amazon Textract: Now in Preview—easily extract text and data from virtually any document

Learn

Resources

Developers

Help