Posted On: Sep 23, 2022

Amazon Textract is a machine learning service that automatically extracts text, handwriting, and data from any document or image. We continuously improve the underlying machine learning models based on customer feedback to provide even better accuracy. Today, we are pleased to announce quality enhancements to our text extraction feature available via the DetectDocumentText API.

The latest text detection models available via the DetectDocumentText API improve word and line extraction accuracy, in particular for E13B fonts commonly found in checks/cheques, International Bank Account Numbers (IBANs) found in banking documents, and long words (e.g., email addresses).
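
As a rough illustration of how line and word output from DetectDocumentText can be consumed, here is a minimal Python sketch using the boto3 `detect_document_text` call. The helper names `detect_text` and `extract_text`, the default Region, and the file name in the usage note are assumptions made for this example, not part of the announcement.

```python
from typing import Any, Dict, List


def detect_text(image_path: str, region_name: str = "us-east-1") -> Dict[str, Any]:
    """Call DetectDocumentText on a local image file (requires AWS credentials)."""
    # Imported lazily so the parsing helper below works without boto3 installed.
    import boto3

    client = boto3.client("textract", region_name=region_name)
    with open(image_path, "rb") as f:
        return client.detect_document_text(Document={"Bytes": f.read()})


def extract_text(response: Dict[str, Any], block_type: str = "LINE") -> List[str]:
    """Return the text of every block of the given type, in response order."""
    return [
        block["Text"]
        for block in response.get("Blocks", [])
        if block.get("BlockType") == block_type and "Text" in block
    ]
```

With credentials configured, `extract_text(detect_text("check.png"))` would list the detected lines of a hypothetical `check.png`, and passing `block_type="WORD"` would list individual words instead.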

Finally, we are pleased to announce enhancements to the underlying machine learning models that reduce latency when calling the DetectDocumentText API.
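
To observe latency from the client side, one option is to time repeated calls. The sketch below is a generic timing helper. The names `time_calls` and `summarize` are hypothetical, and in practice `call` would wrap a DetectDocumentText request. Measured durations include network round-trip time, so they only approximate service-side latency.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_calls(call: Callable[[], object], repeats: int = 5) -> List[float]:
    """Run `call` `repeats` times and return each wall-clock duration in seconds."""
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        call()
        durations.append(time.perf_counter() - start)
    return durations


def summarize(durations: List[float]) -> Dict[str, float]:
    """Return the median and maximum of the measured durations."""
    return {"median": statistics.median(durations), "max": max(durations)}
```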

This update is available in the US East (Ohio, N. Virginia), US West (N. California, Oregon), Asia Pacific (Mumbai, Seoul, Singapore, Sydney), Canada (Central), Europe (Frankfurt, Ireland, London, Paris), and AWS GovCloud (US) Regions as of September 20th.

To get started, log on to the Amazon Textract console to try out the enhanced text extraction. To learn more about Textract capabilities, visit the Amazon Textract website, developer guide, or resources page.