Artificial Intelligence
PDF document pre-processing with Amazon Textract: Visuals detection and removal
Amazon Textract is a fully managed machine learning (ML) service that automatically extracts printed text, handwriting, and other data from scanned documents that goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables. Amazon Textract can detect text in a variety of documents, including financial reports, medical records, […]
Extracting custom entities from documents with Amazon Textract and Amazon Comprehend
July 2024: This post was reviewed and updated for accuracy. Amazon Textract is a machine learning (ML) service that makes it easy to extract text and data from scanned documents. Textract goes beyond simple optical character recognition (OCR) to identify the contents of fields in forms and information stored in tables. This allows you to […]
