Amazon Textract is a fully managed machine learning service that automatically extracts text and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify, understand, and extract data from forms and tables.
AWS Textract Activities is a wrapper around the Amazon Textract API that you can use when designing a workflow in Auteros.
This combination helps you build automation solutions that need to process unstructured documents.
Before you can use these activities, you need to set up an IAM user in AWS and obtain the AccessKey, SecretKey, and Region information.
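To show what the three values are for, here is a minimal Python sketch of how they map onto a Textract client in boto3, the AWS SDK for Python. It is an illustration only, not how the Auteros activities are implemented. The helper names `textract_client_kwargs` and `make_textract_client` are hypothetical, and reading the values from the standard AWS environment variables is an assumption.

```python
import os


def textract_client_kwargs(env=None):
    """Map AccessKey, SecretKey, and Region onto boto3 client arguments.

    Hypothetical helper: it reads the standard AWS environment variables
    so that credentials are not hard-coded in the script.
    """
    env = os.environ if env is None else env
    return {
        "aws_access_key_id": env["AWS_ACCESS_KEY_ID"],          # AccessKey
        "aws_secret_access_key": env["AWS_SECRET_ACCESS_KEY"],  # SecretKey
        "region_name": env["AWS_DEFAULT_REGION"],               # Region
    }


def make_textract_client(env=None):
    """Create a Textract client from the three values."""
    # Imported lazily so the helper above works without boto3 installed.
    import boto3  # requires `pip install boto3`

    return boto3.client("textract", **textract_client_kwargs(env))
```

The IAM user also needs permission for the Textract actions the workflow calls, for example `textract:AnalyzeDocument`.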
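The Analyze Document activity is a wrapper around the AnalyzeDocument API, which analyzes a document synchronously.
The Start Document Analysis activity is a wrapper around the StartDocumentAnalysis API, which starts an asynchronous analysis job.
The Get Document Analysis activity is a wrapper around the GetDocumentAnalysis API, which retrieves the results of an asynchronous analysis job.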
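For reference, the three underlying API calls can be sketched in Python with a boto3 Textract client. This is a rough illustration of the APIs the activities wrap, not the activities' own code. The function names, the polling interval, and the `max_polls` limit are assumptions. The `client` argument is expected to be an object such as `boto3.client("textract")`.

```python
import time


def analyze_document(client, document_bytes, feature_types=("TABLES", "FORMS")):
    """AnalyzeDocument: send the document bytes and return the detected blocks."""
    response = client.analyze_document(
        Document={"Bytes": document_bytes},
        FeatureTypes=list(feature_types),
    )
    return response["Blocks"]


def start_document_analysis(client, bucket, key, feature_types=("TABLES", "FORMS")):
    """StartDocumentAnalysis: start a job for a document stored in S3."""
    response = client.start_document_analysis(
        DocumentLocation={"S3Object": {"Bucket": bucket, "Name": key}},
        FeatureTypes=list(feature_types),
    )
    return response["JobId"]


def get_document_analysis(client, job_id, poll_seconds=5.0, max_polls=60):
    """GetDocumentAnalysis: wait for the job, then collect every page of results."""
    for _ in range(max_polls):
        response = client.get_document_analysis(JobId=job_id)
        status = response["JobStatus"]
        if status != "IN_PROGRESS":
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"Job {job_id} did not finish after {max_polls} polls")

    if status not in ("SUCCEEDED", "PARTIAL_SUCCESS"):
        raise RuntimeError(f"Job {job_id} ended with status {status}")

    # Large results are paginated; follow NextToken until it disappears.
    blocks = list(response.get("Blocks", []))
    while "NextToken" in response:
        response = client.get_document_analysis(
            JobId=job_id, NextToken=response["NextToken"]
        )
        blocks.extend(response.get("Blocks", []))
    return blocks
```

The asynchronous pair reads its input from an S3 bucket, so the document has to be uploaded there before the job is started.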
The Dump TextractDocument To Excel activity is a helper that writes the result from the AWS API to an Excel file. The result has already been parsed into an easy-to-understand format and stored in a TextractDocument object.
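To give a feel for the kind of parsing this involves, the sketch below rebuilds the rows of a table from the raw `Blocks` list that the Textract API returns and writes them to a file. It makes several assumptions: the layout of the TextractDocument object is not described here, so the code works on the raw API blocks instead; it writes CSV, which Excel can open, rather than a native Excel file, so that only the Python standard library is needed; and the function names `table_rows` and `dump_table_to_csv` are hypothetical.

```python
import csv


def table_rows(blocks):
    """Rebuild the rows of the first TABLE block from its CELL and WORD blocks."""
    by_id = {block["Id"]: block for block in blocks}

    def child_ids(block):
        # A block lists its children under Relationships entries of type CHILD.
        return [
            child_id
            for rel in block.get("Relationships", [])
            if rel["Type"] == "CHILD"
            for child_id in rel["Ids"]
        ]

    table = next((b for b in blocks if b["BlockType"] == "TABLE"), None)
    if table is None:
        return []

    cells = {}
    for cell_id in child_ids(table):
        cell = by_id[cell_id]
        if cell["BlockType"] != "CELL":
            continue
        words = [
            by_id[word_id]["Text"]
            for word_id in child_ids(cell)
            if by_id[word_id]["BlockType"] == "WORD"
        ]
        # RowIndex and ColumnIndex are 1-based positions within the table.
        cells[(cell["RowIndex"], cell["ColumnIndex"])] = " ".join(words)

    if not cells:
        return []
    n_rows = max(row for row, _ in cells)
    n_cols = max(col for _, col in cells)
    return [
        [cells.get((row, col), "") for col in range(1, n_cols + 1)]
        for row in range(1, n_rows + 1)
    ]


def dump_table_to_csv(blocks, path):
    """Write the rebuilt table to a CSV file that Excel can open."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        csv.writer(handle).writerows(table_rows(blocks))
```

A document with several tables would need the same logic applied to each TABLE block, for example one worksheet or one file per table.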