Amazon S3 Object Lambda using Amazon Textract

S3 Object Lambda Access Point → S3 Object Lambda → Amazon Textract

Pattern for extracting key value pair from documents for intelligent document processing

This demonstrates how to setup AWS S3 Object Lambda to invoke Amazon Textract. Amazon Textract can be used on an Object Lambda Access point to provide the extract of a document PDF or image.
For example, if an enterprise has a large volume of PDF documents and does not want to run a batch job due to the cost of processing of extracting or processing all the documents, S3 Object Lambda provides a way to extract only the OCR document without major changes to the architecture, optimizing cost.

< Back to all patterns

GitHub icon Download this pattern (.zip)

GitHub icon View this pattern on GitHub


Clone repo

git clone https://github.com/aws-samples/serverless-patterns/cd serverless-patterns/s3-object-lambda-textract

Deploy

sam deploy


Testing

See the GitHub repo for detailed testing instructions.

Cleanup

Delete the stack: sam delete.

Created by:

Hemish Soni

Hemish Soni

Sr Partner Solution Architect, Serverless Enthusiast

Follow on LinkedIn