aws_textract_utils

What is the AWS Textract service?

Textract is a machine learning service that automatically extracts text, handwriting, and data from scanned documents. It goes beyond simple optical character recognition (OCR) to identify and extract data from forms and tables. Today, many companies extract data from scanned documents such as PDFs, images, tables, and forms by hand, or with simple OCR software that needs manual configuration and often has to be reconfigured when a form changes. To replace these manual, expensive processes, Textract uses machine learning to read and process documents, extracting text, handwriting, tables, and other data without manual effort.
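To make the "forms and tables" point concrete, here is a minimal sketch of how form fields could be read from a Textract response with boto3. It is not taken from this repository's app.py: the helper names (`call_textract`, `extract_form_fields`) and the default region are assumptions made for illustration. The `analyze_document` call and the `KEY_VALUE_SET` / `WORD` block layout follow the public Textract API.

```python
def call_textract(document_bytes, region_name="us-east-1"):
    """Send one document to Textract and return the raw response.

    Assumption: AWS credentials are already configured, and the region
    is only an example. The synchronous API accepts images and
    single-page PDFs; multi-page PDFs need the asynchronous API.
    """
    import boto3  # imported lazily so the parsing helper works without AWS access

    client = boto3.client("textract", region_name=region_name)
    return client.analyze_document(
        Document={"Bytes": document_bytes},
        FeatureTypes=["FORMS", "TABLES"],
    )


def _child_text(block, blocks_by_id):
    """Join the text of all WORD blocks that are children of `block`."""
    words = []
    for rel in block.get("Relationships") or []:
        if rel["Type"] != "CHILD":
            continue
        for child_id in rel["Ids"]:
            child = blocks_by_id[child_id]
            if child["BlockType"] == "WORD":
                words.append(child["Text"])
    return " ".join(words)


def extract_form_fields(response):
    """Return a {key text: value text} dict built from KEY_VALUE_SET blocks."""
    blocks_by_id = {b["Id"]: b for b in response["Blocks"]}
    fields = {}
    for block in response["Blocks"]:
        is_key = (
            block["BlockType"] == "KEY_VALUE_SET"
            and "KEY" in block.get("EntityTypes", [])
        )
        if not is_key:
            continue
        key_text = _child_text(block, blocks_by_id)
        value_text = ""
        for rel in block.get("Relationships") or []:
            if rel["Type"] == "VALUE":
                # A KEY block points at its VALUE block(s) by Id.
                value_text = " ".join(
                    _child_text(blocks_by_id[i], blocks_by_id) for i in rel["Ids"]
                )
        fields[key_text] = value_text
    return fields
```

With valid credentials, `extract_form_fields(call_textract(open("form.png", "rb").read()))` would return the detected field labels mapped to their values; `form.png` is a placeholder file name.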

$ pip install -r requirements.txt

$ streamlit run app.py

Upload a .pdf file in the app. It calls the AWS Textract service and displays the extracted result.
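The upload flow can be pictured roughly as follows. This is a hedged sketch, not the contents of this repository's app.py: the function names (`lines_from_response`, `main`), the page title, and the choice of `detect_document_text` are assumptions for illustration. The Streamlit and boto3 calls used here are part of their public APIs.

```python
def lines_from_response(response):
    """Return one row per LINE block: its text and rounded confidence."""
    return [
        {"text": block["Text"], "confidence": round(block["Confidence"], 1)}
        for block in response.get("Blocks", [])
        if block["BlockType"] == "LINE"
    ]


def main():
    """Hypothetical Streamlit page: upload a PDF, call Textract, show lines."""
    import boto3
    import streamlit as st

    st.title("AWS Textract demo")  # title text is an assumption
    uploaded = st.file_uploader("Upload a PDF", type=["pdf"])
    if uploaded is None:
        return  # nothing uploaded yet

    # Assumption: AWS credentials and a default region are configured.
    # The synchronous call handles single-page PDFs only; multi-page
    # PDFs need the asynchronous API with the file stored in S3.
    client = boto3.client("textract")
    response = client.detect_document_text(
        Document={"Bytes": uploaded.getvalue()}
    )
    st.table(lines_from_response(response))
```

Adding a `main()` call at the bottom of the script lets `streamlit run` execute the page. Keeping the parsing in `lines_from_response` means it can be checked against a saved response without calling AWS.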
