Skip to content
#

tesseract-4

Here are 17 public repositories matching this topic...

A simple implementation of ocrmypdf and tesseract with flask for hosting to a server as an API. The code was written on CentOS7. This code works on linux only as ocrmypdf library does not have support on windows because of missing leptonica dll. For windows consider https://github.com/lakshay1296/OCR_Conversion_JPEG2PDF. This is image to ocr pdf…

  • Updated Dec 26, 2019
  • Python

Legal Document Summarize is a Python-based application that automates the process of extracting text from legal documents using Optical Character Recognition (OCR), summarizing the extracted text, and translating it into multiple languages, including Hindi, Marathi, and Bengali. This tool is designed to make legal documents available in more lang.

  • Updated Oct 7, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the tesseract-4 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tesseract-4 topic, visit your repo's landing page and select "manage topics."

Learn more