Skip to content

Latest commit

 

History

History
25 lines (25 loc) · 2.49 KB

README.md

File metadata and controls

25 lines (25 loc) · 2.49 KB

Application: toText

License Static Bradge Static Badge Setup app --- alt text

Libraries that helped make the project❤️:

Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge

The main functions of the application:

  • Optical text recognition (OCR) from images.
  • Speech recognition from audio files.
  • Convert PDF files to text documents, extract images and tables

Installing the application

Link to the installer file archive: Yandex.disk

About the launch

Install the libraries using the command:

pip install -r requirements.txt

The file to run is main.py - main.py
Files with information about the application in the docs folder.
The settings are stored in JSON files, which are located in Settings

Additional information and notes:

The weight of the final application is ~288 Mb But during recognition, language sets and recognition models (in speech recognition) will be downloaded to your computer as needed. Unfortunately, version 1.1 has almost no GPU support. The ffmpeg package must be installed for speech recognition to work, if it is not installed.