-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Add Paddle OCR recognition to Bookworm #129
Comments
Hello @cary-rowen Best |
Hi @mush42 As far as I know, those screen reader bundles add less than 20 MB, Paddle OCR has a different recognition model, we might consider adding a light built-in model to Bookworm. |
Hello @cary-rowen I've been studying paddle OCR and the ways it can be added to Bookworm without bringing in a huge number of additional dependencies. The major issue is that most of the development documentation is written in Chinese, but through Google Translate, I was able to understand the basics of the process. Paddle OCR can be embedded using one of the two following ways:
Paddle has official support for ONNX runtime, but I couldn't find any official confirmation from paddle developers as to whether all of the models support ONNX runtime. ONNX is fairly fast, but a major downside of this is that the majority of OCR processing happens in python, which is slower of course. The above technical details are notes to my future self, and for other interested parties. Best |
Hi @mush42 , great to hear you're working on this. thanks |
Is your feature request related to a problem? Please describe.
Whether it is Windows10OCR or Tesseract OCR, the recognition effect in Simplified Chinese environment is not ideal.
Describe the solution you'd like
There is a open source project called Paddle OCR. This project supports multiple languages. In the Chinese and English scenarios I tested, its recognition rate exceeded Windows OCR and Tesseract OCR.
It would be nice if Bookworm could add a Paddle recognition engine
Describe alternatives you've considered
None
Additional context
Screen readers in China have distributed it as part of their screen readers, and so far everything seems to be working fine.
Paddle OCR repo: https://github.com/PaddlePaddle/PaddleOCR/
The text was updated successfully, but these errors were encountered: