-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hindi Language Support #16
Comments
It is not possible to add a recognition mode that universally supports all characters across all languages, which is what supporting the entire UTF-8 character set would imply. However, we can add support for additional languages, which can be selected in the recognize tab (see below). What Indian languages are the bulk of your documents in? |
Most of these are in hindi language.
We are trying to support one of the largest volunteer organization which
actually gave books translated in LOTS of Indian languages.
But hindi would be great support.
Thank you for the wonderful tool and your efforts.
Kind regards.
…On Fri, Apr 5, 2024, 4:05 PM Balearica ***@***.***> wrote:
It is not possible to add a recognition mode that universally supports all
characters across all languages, which is what supporting the entire UTF-8
character set would imply. However, we can add support for additional
languages, which can be selected in the recognize tab (see below). What
Indian languages are the bulk of your documents in?
recognize_lang_screencap1.png (view on web)
<https://github.com/scribeocr/scribeocr/assets/100809261/83a92b2b-01ee-469c-93b2-80b86a8c5e51>
—
Reply to this email directly, view it on GitHub
<#16>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AQJ5BXKOPCPSE5JHLRB4N5LY34U5FAVCNFSM6AAAAABFZ2GU72VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDANBQG42DKMZRHA>
.
You are receiving this because you authored the thread.Message ID:
***@***.***>
|
is there any possibility for Hindi language support? |
Languages that do not use Latin script will be added eventually, however this will take a non-trivial amount of work to implement. I currently don't have any timeline for this. |
Is there any possibility to provide UTF-8 language support.? We are working on massive trove of documents which are in UTF-8 support format. These are primarily Indian languages.
Thanks for the wonderful work.
The text was updated successfully, but these errors were encountered: