Chinese book content in Mobi format is displayed as garbled characters #110

Open
cary-rowen opened this issue Feb 26, 2022 · 7 comments
Labels
bug Something isn't working Improvement Improving or fixing an existing feature

Comments

@cary-rowen
Collaborator

Describe the Problem

Chinese book content in Mobi format is displayed as garbled characters. The Epub version of the same content does not have this problem.

To Reproduce

Steps to reproduce the behavior:

  1. Download the attached sample test book: testSample.zip;
  2. Open Sample_Epub.epub in Bookworm;
  3. View the book's table of contents in the table-of-contents tree view;
  4. Open Sample_mobi.mobi in Bookworm;
  5. View the book's table of contents in the table-of-contents tree view.

Expected behavior

Steps 3 and 5 should give the same result.
P.S. The result of step 3 is correct.

Actual behavior

After step 3, the complete table of contents of the book is shown.
After step 5, the table of contents is not fully rendered, and NVDA reports the labels as "question mark", which means the characters are garbled.

The File You've Opened

testSample.zip

Desktop (please complete the following information):

  • OS: Windows 10 21H2 (x64) build 19044.1526

  • Bookworm version 2022.1a5

  • Recent settings you may have changed in Bookworm: None

Additional context

I suspect this is an encoding problem: Epub uniformly uses UTF-8 encoding, so the problem does not occur there.
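
To illustrate the kind of mismatch suspected here, below is a minimal sketch, not Bookworm's or the 'mobi' library's actual code. The helper `decode_label` and the mapping `CODEPAGE_TO_CODEC` are hypothetical names, and the code page values (1252 for Windows-1252, 65001 for UTF-8) follow the commonly documented MOBI header convention, which is an assumption on my part. It shows that a UTF-8 label decoded with the wrong codec no longer matches the original text:

```python
# Hypothetical helper, not part of Bookworm or the 'mobi' library.
# Code page values follow the commonly documented MOBI header convention
# (1252 = Windows-1252, 65001 = UTF-8); treat them as an assumption here.
CODEPAGE_TO_CODEC = {1252: "cp1252", 65001: "utf-8"}


def decode_label(raw: bytes, codepage: int) -> str:
    """Decode a table-of-contents label using the book's declared code page."""
    codec = CODEPAGE_TO_CODEC.get(codepage, "utf-8")
    # errors="replace" substitutes U+FFFD for undecodable bytes instead of raising
    return raw.decode(codec, errors="replace")


title = "第一章"
raw = title.encode("utf-8")

good = decode_label(raw, 65001)  # decoded with the matching codec
bad = decode_label(raw, 1252)    # decoded with the wrong codec

print(good == title)  # the label survives the round trip
print(bad == title)   # the label comes back garbled
```

If the table-of-contents labels go through a different decoding path than the body text, that could explain why only the tree view is affected, but this is a guess until the library code is checked.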

@mush42
Collaborator

mush42 commented Feb 26, 2022

@cary-rowen

Does this problem happen with the text of the document, or does it only happen with the table-of-contents tree view labels?

Best
Musharraf

@cary-rowen
Collaborator Author

Hi,

It only happens with the table-of-contents tree view labels.
The text of the document is displayed normally.

Cheers
Cary

@cary-rowen
Collaborator Author

cary-rowen commented Feb 28, 2022

There is also a Mobi document that cannot be opened at all. Bookworm reports this error:
Could not open file. Either the file has been damaged during download, or it has been corrupted in some other way.
Here is the document that reproduces the problem:
test_OpenError.zip

@cary-rowen
Collaborator Author

Hi @mush42
Hopefully this one will also be fixed in the near future. Let me know if you need NVDA's Simplified Chinese speech synthesizer for testing; I can send it to you by email.
Hope everything is fine.
Cheers

@cary-rowen
Collaborator Author

Hi @mush42
Is this issue on your to-do list?

Thanks

@mush42
Collaborator

mush42 commented Mar 30, 2022

Hello @cary-rowen
Yes, this one is on my to-do list.
The only blocker is that we need to change an external library called 'mobi' that we depend on.
It seems we need to fork that project, change it, and bundle a private version of it with Bookworm.

Best

@cary-rowen
Collaborator Author

Hi @mush42

Thank you, great work.

All the best

Cary

@DraganRatkovich DraganRatkovich added bug Something isn't working Improvement Improving or fixing an existing feature labels Jun 14, 2022