-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ability to listen to audiobooks #98
Comments
Hello @DraganRatkovich The first question that pops into mind is whether there is any standard format for audio books. Technically speaking, it is very easy to support playing audio in Bookworm. But this is out of scope for an eBook reader. Another related feature is support for EPUB 3 media overlays. But this is hampered by the lack of adoption for such feature by publishers. Best Musharraf |
Good day @mush42 |
@mush42, we can support them the same way Voice Dream Reader does, I think. Just take in either an MP3/wav file, or a zip file of audio files that get turned into their own chapters. |
My only concern is how would be able to keep this consistent? |
@TheQuinbox If yes, how would that look like? |
@mush42, I'm not sure exactly how the bookmarking code works, but I'd assume that it stores an index in the text? We could store the position that the user was at in the audio file. |
@TheQuinbox Yes, that's exactly how it is done. Still the issues raised by @pauliyobo need to be resolved before we embark on this. Best |
Good day to all, |
What about converting the audiobook to a text book first then process it like usual? This isn't as simple as just sending audio to a speech to text engine, but I think it would be the most straightforward way to accomplish the given task using the all the features readily available now. This is theoretically similar to performing OCR in image in order to make it text, but in practice it would depend on how the audiobook defined it's sections, how it reads it's titles, etc. Some of them have little soundtracks to acknowledge end of sections, others use the narrator voice, etc. |
Hello @iuriguilherme Could you give more technical details on what exactly do you mean by converting audiobook to text book format? I can understand the Daisy format which is basically .mp3 files with extra html and stuff to handle sections, chapters etc properly, but I can't understand correctly how it is possible to convert the mp3 file recorded in the studio to a text file. I know of several encryption methods for preserving the digital rights of audiobooks, but they are completely different things. |
Through the use of a speech-to-text engine (STT). EDIT: an example using google stt: https://github.com/googleapis/python-speech/blob/main/samples/snippets/quickstart.py |
@iuriguilherme Then what's the point of integrating the audiobook listening feature into Bookworm if it needs to be converted into a textbook? Or if you mean text-to-speech technology instead of STT? Do you have examples of the types of books mentioned? |
No, I mean exactly converting speech to text. Because that is what allows all the processing BEFORE you use a text to speech (TTS) to interact with the user. I use this approach with hearing/speaking robots. In fact, every talking robot you see out there is essentially a chatbot which converts what it hears to text, process it using neural models then convert the text reply to speech so the robot can answer. |
@iuriguilherme Sounds really interesting. Perhaps the lead developer might consider this as I don't really have any knowledge of programming or other development. |
I don't have good knowledge of how to programatically use Cortana (the windows builtin STT engine), I only know the cloud api services. But the logic is what I described. |
In terms of a standard audiobook format, M4b comes to mind. It is at least something that I would like to see if this feature gets added, given that the format has chapter tags built in that most modern media players support. |
Is your feature request related to a problem? Please describe.
No
Describe the solution you'd like
I wonder what the community thinks to add the ability to listen to an audiobook in this program? Well, I know that you can listen to an audiobook in any media player, but listening to an audiobook with the ability to save bookmarks to quickly jump to the right section, make notes, etc., would probably be amazing. Opinions of others are very welcome
Describe alternatives you've considered
Additional context
It would be great to hear from the developers @mush42 @MichelSuch @cary-rowen @pauliyobo about this feature.
The text was updated successfully, but these errors were encountered: