I love learning new things, but actually acquiring and retaining knowledge isn't much fun. Anki has helped me organize what I'm learning and keep from forgetting it, without my having to design a spaced repetition schedule myself. I don't like staring at a screen for long stretches, though, so I've often wished I could do my flashcards without looking at one. I finally decided to make it happen, with a strong focus on reducing latency so the experience feels as streamlined as possible.
- Text-To-Speech: ElevenLabs/OpenAI
- Speech-To-Text: Faster-Whisper (I used the `tiny.en` model, about 0.5s latency on my M1 Mac)
- Language Model: ChatGPT (I used `gpt-3.5-turbo-1106`)
- Others: PyWebView, Silero VAD
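To give a rough idea of how these pieces could fit together, here is a minimal sketch of a single hands-free review: read the card aloud, listen for the answer, then let the language model judge it. This is not my actual code. `Card`, `review_card`, and the `speak`, `listen`, and `grade` callables are hypothetical names standing in for the TTS, STT, and LLM stages, and no real ElevenLabs, Faster-Whisper, or OpenAI calls are made. It also times each stage, since latency is the thing I cared about most.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class Card:
    """A flashcard with a question (front) and an expected answer (back)."""
    front: str
    back: str


def review_card(
    card: Card,
    speak: Callable[[str], None],        # stand-in for the text-to-speech stage
    listen: Callable[[], str],           # stand-in for the speech-to-text stage
    grade: Callable[[Card, str], bool],  # stand-in for the language-model stage
) -> Tuple[bool, Dict[str, float]]:
    """Run one voice review and return (passed, seconds spent per stage)."""
    timings: Dict[str, float] = {}

    start = time.perf_counter()
    speak(card.front)
    timings["speak"] = time.perf_counter() - start

    start = time.perf_counter()
    answer = listen()
    timings["listen"] = time.perf_counter() - start

    start = time.perf_counter()
    passed = grade(card, answer)
    timings["grade"] = time.perf_counter() - start

    return passed, timings


if __name__ == "__main__":
    # Toy stand-ins so the sketch runs without any audio or network access.
    passed, timings = review_card(
        Card("What is the capital of France?", "Paris"),
        speak=print,
        listen=lambda: "paris",
        grade=lambda card, answer: answer.strip().lower() == card.back.lower(),
    )
    print("passed:", passed)
    print({stage: round(seconds, 4) for stage, seconds in timings.items()})
```

Passing the stages in as plain callables makes it easy to swap one provider for another (say, ElevenLabs for OpenAI) and to see which stage is responsible for the delay.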