Skip to content

Releases: pszemraj/vid2cleantxt

whisper 🤫

12 Oct 00:37
fee7375
Compare
Choose a tag to compare

Add support for OpenAI's whisper model through transformers ✔️

  • this makes vid2cleantxt significantly more robust/useful for day-to-day cases
  • the ability of the openAI models to translate does carry over to vid2cleantxt*! transcribe a Chinese video to English with model_id="openai/whisper-small"for example

* all testing here was done en-to-en, official testing/support for other languages to come later :)

  • What's Changed

Full Changelog: v0.2...v0.2.5

package + python API

10 Oct 09:28
f6cb118
Compare
Choose a tag to compare

Finally, a python API to transcribe things instead of using a CLI or custom notebook!

See the example here on Colab.

high-level API

Install with pip:

pip install git+https://github.com/pszemraj/vid2cleantxt.git

Use in python:

import vid2cleantxt

text_output_dir, metadata_output_dir = vid2cleantxt.transcribe.transcribe_dir(
    input_dir="path/to/video/files",
    chunk_length=15,
)

What's Changed

New Contributors

Full Changelog: v0.1.21...v0.2

v0.1.21 - neuspell workaround

24 Feb 01:26
cae85b7
Compare
Choose a tag to compare
  • this version is a bug fix: in NeuSpell, there is a bug loading model via the recommended API, so added code to use SymSpell if that is the case automatically. Check the log file to see/confirm that this is happening when you transcribe.

What's Changed

  • Add workaround for neuspell initialization bug by @pszemraj in #8

Full Changelog: v0.1.2...v0.1.21

v0.1.2

28 Jan 19:26
309c420
Compare
Choose a tag to compare

What's Changed

Next Release

  • Adding PDF generation from text files post-transcription.

Full Changelog: https://github.com/pszemraj/vid2cleantxt/commits/v0.1.2