encodec

Here are 13 public repositories matching this topic...

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

streaming duplex speech moshi speech-representation encodec gpt-4o speech-language-model spoken-dialogue-models modal-alignment intreaction mini-omni llama-omni wavtokenizer

Updated Nov 11, 2024

youngsheen / SimVQ

SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

audio image vector-quantization vqgan encodec

Updated Nov 7, 2024
Python

lucadellalib / audiocodecs

A collection of audio codecs with a standardized API

text-to-speech pytorch speech-synthesis codec quantization mimi dac self-supervised-learning encodec wavlm speech-coding speechtokenizer speech-language-model

Updated Nov 5, 2024
Python

jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

semantic text-to-speech codec acoustic dac speech-representation audio-representation encodec soundstream music-representation-learning gpt4o speech-language-model

Updated Oct 23, 2024
Python

ZhikangNiu / encodec-pytorch

Unofficial implementation of High Fidelity Neural Audio Compression

pytorch audio-processing audio-compression encodec

Updated Aug 15, 2024
Python

habla-liaa / encodecmae

Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

audio deep-learning representation-learning masked-autoencoder encodec

Updated Jul 24, 2024
Python

BreakingY / Nvidia-Video-Codec

NVIDIA video hardware decoding, rendering, software/hardware encoding, and writing to an MP4 file

ffmpeg mp4 codec encodec hardcodec decodec

Updated Jul 22, 2024
C

rsxdalv / one-click-installers-tts

Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos

bark rvc tortoise demucs encodec musicgen vocos

Updated Jul 6, 2024
Shell

mrpep / encodecmae-to-wav

Experiments sonifying frame-level encodecmae features and encodecmae summary vectors using generative audio models.

audio diffusion encodec generative-ai encodecmae

Updated Feb 29, 2024
Python

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications such as text-to-speech synthesis and music generation.

tts speech-synthesis codec speech-to-text audio-generation encodec voicecloning audio-quantization

Updated Jan 25, 2024
Python

octu0 / go-encodec-cpp

Go binding for encodec.cpp

bindings audio-codec encodec

Updated Jan 10, 2024
Go

Supremolink81 / TTSCeleb

A TTS app that lets you clone the voice of any person you wish.

natural-language-processing text-to-speech pytorch tts bark beats streamlit encodec

Updated Aug 14, 2023
Python

dadaxian / HuffmanCodec

A decoder written using Huffman-tree encoding and decoding principles to compress and decompress files

huffman-coding encodec

Updated Sep 13, 2022
C++
