A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, as well as a variety of preprocessing methods.
Updated Oct 22, 2024 - Python
This repository contains code, papers, and research on speech and audio classification.
A convolutional neural network trained to detect COVID-19, even in asymptomatic patients, using only cough recordings.
Dataset for CABA: Clasificador Automatico de Botellas por Acustica (Automatic Acoustic Bottle Classifier)
Speech Emotion Recognition (SER) is the task of recognizing human emotion and affective states from speech. It relies on the fact that voice often reflects underlying emotion through tone and pitch, the same cues that animals such as dogs and horses use to understand human emotion.
Broadcasting-residual network model zoo for Multiple Classification Tasks.
A deep learning model that can detect the presence of capuchin bird calls in audio clips
This is a space where I share my personal portfolio :)
A compilation of libraries, case studies, resources, and research papers revolving around deep learning/machine learning for audio