The codes include audio analyses of coughing and sneezing wave sounds, recorded speech of me pronouncing different letters, and other publicly available annotated and standard audio datasets.
- The code checks wave sound files of two different categories, cough and sneez. Cough and sneez files exist in the 'data' folder above.
- The code works on features extraction from the chosen wave sounds. The features extracted from the wave files include the amplitude frequency, spectrogram, Mel-frequency cepstral coeffecients and energy band ratio.
- The code includes results-clearfying comments.
- The codes use my voice as data. Three wave sounds were recorded of me saying three different letters 'a', 'e', and 'o'. The voice files are in 'data' folder, 'kenansounds'.
- The code implements Windowing, Formants, and Convertion to midi chromagram to all three wave sounds.
- Comments on the results are in the code.
- The code uses the standard 'orchset' dataset. The dataset can be downloaded from https://zenodo.org/record/1289786#.YKqUs6IzZH5.
- The code collects ground truth values from the satndard data.
- The code makes predictions using Essentia-Melodia and Crepe Algorithms and check the prediction results in terms of Voicing Recall, Voicing False Alarm, Raw Pitch Accuracy, Raw Chroma Accuracy, and over all accuracy.
- Comments on the performance of the two algorithms are at the end.