
🐳 DEEP DIVE 🐬

Description

Final project of Le Wagon Data Science Bootcamp, batch #️⃣8️⃣0️⃣2️⃣. It was developed in two weeks, between February 28th and March 11th, 2022, by three beginners in Python and Deep Learning.

Objective

Goal of the project: build a deep learning classification model.

  • Input: sound recording of an unknown marine mammal song (.wav format).
  • Output: ranking of corresponding marine mammal species from most to least probable.
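
As a rough sketch of this inference flow (the function and argument names below are hypothetical, not the project's actual API), the output ranking can be obtained by sorting the model's predicted class probabilities in descending order:

```python
import numpy as np

def rank_species(model, spectrogram, species_names):
    """Return (species, probability) pairs from most to least probable."""
    probs = model.predict(spectrogram[np.newaxis, ...])[0]  # batch of one
    order = np.argsort(probs)[::-1]                         # descending
    return [(species_names[i], float(probs[i])) for i in order]
```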

Source Data

All the data used to train our model was downloaded from the Watkins Marine Mammal Sound Database. We used the sounds available in the “Best of cuts” section of the site (around 1,700 recordings). We intended to work on the “All cuts” section (around 16,000 recordings) but ran into RAM issues that we could not solve within the allotted two weeks.

Workflow

Data selection

From the raw dataset, we selected:

  1. The number of families and species to observe (we chose to work with all 8 families and 31 species)
  2. The minimum duration (we chose none, i.e. we kept all samples)
  3. The ‘quality’ of the audio recordings (we chose to work only with clean samples, i.e. no external noises such as boats, rain, or icebergs, and no multi-species recordings where 2 or more species are heard)

The generated dataset amounted to approximately 900 audio samples.
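
For illustration, this selection could be expressed as a filter over a metadata table; the file name and column names below are hypothetical, not those of the actual project:

```python
import pandas as pd

# Hypothetical metadata table describing each recording.
metadata = pd.read_csv("watkins_metadata.csv")

selected = metadata[
    metadata["is_clean"]            # no boat, rain, iceberg... noises
    & (metadata["n_species"] == 1)  # single-species recordings only
]
# No minimum-duration filter: every remaining sample is kept.
print(selected["species"].nunique(), "species,", len(selected), "samples")
```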

Preprocessing Workflow

Before preprocessing, we identified two main issues:

  1. The species in our selected dataset were extremely unbalanced
  2. The audio samples had different durations and would thus produce spectrograms of different lengths, which our model could not process

To address both problems, we agreed on the following preprocessing workflow:

Target duration: the fixed length of every audio signal processed by our model. We deemed 5 seconds sufficient to observe and hear significant patterns.

Step 1: Train - Validation - Test split on the dataset, as we will perform some Data Augmentation solely on the Train set.
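
A minimal sketch of such a split, assuming the selected samples sit in a pandas DataFrame with a hypothetical species label column (the 70/15/15 ratios are illustrative, not the project's actual choice):

```python
from sklearn.model_selection import train_test_split

def split_dataset(samples, label_col="species", seed=42):
    """Stratified 70/15/15 train/validation/test split (ratios illustrative)."""
    train, temp = train_test_split(
        samples, test_size=0.30, stratify=samples[label_col], random_state=seed)
    val, test = train_test_split(
        temp, test_size=0.50, stratify=temp[label_col], random_state=seed)
    return train, val, test
```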

Step 2: Train set (a code sketch of these operations follows the list):

  • Check class balance in terms of duration (through a barplot)
  • For over-represented classes:
    • samples >= 5s: slice them into consecutive 5s slices (+ randomly pad the last one if >= 3s)
    • samples < 5s: pad randomly
  • For under-represented classes:
    • samples >= 5s: slice them into consecutive 5s slices three times, each time starting at a different offset; then apply White Noise and Random Gain to each slice, generating 2 additional signals per slice
    • samples < 5s: pad randomly
  • Check class balance again and adapt the preprocessing if needed
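
A minimal sketch of the padding, slicing, and augmentation helpers, assuming the signals are 1-D numpy arrays loaded at librosa's default 22050 Hz sampling rate (the noise and gain parameters are illustrative; only the 5 s / 3 s thresholds come from the workflow above):

```python
import numpy as np

SR = 22050          # sampling rate (librosa's default, an assumption)
TARGET = 5 * SR     # 5-second target length, in samples

def pad_randomly(signal, target=TARGET):
    """Zero-pad a short signal, placing it at a random offset."""
    missing = target - len(signal)
    left = np.random.randint(0, missing + 1)
    return np.pad(signal, (left, missing - left))

def slice_signal(signal, target=TARGET, min_tail=3 * SR):
    """Cut a long signal into consecutive 5 s slices; keep the last,
    incomplete slice (randomly padded) only if it lasts at least 3 s."""
    slices = [signal[i:i + target] for i in range(0, len(signal), target)]
    kept = [s for s in slices if len(s) == target]
    if len(slices[-1]) < target and len(slices[-1]) >= min_tail:
        kept.append(pad_randomly(slices[-1], target))
    return kept

def add_white_noise(signal, level=0.005):
    """Augmentation: add Gaussian noise scaled to the signal's amplitude."""
    return signal + level * np.abs(signal).max() * np.random.randn(len(signal))

def random_gain(signal, low=0.5, high=1.5):
    """Augmentation: rescale the signal by a random factor."""
    return signal * np.random.uniform(low, high)
```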

Step 3: Validation and Test sets:

  • samples >= 5s: slice them into consecutive 5s slices (+ randomly pad the last one if >= 3s)
  • samples < 5s: pad randomly

Step 4: Convert all audio signals to mel spectrograms (i.e. numpy arrays)

Note: the audio signals must be converted to mel spectrograms because our CNN expects fixed-size, image-like 2D inputs and cannot process raw audio signals directly.
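
A minimal sketch of this conversion using librosa (the n_mels value is illustrative, not the project's actual setting):

```python
import librosa
import numpy as np

def to_mel_spectrogram(path, sr=22050, duration=5.0, n_mels=128):
    """Load a 5 s clip and return its log-scaled mel spectrogram
    as a 2-D numpy array of shape (n_mels, time_frames)."""
    y, _ = librosa.load(path, sr=sr, duration=duration)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)
```

Because all signals share the same duration and sampling rate, every array has the same shape, which is exactly what the model requires.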

Model training

The CNN architecture and training history can be found in /03_model_training/notebooks and /models.
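
For orientation only, a small CNN for spectrogram classification could look like the sketch below (an illustrative stand-in, not the project's actual architecture; the input shape assumes 128 mel bins and roughly 216 time frames for a 5 s clip at 22050 Hz):

```python
from tensorflow.keras import layers, models

N_CLASSES = 31  # one output per species

model = models.Sequential([
    layers.Input(shape=(128, 216, 1)),        # mel bins x time frames x 1 channel
    layers.Conv2D(16, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Conv2D(32, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(N_CLASSES, activation="softmax"),  # probabilities to rank
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",  # integer species labels
              metrics=["accuracy"])
```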
