A repo providing a solution for denoising and separating two-speaker mixed noisy speech, using a BSRNN-inspired deep learning network.
View demos here.
Key | Value |
---|---|
Datasets | AISHELL-3 & NoiseX-92 |
FLOPs | 2.408G |
Weights Size | 61.95M |
Parameters | 16.15M |
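Conceptually, a BSRNN-style model splits the spectrogram into frequency bands, embeds each band, and alternates RNNs along the time axis and the band axis. The PyTorch sketch below is only meant to convey that structure; the uniform band widths, feature size, and layer sizes are assumptions for illustration, and the actual network in this repo differs in band layout, dimensions, and decoder.

```python
# Illustrative band-split RNN layer in the spirit of BSRNN; NOT the repo's exact
# implementation (band layout and sizes here are assumptions).
import torch
import torch.nn as nn

class BandSplitRNNSketch(nn.Module):
    """Split the complex spectrogram into equal-width bands, embed each band,
    then model dependencies along time and along bands with BLSTMs."""

    def __init__(self, n_freq=256, n_bands=8, feat_dim=64):
        super().__init__()
        self.n_bands = n_bands
        self.band_width = n_freq // n_bands          # assumes n_freq divides evenly
        # per-band embedding: (real, imag) x band_width -> feat_dim
        self.embed = nn.ModuleList(
            nn.Sequential(nn.LayerNorm(2 * self.band_width),
                          nn.Linear(2 * self.band_width, feat_dim))
            for _ in range(n_bands)
        )
        self.time_rnn = nn.LSTM(feat_dim, feat_dim, batch_first=True, bidirectional=True)
        self.time_proj = nn.Linear(2 * feat_dim, feat_dim)
        self.band_rnn = nn.LSTM(feat_dim, feat_dim, batch_first=True, bidirectional=True)
        self.band_proj = nn.Linear(2 * feat_dim, feat_dim)

    def forward(self, spec):
        # spec: (batch, 2, n_freq, n_frames) with real/imag channels
        b, _, f, t = spec.shape
        bands = spec.reshape(b, 2, self.n_bands, self.band_width, t)
        feats = []
        for k, emb in enumerate(self.embed):
            # (b, t, 2 * band_width) -> (b, t, feat_dim)
            x = bands[:, :, k].permute(0, 3, 1, 2).reshape(b, t, -1)
            feats.append(emb(x))
        z = torch.stack(feats, dim=1)                # (b, n_bands, t, feat_dim)

        # model dependencies across time, one band at a time (residual connection)
        zt = z.reshape(b * self.n_bands, t, -1)
        zt = self.time_proj(self.time_rnn(zt)[0]) + zt
        z = zt.view(b, self.n_bands, t, -1)

        # model dependencies across bands, one frame at a time (residual connection)
        zb = z.permute(0, 2, 1, 3).reshape(b * t, self.n_bands, -1)
        zb = self.band_proj(self.band_rnn(zb)[0]) + zb
        return zb.view(b, t, self.n_bands, -1).permute(0, 2, 1, 3)
```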
Naive Case (two-speaker mix only, no added noise)
Method | SI-SNR (dB) | PESQ (WB) | PESQ (NB) | STOI |
---|---|---|---|---|
Raw dataset | 0.002 | 1.240 | 1.473 | 0.681 |
BSRNN (modified) | 12.195 | 2.453 | 2.866 | 0.901 |
Difficult Case (two-speaker mix with added noise)
Method | SI-SNR (dB) | PESQ (WB) | PESQ (NB) | STOI |
---|---|---|---|---|
Raw dataset | -0.597 | 1.146 | 1.379 | 0.656 |
BSRNN (modified) | 11.384 | 2.212 | 2.661 | 0.880 |
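For reference, SI-SNR can be computed with a few lines of NumPy, while PESQ and STOI come from existing packages (pesq is a stated dependency; pystoi and the 16 kHz sample rate below are assumptions about the evaluation setup, not taken from the repo's code).

```python
# Scale-invariant SNR (SI-SNR) in dB, plus example PESQ/STOI calls.
import numpy as np
from pesq import pesq     # the pesq dependency mentioned below
from pystoi import stoi   # assumed STOI implementation (pip install pystoi)

def si_snr(estimate: np.ndarray, target: np.ndarray, eps: float = 1e-8) -> float:
    """SI-SNR between an estimated waveform and its clean reference."""
    estimate = estimate - estimate.mean()
    target = target - target.mean()
    # project the estimate onto the target to discard scale differences
    s_target = np.dot(estimate, target) / (np.dot(target, target) + eps) * target
    e_noise = estimate - s_target
    return 10 * np.log10(np.dot(s_target, s_target) / (np.dot(e_noise, e_noise) + eps))

# clean, enhanced: 1-D float arrays at a common sample rate (assumed 16 kHz):
# print(si_snr(enhanced, clean))
# print(pesq(16000, clean, enhanced, "wb"), pesq(16000, clean, enhanced, "nb"))
# print(stoi(clean, enhanced, 16000))
```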
The project targets Python 3.8. Install the requirements with:
pip install -r requirements.txt
See this if installing the pesq dependency fails.
Modify config/test.yml
with your own dataset path, and run the following command:
python speech-preprocess/test.py
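If you prefer to point the config at your data programmatically, something like the sketch below works; note that the key name dataset_path is a placeholder, so check config/test.yml for the real field names.

```python
# Hedged sketch: load config/test.yml, override the dataset location, write it back.
# The key "dataset_path" is hypothetical and must be matched to the actual file.
import yaml

with open("config/test.yml", "r", encoding="utf-8") as f:
    cfg = yaml.safe_load(f)

cfg["dataset_path"] = "/path/to/your/test/dataset"   # hypothetical key and path

with open("config/test.yml", "w", encoding="utf-8") as f:
    yaml.safe_dump(cfg, f, allow_unicode=True)
```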
Follow the order given in data/index_data.py
and data/make_data.py
to configure your raw dataset.
Then run the following commands to build the index and generate the data:
python data/index_data.py
python data/make_data.py
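Conceptually, the indexing step gathers the file lists that make_data.py later mixes. The rough illustration below is an assumption: the real index format and directory layout are defined in data/index_data.py, and the paths shown are placeholders.

```python
# Assumed illustration of an indexing step: collect AISHELL-3 speech and
# NoiseX-92 noise file paths into a single index for later mixing.
import json
from pathlib import Path

speech_root = Path("/path/to/AISHELL-3")   # hypothetical dataset locations
noise_root = Path("/path/to/NoiseX-92")

index = {
    "speech": sorted(str(p) for p in speech_root.rglob("*.wav")),
    "noise": sorted(str(p) for p in noise_root.rglob("*.wav")),
}

with open("data_index.json", "w", encoding="utf-8") as f:
    json.dump(index, f, indent=2)
```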
This project uses the AISHELL-3 and NoiseX-92 datasets.
Then, modify the config file config/train.yml
and run the following command:
python speech-preprocess/train.py
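Training a two-speaker separator typically needs a permutation-invariant objective, since the order of the two estimated sources is arbitrary. The sketch below shows a permutation-invariant negative SI-SNR loss purely as an illustration; whether train.py uses this exact objective is an assumption.

```python
# Illustrative permutation-invariant SI-SNR training loss for two speakers
# (an assumption about the objective, not confirmed from this repo).
import torch

def si_snr_torch(est, ref, eps=1e-8):
    # est, ref: (..., samples); returns SI-SNR in dB per source
    est = est - est.mean(dim=-1, keepdim=True)
    ref = ref - ref.mean(dim=-1, keepdim=True)
    proj = (est * ref).sum(-1, keepdim=True) / ((ref * ref).sum(-1, keepdim=True) + eps) * ref
    noise = est - proj
    return 10 * torch.log10((proj * proj).sum(-1) / ((noise * noise).sum(-1) + eps))

def pit_si_snr_loss(est, ref):
    # est, ref: (batch, 2, samples); try both speaker assignments, keep the better one
    perm_keep = si_snr_torch(est, ref).mean(-1)
    perm_swap = si_snr_torch(est, ref.flip(dims=[1])).mean(-1)
    return -torch.maximum(perm_keep, perm_swap).mean()   # negative SI-SNR to minimize
```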