Neural network based SAD

If you want to train a neural network for speech activity detection on the ham_radio database follow these steps:

Clone this repository and install it with pip. (We assume that Cython and numpy are already installed)
Download the database with

    wget -qO- https://zenodo.org/record/5175960/files/ham_radio.tar.gz.parta{a,b,c} \
	| tar -C /PATH/TO/HAM_RADIO_DB/ -zx --checkpoint=10000 --checkpoint-action=echo="%u/5530000 %c"

where /PATH/TO/HAM_RADIO_DB has to be replaced with the chosen database directory

Set the variable HAM_RADIO_JSON to the file name the database json should be written to export HAM_RADIO_JSON=/PATH/TO/JSON
Create a database json with

python -m ham_radio.database.ham_radio.create_json \
    with database_path=/PATH/TO/HAM_RADIO_DB

Set a directory to which to write all models with export STORAGE_ROOT=/PATH/TO/MODEL_DIR
Start a training with:
```
python -m ham_radio.train with cnn
```

The trained model and the event files are written to

  /PATH/TO/MODEL_DIR/ham_radio/SADModel_{number_of_train_runs}

For more information about the training script and the event files visit our padertorch repository

If you want to reduce the required space for the gpu you can reduce the batch size by adding provider_opts.batch_size=4 or any other value for the batch size.

If you want to use a simple RNN structure instead of the
RNN you can replace cnn with rnn Most paramters are adjustable in a similar fashion.

Citation

@misc{heitkaemper2021database,
     title={A Database for Research on Detection and Enhancement of Speech Transmitted over HF links}, 
     author={Jens Heitkaemper and Joerg Schmalenstroeer and Joerg Ullmann and Valentin Ion and Reinhold Haeb-Umbach},
     year={2021},
     eprint={2106.02472},
     archivePrefix={arXiv},
     primaryClass={cs.SD}
}

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
ham_radio		ham_radio
.gitignore		.gitignore
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural network based SAD

Citation

About

Releases

Packages

Contributors 3

Languages

fgnt/ham_radio

Folders and files

Latest commit

History

Repository files navigation

Neural network based SAD

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages