wikispeech-annotator

Dependencies:

sudo apt install python3-venv
sudo apt install libespeak-dev

python3 -m pip install numpy
python3 -m pip install -r requirements.txt

Run in terminal:

python3 annotator.py

Run test server:

python3 -m uvicorn --reload --port 4567 annotator:app

Run server as system service:

sudo make install
sudo systemctl start ws-annotator
sudo systemctl status ws-annotator
journalctl -r -u ws-annotator

sudo systemctl stop ws-annotator
sudo make uninstall

Example: input is longer soundfile and text output is json with text+sentence time points

python3 annotator.py align swe ~/git/karin_boye/audio/boye_javisstgordetont.mp3 ~/git/karin_boye/text/boye_javisstgordetont.txt
python3 annotator.py align eng test_data/shakespeare_part1.wav test_data/shakespeare_part1.txt

http post :4567/align language="en-GB" textInputType="FILE" text=~/git/wikispeech-annotator/test_data/shakespeare_part1_par1.txt audioInputType="FILE" audioInput=~/git/wikispeech-annotator/test_data/shakespeare_part1_par1.wav

http post :4567/align language="sv-SE" audioInputType="FILE" audioInput=~/git/karin_boye/audio/boye_javisstgordetont.mp3 textInputType="FILE" text=~/git/karin_boye/text/boye_javisstgordetont.txt

Example:

Validate sound file

python3 annotator.py validate test_data/shakespeare_part1.wav

http get :4567/validate?audioInput=test_data/shakespeare_part1.wav
http post :4567/validate audioInputType="FILE" audioInput=test_data/shakespeare_part1.wav

Validate sound file and text file

python3 annotator.py validate test_data/shakespeare_part1.wav --text test_data/shakespeare_part1.txt

http get :4567/validate?audioInput=test_data/shakespeare_part1.wav&text=test_data/shakespeare_part1.txt
http post :4567/validate audioInputType="FILE" audioInput=test_data/shakespeare_part1.wav textInputType="FILE" text=test_data/shakespeare_part1.txt

Example:

Get VAD time points for sound file

python3 annotator.py vad test_data/shakespeare_part1.wav
python3 annotator.py vad test_data/shakespeare_part1.wav --returntype LAB

http post :4567/vad audioInputType="FILE" audioInput=test_data/shakespeare_part1.wav
http post :4567/vad audioInputType="FILE" audioInput=test_data/shakespeare_part1.wav returnType=LAB

Example:

input is soundfile and phonemes

python3 annotator.py align test_data/shakespeare_sent1_phrase1.wav "w ih l ih ah m sh ey k s p iy r" --textinputtype=STRING --alignmethod=SHIRO --language=en-GB
python3 annotator.py align test_data/shakespeare_sent1_phrase1.wav test_data/shakespeare_sent1_phrase1.json --textinputtype=FILE --alignmethod=JSON_SHIRO --language=en-GB

http post :4567/align audioInputType="FILE" audioInput=test_data/shakespeare_sent1_phrase1.wav textInputType=STRING text="w ih l ih ah m sh ey k s p iy r" alignMethod=SHIRO language=en-GB
http post :4567/align audioInputType="FILE" audioInput=test_data/shakespeare_sent1_phrase1.wav textInputType=FILE text=test_data/shakespeare_sent1_phrase1.json alignMethod=JSON_SHIRO language=en-GB

output is json with text+word time points and phonemes+time points

This work was supported by the Swedish Post and Telecom Authority (PTS) through the grant "Talresursinsamlaren – För ett tillgängligare Wikipedia genom Wikispeech" (2019–2021).

Name		Name	Last commit message	Last commit date
Latest commit History 76 Commits
.github		.github
aligner_models		aligner_models
scripts		scripts
static		static
templates		templates
test_data		test_data
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
align_shiro.py		align_shiro.py
align_shiro_readme.txt		align_shiro_readme.txt
annotator.py		annotator.py
commands.txt		commands.txt
kaldi_asr.py		kaldi_asr.py
requirements.txt		requirements.txt
test_annotator.py		test_annotator.py
test_validator.py		test_validator.py
validator.py		validator.py
ws-annotator.env		ws-annotator.env
ws-annotator.service		ws-annotator.service

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

wikispeech-annotator

About

Releases

Packages

Contributors 3

Languages

stts-se/wikispeech-annotator

Folders and files

Latest commit

History

Repository files navigation

wikispeech-annotator

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages