csj-eval

This repository aims at evaluating speech recognition system using the Corpus of Spontaneous Japanese (CSJ)

Preparation

$ pip install -r requirements.txt

$ pip install -r chrome/requirements.txt

$ ./split_wav.py --ignore_tag --ignore_kansuuji --ignore_anonymization --core /Volumes/Untitled/

In this repository, webkitSpeechRecognition in the Google Chrome is used as a baseline.

At first, run the Web application server as the following:

$  ./chrome/server.py

Next, open http://127.0.0.1:5000/ by Google Chrome.

Futhermore, connect a USB audio interface to your computer.

Finally, play wav files as the following:

$ ./play_wav.py --interval 2.0

After executing above commands, chrome/chrome_result.txt is created.

$ ./wer.py chrome/chrome_result.txt