prefetch

CLI tool that measures latency of cloud streaming speech-to-text APIs. It simulates a realtime audio stream and primitive speech command parsing to measure the amount of time required for an API to provide the client with enough responses to begin actual command processing like playing a song or ordering a burrito.

Supported treaming speech-to-text API vendors

There are plans to eventually support latency measurements for all major STT vendors including Google Cloud, AWS, Azure.

Usage

Set location of your project credentials file to GOOGLE_APPLICATION_CREDENTIALS enrivonment variable. Then run python cli.py. Available options are:

  -i, --input TEXT          Audio WAV file
  -p, --pattern TEXT        Pattern to match the transcript against, string or a file path
  -l, --language_code TEXT  Language code, defaults to 'en-US'

Example command produced using speech synthesis is provided in the sound folder. Command to run the program would then look like this:

python cli.py -i sound/book-tickets-en.wav -p "book tickets from Seattle to Tokyo on June 7th"

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.vscode		.vscode
prefetch		prefetch
sound		sound
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
cli.py		cli.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

prefetch

Supported treaming speech-to-text API vendors

Usage

About

Releases

Packages

Languages

License

hiisi13/prefetch

Folders and files

Latest commit

History

Repository files navigation

prefetch

Supported treaming speech-to-text API vendors

Usage

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages