(local) Build & Setup Instructions

Milo Webster edited this page Jul 28, 2020 · 5 revisions

Install the GCP SDK

https://cloud.google.com/sdk/docs/quickstarts

Clone the repo

  • Make sure you have your GitHub SSH key set up
  • git clone git@github.com:mmwebster/voxsrc-2020.git

Install a dataset

  • Move into the data directory: cd voxsrc-2020/data/
  • Run the install script: python utils.py --install-local-dataset --src-bucket voxsrc-2020-voxceleb-v4 --src-dataset no_cuda --dst-data-path ./datasets --dst-list-path ./lists --dst-tmp-path ./tmp
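  • Set up symlinks. Use absolute target paths, because a relative symlink target is resolved from the directory that contains the link, not from the current directory:
    • ln -s "$(pwd)/datasets/vox1_no_cuda" ../components/train/tmp/data/vox1_no_cuda
    • ln -s "$(pwd)/datasets/vox1_no_cuda.txt" ../components/train/tmp/data/vox1_no_cuda.txt
    • ln -s "$(pwd)/datasets/vox2_no_cuda" ../components/train/tmp/data/vox2_no_cuda
    • ln -s "$(pwd)/datasets/vox2_no_cuda.txt" ../components/train/tmp/data/vox2_no_cuda.txt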
  • NOTE: To set up another dataset, replace "no_cuda" with that dataset's name in the commands above. For example, use "full" to install the complete, original dataset.
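
The symlink step can be wrapped in a small helper so that switching datasets only means changing one argument. This is a minimal sketch, not part of the repo: the function name link_dataset and its argument order are hypothetical, and it assumes every dataset follows the vox1_<name> / vox2_<name> naming shown above. It resolves both directories to absolute paths first, so the links stay valid no matter where they are created from.

```shell
# Hypothetical helper (not in the repo): link one dataset into the train component.
# Usage: link_dataset <dataset-name> <datasets-dir> <train-data-dir>
link_dataset() {
  local name="$1" src dst entry
  src="$(cd "$2" && pwd)" || return 1   # absolute path of the datasets dir
  mkdir -p "$3" || return 1             # create the link directory if missing
  dst="$(cd "$3" && pwd)" || return 1   # absolute path of the link directory
  for entry in "vox1_${name}" "vox1_${name}.txt" "vox2_${name}" "vox2_${name}.txt"; do
    # -f replaces a stale link; -n avoids descending into an existing directory link
    ln -sfn "${src}/${entry}" "${dst}/${entry}"
  done
}
```

Run from voxsrc-2020/data/, the equivalent of the four commands above would be: link_dataset no_cuda ./datasets ../components/train/tmp/data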

Run the train component (locally, standalone)

  • Move into the component directory: cd ../components/train/
  • Execute the component's local run script with the default config: ./run_local.sh
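
Before running the script, it may be worth confirming that the dataset symlinks actually resolve, since a dangling link only shows up as a missing-file error later. The check below is a sketch under the assumption that the links live in tmp/data relative to the component directory; dangling_links is a hypothetical name, not something the repo provides.

```shell
# Hypothetical pre-flight check (not in the repo): print dangling symlinks under a directory.
dangling_links() {
  # -type l selects symlinks; `test -e` follows the link and fails if the target is missing
  find "$1" -type l ! -exec test -e {} \; -print
}
```

From components/train/, calling dangling_links tmp/data prints nothing when every link points at an existing dataset file or directory.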