
S-Align (Soft alignment for E2E Speech Translation)

The code is forked from Fairseq v0.12.3. For installation details, please refer to Fairseq.

Usage

Training scripts and configurations for the MuST-C dataset are as follows:

egs
|---machine_translation
|    |---train.sh
|    |---decode.sh
|    |---load_embedding.py
|---pretrain-all
|    |---joint_train_merge.sh
|    |---decode.sh
|    |---device_run.sh
|    |---conf

Step 1. MT Pretraining

• Prepare the MT training data.

• Modify the necessary paths in machine_translation/train.sh, then run machine_translation/train.sh to pretrain the MT model.

• Adjust the paths in machine_translation/decode.sh to match those in machine_translation/train.sh, then run machine_translation/decode.sh to run inference with your pretrained MT model.

• Use machine_translation/load_embedding.py to extract the required word embeddings from the pretrained MT model, as sketched below.
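
The embedding export is essentially a matter of reading the decoder's word-embedding matrix out of the MT checkpoint. The following is only a minimal sketch of that step, not the repository's load_embedding.py: the checkpoint filename, the parameter name, and the output format are assumptions, so check your checkpoint's state dict and prefer the provided script.

import torch

# Hypothetical paths; point these at your own pretrained MT checkpoint and output file.
mt_ckpt_path = "/your/path/to/mt/pretrain/model/checkpoint_best.pt"
embed_out_path = "/your/path/to/mt/word/embedding/decoder_embed.pt"

# Fairseq checkpoints keep the model weights under the "model" key.
state = torch.load(mt_ckpt_path, map_location="cpu")
model_state = state["model"]

# Standard Fairseq Transformer decoders store the target-side word embeddings here.
embed_weight = model_state["decoder.embed_tokens.weight"]
print(f"decoder embedding matrix: {tuple(embed_weight.shape)}")

# Save just the embedding tensor so the ST fine-tuning stage can load it.
torch.save(embed_weight, embed_out_path)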

Step 2. Multi-Task Fine-tuning

• Download the HuBERT-Base pretrained model (the version without fine-tuning).

• Prepare the MuST-C ST training data; please follow the instructions here.

• Modify the necessary paths in pretrain-all/conf/train_soft_alignment.yaml (a quick sanity check for these paths is sketched after this step list), such as:

w2v-path=/your/path/to/hubert
mt-model-path=/your/path/to/mt/pretrain/model
decoder-embed-path=/your/path/to/mt/word/embedding

• Set the data path and other required paths in pretrain-all/joint_train_merge.sh, then run pretrain-all/joint_train_merge.sh to fine-tune your model.

• Use pretrain-all/decode.sh to run inference with your fine-tuned model.
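
Before launching the fine-tuning run, it can help to confirm that the three paths set in the config actually exist. Below is a small sketch (not part of this repository) that scans train_soft_alignment.yaml for the keys shown above; the exact config format is an assumption, so adapt the parsing to your file.

import os
import re

conf_path = "pretrain-all/conf/train_soft_alignment.yaml"
wanted = ("w2v-path", "mt-model-path", "decoder-embed-path")

# Collect "key=value" or "key: value" lines for the paths we care about.
found = {}
with open(conf_path) as f:
    for line in f:
        m = re.match(r"\s*([\w-]+)\s*[:=]\s*(\S+)", line)
        if m and m.group(1) in wanted:
            found[m.group(1)] = m.group(2)

# Report whether each required checkpoint/embedding path is reachable.
for key in wanted:
    path = found.get(key)
    status = "ok" if path and os.path.exists(path) else "MISSING"
    print(f"{key}: {path} [{status}]")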

Citation
