- Apache License 2.0
Here we provided several pretrained models on different datasets. The details of models and datasets can be found on ModelScope.
Model Name | Language | Training Data | Vocab Size | Parameter | Offline/Online | Notes |
---|---|---|---|---|---|---|
Paraformer-large | CN & EN | Alibaba Speech Data (60000hours) | 8404 | 220M | Offline | Duration of input wav <= 20s |
Model Name | Training Data | Parameters | Sampling Rate | Notes |
---|---|---|---|---|
FSMN-VAD | Alibaba Speech Data (5000hours) | 0.4M | 16000 |
Model Name | Training Data | Parameters | Vocab Size | Offline/Online | Notes |
---|---|---|---|---|---|
CT-Transformer | Alibaba Text Data | 70M | 272727 | Offline | offline punctuation model |