Pretrained Models on Huggingface
Model License
Model Zoo
Here we provided several pretrained models on different datasets. The details of models and datasets can be found on ModelScope.
Speech Recognition Models
Paraformer Models
| Model Name |
Language |
Training Data |
Vocab Size |
Parameter |
Offline/Online |
Notes |
| Paraformer-large |
CN & EN |
Alibaba Speech Data (60000hours) |
8404 |
220M |
Offline |
Duration of input wav <= 20s |
UniASR Models
Conformer Models
RNN-T Models
Multi-talker Speech Recognition Models
MFCCA Models
Voice Activity Detection Models
| Model Name |
Training Data |
Parameters |
Sampling Rate |
Notes |
| FSMN-VAD |
Alibaba Speech Data (5000hours) |
0.4M |
16000 |
|
Punctuation Restoration Models
| Model Name |
Training Data |
Parameters |
Vocab Size |
Offline/Online |
Notes |
| CT-Transformer |
Alibaba Text Data |
70M |
272727 |
Offline |
offline punctuation model |
Language Models
Speaker Verification Models
Speaker diarization Models
Timestamp Prediction Models