游雁 29b2cc8ba1 export		hace 2 años
..
models	ea6903101b punc onnx	hace 2 años
test	ce7914034d increase vad realtime punc	hace 2 años
utils	5b0047bf58 export model	hace 2 años
README.md	8c484b0b74 docs	hace 2 años
__init__.py	59f184a622 export model	hace 2 años
export_model.py	29b2cc8ba1 export	hace 2 años

Export models

Environments

Install modelscope and funasr

The installation is the same as funasr

# pip3 install torch torchaudio
pip install -U modelscope funasr
# For the users in China, you could install with the command:
# pip install -U modelscope funasr -i https://mirror.sjtu.edu.cn/pypi/web/simple

Install the quantization tools

pip install torch-quant # Optional, for torchscript quantization
pip install onnx onnxruntime # Optional, for onnx quantization

Usage

Tips: torch>=1.11.0

   python -m funasr.export.export_model \
       --model-name [model_name] \
       --export-dir [export_dir] \
       --type [onnx, torch] \
       --quantize [true, false] \
       --fallback-num [fallback_num]

model-name: the model is to export. It could be the models from modelscope, or local finetuned model(named: model.pb).

export-dir: the dir where the onnx is export.

type: onnx or torch, export onnx format model or torchscript format model.

quantize: true, export quantized model at the same time; false, export fp32 model only.

fallback-num: specify the number of fallback layers to perform automatic mixed precision quantization.

Export onnx format model

Export model from modelscope

python -m funasr.export.export_model --model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type onnx --quantize false

Export model from local path

The model'name must be model.pb

python -m funasr.export.export_model --model-name /mnt/workspace/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type onnx --quantize false

Test onnx model

Ref to test

Export torchscripts format model

Export model from modelscope

python -m funasr.export.export_model --model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type torch --quantize false

Export model from local path

The model'name must be model.pb

python -m funasr.export.export_model --model-name /mnt/workspace/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type torch --quantize false

Test onnx model

Ref to test

Runtime

ONNXRuntime

ONNXRuntime-python

Ref to docs

ONNXRuntime-cpp

Ref to docs

Libtorch

Libtorch-python

Ref to docs

Libtorch-cpp

Undo

Performance Benchmark

Paraformer on CPU

onnx runtime

libtorch runtime

Paraformer on GPU

nv-triton

Acknowledge

Torch model quantization is supported by BladeDISC, an end-to-end DynamIc Shape Compiler project for machine learning workloads. BladeDISC provides general, transparent, and ease of use performance optimization for TensorFlow/PyTorch workloads on GPGPU and CPU backends. If you are interested, please contact us.

README.md

Export models

Environments

Install modelscope and funasr

Install the quantization tools

Usage

Export onnx format model

Export model from modelscope

Export model from local path

Test onnx model

Export torchscripts format model

Export model from modelscope

Export model from local path

Test onnx model

Runtime

ONNXRuntime

ONNXRuntime-python

ONNXRuntime-cpp

Libtorch

Libtorch-python

Libtorch-cpp

Performance Benchmark

Paraformer on CPU

Paraformer on GPU

Acknowledge