csharp c2a2575f19 add c# assembly for fsmn vad (#650) 2 years ago
deploy_tools 3e0230bcb0 [funasr-runtime-deploy 0.0.6] add SSL on-off and rationalize remove/start (#713) 2 years ago
docs 61c24582f9 funasr sdk 2 years ago
grpc eea2a7689f fix paraformer-server for new apis 2 years ago
html5 07be725277 Update readme_zh.md 2 years ago
java 293e212128 Java ws client support (#651) 2 years ago
onnxruntime f8c740d5a8 readme docs 2 years ago
python a684ebacb5 large punc model python onnx runtime 2 years ago
ssl_key 546e6a658a update server.key 2 years ago
triton_gpu 98abc0e5ac update setup (#686) 2 years ago
websocket 1dcdd5f8a6 Update readme.md 2 years ago
__init__.py 865ae89f0a export model 3 years ago
readme.md bc1e5c6f93 docs 2 years ago
readme_cn.md b395682268 Update readme_cn.md 2 years ago
run_server.sh 6b840d5143 Merge remote-tracking branch 'origin/main' 2 years ago


FunASR runtime-SDK

Chinese documentation (click here)

FunASR is a speech recognition framework developed by the Speech Lab of DAMO Academy. It integrates industrial-grade models for speech endpoint detection, speech recognition, punctuation segmentation, and more, and it has attracted many developers who use and contribute to it. To cover the last mile of industrial deployment and make these models easy to integrate into business applications, we have developed the FunASR runtime-SDK. The SDK supports several service deployments, including:

  • File transcription service, Mandarin, CPU version, done
  • File transcription service, Mandarin, GPU version, in progress
  • File transcription service, English, in progress
  • Streaming speech recognition service, in progress
  • and more.

File Transcription Service, Mandarin (CPU)

Currently, FunASR runtime-SDK version 0.0.1 supports deploying the Mandarin file transcription service (CPU version). It provides a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and it supports recognition for more than a hundred concurrent streams.
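As a rough illustration of what a "complete speech recognition chain" with concurrent requests can look like, here is a minimal Python sketch. It is not the FunASR API: the `Pipeline` class, the `transcribe_many` helper, and the `fake_*` stage functions are hypothetical stand-ins for endpoint detection, recognition, and punctuation, and a thread pool is only one possible way to serve several files at once.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# A speech segment as (start_ms, end_ms). Hypothetical type, not a FunASR type.
Segment = Tuple[int, int]


@dataclass
class Pipeline:
    """Hypothetical three-stage chain: endpoint detection -> ASR -> punctuation."""

    detect_speech: Callable[[bytes], List[Segment]]
    recognize: Callable[[bytes, Segment], str]
    punctuate: Callable[[str], str]

    def transcribe(self, audio: bytes) -> str:
        # 1. Find the speech segments in the audio.
        segments = self.detect_speech(audio)
        # 2. Recognize each segment and join the raw text.
        raw_text = " ".join(self.recognize(audio, seg) for seg in segments)
        # 3. Add punctuation to the unpunctuated transcript.
        return self.punctuate(raw_text)


def transcribe_many(pipeline: Pipeline, files: Sequence[bytes], workers: int = 4) -> List[str]:
    """Transcribe several inputs concurrently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(pipeline.transcribe, files))


# Placeholder stages so the sketch runs without any model (assumptions only).
def fake_vad(audio: bytes) -> List[Segment]:
    return [(0, 1000), (1000, 2000)] if audio else []


def fake_asr(audio: bytes, segment: Segment) -> str:
    return f"words{segment[0]}"


def fake_punc(text: str) -> str:
    return text.capitalize() + "." if text else ""
```

In a real deployment each stage would wrap an actual model, and the service described in docs handles the concurrency itself; the sketch only shows how the three stages compose.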

To meet the needs of different users, we have prepared illustrated tutorials for both novice and advanced developers.

Technical Principles

The technical principles documentation explains the underlying technology, recognition accuracy, and computational efficiency of FunASR, as well as its core advantages: convenience, high precision, high efficiency, and support for long audio chains. For details, please refer to docs.

Deployment Tutorial

This tutorial mainly targets novice users who do not need to modify or customize the service. It supports deploying models downloaded from ModelScope as well as models that users have fine-tuned. For the detailed tutorial, please refer to docs.

Advanced Development Guide

This guide mainly targets advanced developers who need to modify or customize the service. It supports deploying models downloaded from ModelScope as well as models that users have fine-tuned. For details, please refer to docs.