lyblsgo 3703363363 update docs for funasr-runtime-sdk-online-cpu-0.1.3 пре 2 година
..
android 6eb0dfced4 Dev server hotwords (#1033) пре 2 година
csharp 4cacf936bc Adjusting the project directory structure (#1010) пре 2 година
deploy_tools 907966d80b update docs for en-bpe models пре 2 година
docs 3703363363 update docs for funasr-runtime-sdk-online-cpu-0.1.3 пре 2 година
grpc 85e351bdd9 增加模型下载流程 & 接口修正 & debug (#871) пре 2 година
html5 5d15727b1e set wav sample rate and PCM for h5 (#919) пре 2 година
ios b8909ab5b9 itn on iOS is not supported for the time being (#969) пре 2 година
java 0cf7339171 add hotwords for h5 and java (#876) пре 2 година
onnxruntime 91231a03f5 add jieba for ct-transformer пре 2 година
python 7b857162cb Update funasr_wss_client.py пре 2 година
ssl_key 546e6a658a update server.key пре 2 година
triton_gpu 98abc0e5ac update setup (#686) пре 2 година
websocket f1cb410262 Merge branch 'main' of https://github.com/alibaba-damo-academy/FunASR into main пре 2 година
__init__.py 865ae89f0a export model пре 3 година
readme.md 3703363363 update docs for funasr-runtime-sdk-online-cpu-0.1.3 пре 2 година
readme_cn.md 3703363363 update docs for funasr-runtime-sdk-online-cpu-0.1.3 пре 2 година
run_server.sh 71b403ee9d update docs пре 2 година
run_server_2pass.sh 71b403ee9d update docs пре 2 година

readme.md

FunASR Runtime Roadmap

中文文档(点击此处

FunASR is a speech recognition framework developed by the Speech Lab of DAMO Academy, which integrates industrial-level models in the fields of speech endpoint detection, speech recognition, punctuation segmentation, and more. It has attracted many developers to participate in experiencing and developing. To solve the last mile of industrial landing and integrate models into business, we have developed the FunASR runtime-SDK. The SDK supports several service deployments, including:

  • File transcription service, Mandarin, CPU version, done
  • The real-time transcription service, Mandarin (CPU), done
  • File transcription service, English, CPU version, done
  • File transcription service, Mandarin, GPU version, in progress
  • and more.

File Transcription Service, English (CPU)

Currently, the FunASR runtime-SDK supports the deployment of file transcription service, English (CPU version), with a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and supports recognition for more than a hundred concurrent streams.

To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers.

Technical Principles

The technical principles and documentation behind FunASR explain the underlying technology, recognition accuracy, computational efficiency, and core advantages of the framework, including convenience, high precision, high efficiency, and support for long audio chains. For detailed information, please refer to the documentation available by docs.

Deployment Tutorial

The documentation mainly targets novice users who have no need for modifications or customization. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed tutorials, please refer to docs.

Advanced Development Guide

The documentation mainly targets advanced developers who require modifications and customization of the service. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed information, please refer to the documentation available by docs

latest version & image ID

image version image ID INFO
funasr-runtime-sdk-en-cpu-0.1.0 e0de03eb01

The real-time transcription service, Mandarin (CPU)

The FunASR real-time speech-to-text service software package not only performs real-time speech-to-text conversion, but also allows high-precision transcription text correction at the end of each sentence and outputs text with punctuation, supporting high-concurrency multiple requests. In order to meet the needs of different users for different scenarios, different tutorials are prepared:

Convenient Deployment Tutorial

This is suitable for scenarios where there is no need to modify the service deployment SDK and the deployed model comes from ModelScope or is finetuned by the user. For detailed tutorials, please refer to docs

Development Guide

This is suitable for scenarios where there is a need to modify the service deployment SDK and the deployed model comes from ModelScope or is finetuned by the user. For detailed documentation, please refer to docs

Technology Principles Revealed

The document introduces the technology principles behind the service, recognition accuracy, computing efficiency, and core advantages: convenience, high precision, high efficiency, and long audio chain. For detailed documentation, please refer to docs.

latest version & image ID

image version image ID INFO
funasr-runtime-sdk-online-cpu-0.1.3 0adef77795

File Transcription Service, Mandarin (CPU)

Currently, the FunASR runtime-SDK supports the deployment of file transcription service, Mandarin (CPU version), with a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and supports recognition for more than a hundred concurrent streams.

To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers.

Technical Principles

The technical principles and documentation behind FunASR explain the underlying technology, recognition accuracy, computational efficiency, and core advantages of the framework, including convenience, high precision, high efficiency, and support for long audio chains. For detailed information, please refer to the documentation available by docs.

Deployment Tutorial

The documentation mainly targets novice users who have no need for modifications or customization. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed tutorials, please refer to docs.

Advanced Development Guide

The documentation mainly targets advanced developers who require modifications and customization of the service. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed information, please refer to the documentation available by docs

latest version & image ID

image version image ID INFO
funasr-runtime-sdk-cpu-0.2.2 2c5286be13