FunASR Runtime Roadmap

Chinese documentation (click here)

FunASR is a speech recognition framework developed by the Speech Lab of DAMO Academy. It integrates industrial-grade models for speech endpoint detection, speech recognition, punctuation restoration, and more.
It has attracted many developers to try it out and contribute. To cover the last mile of production deployment and help integrate models into business applications, we have developed the FunASR runtime-SDK. The SDK supports several service deployments, including:

File transcription service, Mandarin (CPU): done

Real-time transcription service, Mandarin (CPU): done

File transcription service, English (CPU): done

File transcription service, Mandarin (GPU): in progress

and more.

File Transcription Service, English (CPU)

The FunASR runtime-SDK currently supports deploying the English file transcription service (CPU version). It provides a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and it can serve more than a hundred concurrent recognition streams.

To meet the needs of different users, we provide illustrated tutorials for both novice and advanced developers.

What's New

2023/11/08: Adaptation to runtime structure changes (FunASR/funasr/runtime -> FunASR/runtime), docker image version funasr-runtime-sdk-en-cpu-0.1.1 (27017f70f72a).

2023/10/16: English File Transcription Service 1.0 released, docker image version funasr-runtime-sdk-en-cpu-0.1.0 (e0de03eb0163), refer to the detailed documentation (here).

Technical Principles

This document explains the technology behind the service, its recognition accuracy and computational efficiency, and its core advantages: convenience, high precision, high efficiency, and support for long audio. For details, please refer to docs.

Deployment Tutorial

This tutorial mainly targets novice users who do not need to modify or customize the service. It covers deploying models downloaded from ModelScope as well as models that users have fine-tuned. For the detailed tutorial, please refer to docs.

Advanced Development Guide

This guide mainly targets advanced developers who need to modify or customize the service. It covers deploying models downloaded from ModelScope as well as models that users have fine-tuned. For details, please refer to docs.

Real-time Transcription Service, Mandarin (CPU)

The FunASR real-time speech-to-text service package performs real-time transcription, corrects the transcript with a higher-precision pass at the end of each sentence, and outputs punctuated text. It supports a high volume of concurrent requests.
To meet the needs of different users and scenarios, we provide the following tutorials:

What's New

2023/11/09: Real-time Transcription Service 1.5 released, fixing a bug where no online results were returned, docker image version funasr-runtime-sdk-online-cpu-0.1.5 (b16584b6d38b).

2023/11/08: Real-time Transcription Service 1.4 released, supporting server-side loading of hotwords (updated hotword communication protocol), adaptation to runtime structure changes (FunASR/funasr/runtime -> FunASR/runtime), docker image version funasr-runtime-sdk-online-cpu-0.1.4 (691974017c38).

2023/09/19: Real-time Transcription Service 1.2 released, supporting hotwords, timestamps, and ITN model in 2pass mode, docker image version funasr-runtime-sdk-online-cpu-0.1.2 (7222c5319bcf).

2023/08/11: Real-time Transcription Service 1.1 released, addressing some known bugs (including server crashes), docker image version funasr-runtime-sdk-online-cpu-0.1.1 (bdbdd0b27dee).

2023/08/07: Real-time Transcription Service 1.0 released, docker image version funasr-runtime-sdk-online-cpu-0.1.0 (bdbdd0b27dee), refer to the detailed documentation (here).

Convenient Deployment Tutorial

This tutorial suits scenarios where the service deployment SDK does not need to be modified and the deployed model either comes from ModelScope or has been fine-tuned by the user. For the detailed tutorial, please refer to docs.

Development Guide

This guide suits scenarios where the service deployment SDK needs to be modified and the deployed model either comes from ModelScope or has been fine-tuned by the user. For detailed documentation, please refer to docs.

Technology Principles Revealed

This document introduces the technology behind the service, its recognition accuracy and computational efficiency, and its core advantages: convenience, high precision, high efficiency, and support for long audio. For detailed documentation, please refer to docs.

File Transcription Service, Mandarin (CPU)

The FunASR runtime-SDK currently supports deploying the Mandarin file transcription service (CPU version). It provides a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and it can serve more than a hundred concurrent recognition streams.

To meet the needs of different users, we provide illustrated tutorials for both novice and advanced developers.

What's New

2023/11/08: File Transcription Service 3.0 released, supporting punctuation large model, Ngram model, fst hotwords (updated hotword communication protocol), server-side loading of hotwords, adaptation to runtime structure changes (FunASR/funasr/runtime -> FunASR/runtime), docker image version funasr-runtime-sdk-cpu-0.3.0 (caa64bddbb43), refer to the detailed documentation (here).

2023/09/19: File Transcription Service 2.2 released, supporting ITN model, docker image version funasr-runtime-sdk-cpu-0.2.2 (2c5286be13e9).

2023/08/22: File Transcription Service 2.0 released, integrated ffmpeg to support various audio and video inputs, supporting hotword model and timestamp model, docker image version funasr-runtime-sdk-cpu-0.2.0 (1ad3d19e0707), refer to the detailed documentation (here).

2023/07/03: File Transcription Service 1.0 released, docker image version funasr-runtime-sdk-cpu-0.1.0 (1ad3d19e0707), refer to the detailed documentation (here).
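
The docker image versions quoted in the release notes appear to follow the pattern funasr-runtime-sdk-<variant>-<version>, for example funasr-runtime-sdk-online-cpu-0.1.5. The Python sketch below shows one way a deployment script might split such a tag into its service variant and version so that releases can be compared. The pattern is inferred only from the tags listed on this page, and the names ImageTag, parse_image_tag, and latest are hypothetical helpers, not part of FunASR.

```python
import re
from typing import Iterable, NamedTuple, Tuple


class ImageTag(NamedTuple):
    variant: str              # service variant, e.g. "cpu", "online-cpu", "en-cpu"
    version: Tuple[int, ...]  # numeric version, e.g. (0, 3, 0)


# Assumption: naming pattern inferred from the tags quoted in the release notes.
_TAG_RE = re.compile(
    r"^funasr-runtime-sdk-(?P<variant>[a-z-]+)-(?P<version>\d+(?:\.\d+)+)$"
)


def parse_image_tag(tag: str) -> ImageTag:
    """Split a runtime-SDK image tag into its variant and numeric version."""
    match = _TAG_RE.match(tag)
    if match is None:
        raise ValueError(f"unrecognized image tag: {tag!r}")
    version = tuple(int(part) for part in match.group("version").split("."))
    return ImageTag(match.group("variant"), version)


def latest(tags: Iterable[str]) -> str:
    """Return the tag with the highest numeric version."""
    return max(tags, key=lambda tag: parse_image_tag(tag).version)


if __name__ == "__main__":
    file_service_tags = [
        "funasr-runtime-sdk-cpu-0.1.0",
        "funasr-runtime-sdk-cpu-0.2.0",
        "funasr-runtime-sdk-cpu-0.2.2",
        "funasr-runtime-sdk-cpu-0.3.0",
    ]
    print(parse_image_tag("funasr-runtime-sdk-online-cpu-0.1.5"))
    print(latest(file_service_tags))
```

Comparing versions as integer tuples rather than strings avoids ordering mistakes such as treating 0.10.0 as older than 0.9.0. The latest helper compares versions only, so it is meant to be called on tags of a single variant.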
