README.html 34 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347
  1. <!DOCTYPE html>
  2. <html class="writer-html5" lang="en" >
  3. <head>
  4. <meta charset="utf-8" />
  5. <meta name="viewport" content="width=device-width, initial-scale=1.0" />
  6. <title>ONNXRuntime-python &mdash; FunASR documentation</title><link rel="stylesheet" href="../../../_static/css/theme.css" type="text/css" />
  7. <link rel="stylesheet" href="../../../_static/pygments.css" type="text/css" />
  8. <!--[if lt IE 9]>
  9. <script src="../../../_static/js/html5shiv.min.js"></script>
  10. <![endif]-->
  11. <script id="documentation_options" data-url_root="../../../" src="../../../_static/documentation_options.js"></script>
  12. <script src="../../../_static/jquery.js"></script>
  13. <script src="../../../_static/underscore.js"></script>
  14. <script src="../../../_static/doctools.js"></script>
  15. <script src="../../../_static/language_data.js"></script>
  16. <script crossorigin="anonymous" integrity="sha256-Ae2Vz/4ePdIu6ZyI/5ZGsYnb+m0JlOmKPjt6XZ9JJkA=" src="https://cdnjs.cloudflare.com/ajax/libs/require.js/2.3.4/require.min.js"></script>
  17. <script src="../../../_static/js/theme.js"></script>
  18. <link rel="index" title="Index" href="../../../genindex.html" />
  19. <link rel="search" title="Search" href="../../../search.html" />
  20. </head>
  21. <body class="wy-body-for-nav">
  22. <div class="wy-grid-for-nav">
  23. <nav data-toggle="wy-nav-shift" class="wy-nav-side">
  24. <div class="wy-side-scroll">
  25. <div class="wy-side-nav-search" >
  26. <a href="../../../index.html" class="icon icon-home">
  27. FunASR
  28. </a>
  29. <div role="search">
  30. <form id="rtd-search-form" class="wy-form" action="../../../search.html" method="get">
  31. <input type="text" name="q" placeholder="Search docs" aria-label="Search docs" />
  32. <input type="hidden" name="check_keywords" value="yes" />
  33. <input type="hidden" name="area" value="default" />
  34. </form>
  35. </div>
  36. </div><div class="wy-menu wy-menu-vertical" data-spy="affix" role="navigation" aria-label="Navigation menu">
  37. <p class="caption"><span class="caption-text">Installation</span></p>
  38. <ul>
  39. <li class="toctree-l1"><a class="reference internal" href="../../../installation/installation.html">Installation</a></li>
  40. <li class="toctree-l1"><a class="reference internal" href="../../../installation/docker.html">Docker</a></li>
  41. </ul>
  42. <p class="caption"><span class="caption-text">Quick Start</span></p>
  43. <ul>
  44. <li class="toctree-l1"><a class="reference internal" href="../../../funasr/quick_start.html">Quick Start</a></li>
  45. </ul>
  46. <p class="caption"><span class="caption-text">Academic Egs</span></p>
  47. <ul>
  48. <li class="toctree-l1"><a class="reference internal" href="../../../academic_recipe/asr_recipe.html">Speech Recognition</a></li>
  49. <li class="toctree-l1"><a class="reference internal" href="../../../academic_recipe/punc_recipe.html">Punctuation Restoration</a></li>
  50. <li class="toctree-l1"><a class="reference internal" href="../../../academic_recipe/vad_recipe.html">Voice Activity Detection</a></li>
  51. <li class="toctree-l1"><a class="reference internal" href="../../../academic_recipe/sv_recipe.html">Speaker Verification</a></li>
  52. <li class="toctree-l1"><a class="reference internal" href="../../../academic_recipe/sd_recipe.html">Speaker Diarization</a></li>
  53. </ul>
  54. <p class="caption"><span class="caption-text">ModelScope Egs</span></p>
  55. <ul>
  56. <li class="toctree-l1"><a class="reference internal" href="../../../modelscope_pipeline/quick_start.html">Quick Start</a></li>
  57. <li class="toctree-l1"><a class="reference internal" href="../../../egs_modelscope/asr/TEMPLATE/README.html">Speech Recognition</a></li>
  58. <li class="toctree-l1"><a class="reference internal" href="../../../egs_modelscope/vad/TEMPLATE/README.html">Voice Activity Detection</a></li>
  59. <li class="toctree-l1"><a class="reference internal" href="../../../egs_modelscope/punctuation/TEMPLATE/README.html">Punctuation Restoration</a></li>
  60. <li class="toctree-l1"><a class="reference internal" href="../../../egs_modelscope/tp/TEMPLATE/README.html">Timestamp Prediction (FA)</a></li>
  61. <li class="toctree-l1"><a class="reference internal" href="../../../modelscope_pipeline/sv_pipeline.html">Speaker Verification</a></li>
  62. <li class="toctree-l1"><a class="reference internal" href="../../../modelscope_pipeline/sd_pipeline.html">Speaker Diarization</a></li>
  63. <li class="toctree-l1"><a class="reference internal" href="../../../modelscope_pipeline/itn_pipeline.html">Inverse Text Normalization (ITN)</a></li>
  64. </ul>
  65. <p class="caption"><span class="caption-text">Model Zoo</span></p>
  66. <ul>
  67. <li class="toctree-l1"><a class="reference internal" href="../../../model_zoo/modelscope_models.html">Pretrained Models Released on ModelScope</a></li>
  68. <li class="toctree-l1"><a class="reference internal" href="../../../model_zoo/huggingface_models.html">Pretrained Models on Huggingface</a></li>
  69. </ul>
  70. <p class="caption"><span class="caption-text">Runtime and Service</span></p>
  71. <ul>
  72. <li class="toctree-l1"><a class="reference internal" href="../../readme.html">FunASR Runtime Roadmap</a></li>
  73. <li class="toctree-l1"><a class="reference internal" href="../../docs/SDK_tutorial_online.html">FunASR Realtime Transcribe Service</a></li>
  74. <li class="toctree-l1"><a class="reference internal" href="../../docs/SDK_tutorial.html">Highlights</a></li>
  75. <li class="toctree-l1"><a class="reference internal" href="../../docs/SDK_tutorial.html#funasr-offline-file-transcription-service">FunASR Offline File Transcription Service</a></li>
  76. <li class="toctree-l1"><a class="reference internal" href="../../html5/readme.html">Speech Recognition Service Html5 Client Access Interface</a></li>
  77. </ul>
  78. <p class="caption"><span class="caption-text">Benchmark and Leaderboard</span></p>
  79. <ul>
  80. <li class="toctree-l1"><a class="reference internal" href="../../../benchmark/benchmark_pipeline_cer.html">Leaderboard IO</a></li>
  81. </ul>
  82. <p class="caption"><span class="caption-text">Funasr Library</span></p>
  83. <ul>
  84. <li class="toctree-l1"><a class="reference internal" href="../../../reference/build_task.html">Build custom tasks</a></li>
  85. </ul>
  86. <p class="caption"><span class="caption-text">Papers</span></p>
  87. <ul>
  88. <li class="toctree-l1"><a class="reference internal" href="../../../reference/papers.html">Papers</a></li>
  89. </ul>
  90. <p class="caption"><span class="caption-text">Application</span></p>
  91. <ul>
  92. <li class="toctree-l1"><a class="reference internal" href="../../../reference/application.html">Audio Cut</a></li>
  93. <li class="toctree-l1"><a class="reference internal" href="../../../reference/application.html#realtime-speech-recognition">Realtime Speech Recognition</a></li>
  94. <li class="toctree-l1"><a class="reference internal" href="../../../reference/application.html#audio-chat">Audio Chat</a></li>
  95. </ul>
  96. <p class="caption"><span class="caption-text">FQA</span></p>
  97. <ul>
  98. <li class="toctree-l1"><a class="reference internal" href="../../../reference/FQA.html">FQA</a></li>
  99. </ul>
  100. </div>
  101. </div>
  102. </nav>
  103. <section data-toggle="wy-nav-shift" class="wy-nav-content-wrap"><nav class="wy-nav-top" aria-label="Mobile navigation menu" >
  104. <i data-toggle="wy-nav-top" class="fa fa-bars"></i>
  105. <a href="../../../index.html">FunASR</a>
  106. </nav>
  107. <div class="wy-nav-content">
  108. <div class="rst-content">
  109. <div role="navigation" aria-label="Page navigation">
  110. <ul class="wy-breadcrumbs">
  111. <li><a href="../../../index.html" class="icon icon-home" aria-label="Home"></a></li>
  112. <li class="breadcrumb-item active">ONNXRuntime-python</li>
  113. <li class="wy-breadcrumbs-aside">
  114. <a href="../../../_sources/runtime/python/onnxruntime/README.md.txt" rel="nofollow"> View page source</a>
  115. </li>
  116. </ul>
  117. <hr/>
  118. </div>
  119. <div role="main" class="document" itemscope="itemscope" itemtype="http://schema.org/Article">
  120. <div itemprop="articleBody">
  121. <div class="section" id="onnxruntime-python">
  122. <h1>ONNXRuntime-python<a class="headerlink" href="#onnxruntime-python" title="Permalink to this headline"></a></h1>
  123. <div class="section" id="install-funasr-onnx">
  124. <h2>Install <code class="docutils literal notranslate"><span class="pre">funasr-onnx</span></code><a class="headerlink" href="#install-funasr-onnx" title="Permalink to this headline"></a></h2>
  125. <p>install from pip</p>
  126. <div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>pip install -U funasr-onnx
  127. <span class="c1"># For the users in China, you could install with the command:</span>
  128. <span class="c1"># pip install -U funasr-onnx -i https://mirror.sjtu.edu.cn/pypi/web/simple</span>
  129. <span class="c1"># If you want to export .onnx file, you should install modelscope and funasr</span>
  130. pip install -U modelscope funasr
  131. <span class="c1"># For the users in China, you could install with the command:</span>
  132. <span class="c1"># pip install -U modelscope funasr -i https://mirror.sjtu.edu.cn/pypi/web/simple</span>
  133. </pre></div>
  134. </div>
  135. <p>or install from source code</p>
  136. <div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>git clone https://github.com/alibaba/FunASR.git <span class="o">&amp;&amp;</span> <span class="nb">cd</span> FunASR
  137. <span class="nb">cd</span> funasr/runtime/python/onnxruntime
  138. pip install -e ./
  139. <span class="c1"># For the users in China, you could install with the command:</span>
  140. <span class="c1"># pip install -e ./ -i https://mirror.sjtu.edu.cn/pypi/web/simple</span>
  141. </pre></div>
  142. </div>
  143. </div>
  144. <div class="section" id="inference-with-runtime">
  145. <h2>Inference with runtime<a class="headerlink" href="#inference-with-runtime" title="Permalink to this headline"></a></h2>
  146. <div class="section" id="speech-recognition">
  147. <h3>Speech Recognition<a class="headerlink" href="#speech-recognition" title="Permalink to this headline"></a></h3>
  148. <div class="section" id="paraformer">
  149. <h4>Paraformer<a class="headerlink" href="#paraformer" title="Permalink to this headline"></a></h4>
  150. <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">funasr_onnx</span> <span class="kn">import</span> <span class="n">Paraformer</span>
  151. <span class="kn">from</span> <span class="nn">pathlib</span> <span class="kn">import</span> <span class="n">Path</span>
  152. <span class="n">model_dir</span> <span class="o">=</span> <span class="s2">&quot;damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch&quot;</span>
  153. <span class="n">model</span> <span class="o">=</span> <span class="n">Paraformer</span><span class="p">(</span><span class="n">model_dir</span><span class="p">,</span> <span class="n">batch_size</span><span class="o">=</span><span class="mi">1</span><span class="p">,</span> <span class="n">quantize</span><span class="o">=</span><span class="kc">True</span><span class="p">)</span>
  154. <span class="n">wav_path</span> <span class="o">=</span> <span class="p">[</span><span class="s1">&#39;</span><span class="si">{}</span><span class="s1">/.cache/modelscope/hub/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/example/asr_example.wav&#39;</span><span class="o">.</span><span class="n">format</span><span class="p">(</span><span class="n">Path</span><span class="o">.</span><span class="n">home</span><span class="p">())]</span>
  155. <span class="n">result</span> <span class="o">=</span> <span class="n">model</span><span class="p">(</span><span class="n">wav_path</span><span class="p">)</span>
  156. <span class="nb">print</span><span class="p">(</span><span class="n">result</span><span class="p">)</span>
  157. </pre></div>
  158. </div>
  159. <ul class="simple">
  160. <li><p><code class="docutils literal notranslate"><span class="pre">model_dir</span></code>: model_name in modelscope or local path downloaded from modelscope. If the local path is set, it should contain <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code>, <code class="docutils literal notranslate"><span class="pre">config.yaml</span></code>, <code class="docutils literal notranslate"><span class="pre">am.mvn</span></code></p></li>
  161. <li><p><code class="docutils literal notranslate"><span class="pre">batch_size</span></code>: <code class="docutils literal notranslate"><span class="pre">1</span></code> (Default), the batch size duration inference</p></li>
  162. <li><p><code class="docutils literal notranslate"><span class="pre">device_id</span></code>: <code class="docutils literal notranslate"><span class="pre">-1</span></code> (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)</p></li>
  163. <li><p><code class="docutils literal notranslate"><span class="pre">quantize</span></code>: <code class="docutils literal notranslate"><span class="pre">False</span></code> (Default), load the model of <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code>. If set <code class="docutils literal notranslate"><span class="pre">True</span></code>, load the model of <code class="docutils literal notranslate"><span class="pre">model_quant.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code></p></li>
  164. <li><p><code class="docutils literal notranslate"><span class="pre">intra_op_num_threads</span></code>: <code class="docutils literal notranslate"><span class="pre">4</span></code> (Default), sets the number of threads used for intraop parallelism on CPU</p></li>
  165. </ul>
  166. <p>Input: wav formt file, support formats: <code class="docutils literal notranslate"><span class="pre">str,</span> <span class="pre">np.ndarray,</span> <span class="pre">List[str]</span></code></p>
  167. <p>Output: <code class="docutils literal notranslate"><span class="pre">List[str]</span></code>: recognition result</p>
  168. </div>
  169. <div class="section" id="paraformer-online">
  170. <h4>Paraformer-online<a class="headerlink" href="#paraformer-online" title="Permalink to this headline"></a></h4>
  171. </div>
  172. </div>
  173. <div class="section" id="voice-activity-detection">
  174. <h3>Voice Activity Detection<a class="headerlink" href="#voice-activity-detection" title="Permalink to this headline"></a></h3>
  175. <div class="section" id="fsmn-vad">
  176. <h4>FSMN-VAD<a class="headerlink" href="#fsmn-vad" title="Permalink to this headline"></a></h4>
  177. <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">funasr_onnx</span> <span class="kn">import</span> <span class="n">Fsmn_vad</span>
  178. <span class="kn">from</span> <span class="nn">pathlib</span> <span class="kn">import</span> <span class="n">Path</span>
  179. <span class="n">model_dir</span> <span class="o">=</span> <span class="s2">&quot;damo/speech_fsmn_vad_zh-cn-16k-common-pytorch&quot;</span>
  180. <span class="n">wav_path</span> <span class="o">=</span> <span class="s1">&#39;</span><span class="si">{}</span><span class="s1">/.cache/modelscope/hub/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch/example/vad_example.wav&#39;</span><span class="o">.</span><span class="n">format</span><span class="p">(</span><span class="n">Path</span><span class="o">.</span><span class="n">home</span><span class="p">())</span>
  181. <span class="n">model</span> <span class="o">=</span> <span class="n">Fsmn_vad</span><span class="p">(</span><span class="n">model_dir</span><span class="p">)</span>
  182. <span class="n">result</span> <span class="o">=</span> <span class="n">model</span><span class="p">(</span><span class="n">wav_path</span><span class="p">)</span>
  183. <span class="nb">print</span><span class="p">(</span><span class="n">result</span><span class="p">)</span>
  184. </pre></div>
  185. </div>
  186. <ul class="simple">
  187. <li><p><code class="docutils literal notranslate"><span class="pre">model_dir</span></code>: model_name in modelscope or local path downloaded from modelscope. If the local path is set, it should contain <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code>, <code class="docutils literal notranslate"><span class="pre">config.yaml</span></code>, <code class="docutils literal notranslate"><span class="pre">am.mvn</span></code></p></li>
  188. <li><p><code class="docutils literal notranslate"><span class="pre">batch_size</span></code>: <code class="docutils literal notranslate"><span class="pre">1</span></code> (Default), the batch size duration inference</p></li>
  189. <li><p><code class="docutils literal notranslate"><span class="pre">device_id</span></code>: <code class="docutils literal notranslate"><span class="pre">-1</span></code> (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)</p></li>
  190. <li><p><code class="docutils literal notranslate"><span class="pre">quantize</span></code>: <code class="docutils literal notranslate"><span class="pre">False</span></code> (Default), load the model of <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code>. If set <code class="docutils literal notranslate"><span class="pre">True</span></code>, load the model of <code class="docutils literal notranslate"><span class="pre">model_quant.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code></p></li>
  191. <li><p><code class="docutils literal notranslate"><span class="pre">intra_op_num_threads</span></code>: <code class="docutils literal notranslate"><span class="pre">4</span></code> (Default), sets the number of threads used for intraop parallelism on CPU</p></li>
  192. </ul>
  193. <p>Input: wav formt file, support formats: <code class="docutils literal notranslate"><span class="pre">str,</span> <span class="pre">np.ndarray,</span> <span class="pre">List[str]</span></code></p>
  194. <p>Output: <code class="docutils literal notranslate"><span class="pre">List[str]</span></code>: recognition result</p>
  195. </div>
  196. <div class="section" id="fsmn-vad-online">
  197. <h4>FSMN-VAD-online<a class="headerlink" href="#fsmn-vad-online" title="Permalink to this headline"></a></h4>
  198. <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">funasr_onnx</span> <span class="kn">import</span> <span class="n">Fsmn_vad_online</span>
  199. <span class="kn">import</span> <span class="nn">soundfile</span>
  200. <span class="kn">from</span> <span class="nn">pathlib</span> <span class="kn">import</span> <span class="n">Path</span>
  201. <span class="n">model_dir</span> <span class="o">=</span> <span class="s2">&quot;damo/speech_fsmn_vad_zh-cn-16k-common-pytorch&quot;</span>
  202. <span class="n">wav_path</span> <span class="o">=</span> <span class="s1">&#39;</span><span class="si">{}</span><span class="s1">/.cache/modelscope/hub/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch/example/vad_example.wav&#39;</span><span class="o">.</span><span class="n">format</span><span class="p">(</span><span class="n">Path</span><span class="o">.</span><span class="n">home</span><span class="p">())</span>
  203. <span class="n">model</span> <span class="o">=</span> <span class="n">Fsmn_vad_online</span><span class="p">(</span><span class="n">model_dir</span><span class="p">)</span>
  204. <span class="c1">##online vad</span>
  205. <span class="n">speech</span><span class="p">,</span> <span class="n">sample_rate</span> <span class="o">=</span> <span class="n">soundfile</span><span class="o">.</span><span class="n">read</span><span class="p">(</span><span class="n">wav_path</span><span class="p">)</span>
  206. <span class="n">speech_length</span> <span class="o">=</span> <span class="n">speech</span><span class="o">.</span><span class="n">shape</span><span class="p">[</span><span class="mi">0</span><span class="p">]</span>
  207. <span class="c1">#</span>
  208. <span class="n">sample_offset</span> <span class="o">=</span> <span class="mi">0</span>
  209. <span class="n">step</span> <span class="o">=</span> <span class="mi">1600</span>
  210. <span class="n">param_dict</span> <span class="o">=</span> <span class="p">{</span><span class="s1">&#39;in_cache&#39;</span><span class="p">:</span> <span class="p">[]}</span>
  211. <span class="k">for</span> <span class="n">sample_offset</span> <span class="ow">in</span> <span class="nb">range</span><span class="p">(</span><span class="mi">0</span><span class="p">,</span> <span class="n">speech_length</span><span class="p">,</span> <span class="nb">min</span><span class="p">(</span><span class="n">step</span><span class="p">,</span> <span class="n">speech_length</span> <span class="o">-</span> <span class="n">sample_offset</span><span class="p">)):</span>
  212. <span class="k">if</span> <span class="n">sample_offset</span> <span class="o">+</span> <span class="n">step</span> <span class="o">&gt;=</span> <span class="n">speech_length</span> <span class="o">-</span> <span class="mi">1</span><span class="p">:</span>
  213. <span class="n">step</span> <span class="o">=</span> <span class="n">speech_length</span> <span class="o">-</span> <span class="n">sample_offset</span>
  214. <span class="n">is_final</span> <span class="o">=</span> <span class="kc">True</span>
  215. <span class="k">else</span><span class="p">:</span>
  216. <span class="n">is_final</span> <span class="o">=</span> <span class="kc">False</span>
  217. <span class="n">param_dict</span><span class="p">[</span><span class="s1">&#39;is_final&#39;</span><span class="p">]</span> <span class="o">=</span> <span class="n">is_final</span>
  218. <span class="n">segments_result</span> <span class="o">=</span> <span class="n">model</span><span class="p">(</span><span class="n">audio_in</span><span class="o">=</span><span class="n">speech</span><span class="p">[</span><span class="n">sample_offset</span><span class="p">:</span> <span class="n">sample_offset</span> <span class="o">+</span> <span class="n">step</span><span class="p">],</span>
  219. <span class="n">param_dict</span><span class="o">=</span><span class="n">param_dict</span><span class="p">)</span>
  220. <span class="k">if</span> <span class="n">segments_result</span><span class="p">:</span>
  221. <span class="nb">print</span><span class="p">(</span><span class="n">segments_result</span><span class="p">)</span>
  222. </pre></div>
  223. </div>
  224. <ul class="simple">
  225. <li><p><code class="docutils literal notranslate"><span class="pre">model_dir</span></code>: model_name in modelscope or local path downloaded from modelscope. If the local path is set, it should contain <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code>, <code class="docutils literal notranslate"><span class="pre">config.yaml</span></code>, <code class="docutils literal notranslate"><span class="pre">am.mvn</span></code></p></li>
  226. <li><p><code class="docutils literal notranslate"><span class="pre">batch_size</span></code>: <code class="docutils literal notranslate"><span class="pre">1</span></code> (Default), the batch size duration inference</p></li>
  227. <li><p><code class="docutils literal notranslate"><span class="pre">device_id</span></code>: <code class="docutils literal notranslate"><span class="pre">-1</span></code> (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)</p></li>
  228. <li><p><code class="docutils literal notranslate"><span class="pre">quantize</span></code>: <code class="docutils literal notranslate"><span class="pre">False</span></code> (Default), load the model of <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code>. If set <code class="docutils literal notranslate"><span class="pre">True</span></code>, load the model of <code class="docutils literal notranslate"><span class="pre">model_quant.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code></p></li>
  229. <li><p><code class="docutils literal notranslate"><span class="pre">intra_op_num_threads</span></code>: <code class="docutils literal notranslate"><span class="pre">4</span></code> (Default), sets the number of threads used for intraop parallelism on CPU</p></li>
  230. </ul>
  231. <p>Input: wav formt file, support formats: <code class="docutils literal notranslate"><span class="pre">str,</span> <span class="pre">np.ndarray,</span> <span class="pre">List[str]</span></code></p>
  232. <p>Output: <code class="docutils literal notranslate"><span class="pre">List[str]</span></code>: recognition result</p>
  233. </div>
  234. </div>
  235. <div class="section" id="punctuation-restoration">
  236. <h3>Punctuation Restoration<a class="headerlink" href="#punctuation-restoration" title="Permalink to this headline"></a></h3>
  237. <div class="section" id="ct-transformer">
  238. <h4>CT-Transformer<a class="headerlink" href="#ct-transformer" title="Permalink to this headline"></a></h4>
  239. <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">funasr_onnx</span> <span class="kn">import</span> <span class="n">CT_Transformer</span>
  240. <span class="n">model_dir</span> <span class="o">=</span> <span class="s2">&quot;damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch&quot;</span>
  241. <span class="n">model</span> <span class="o">=</span> <span class="n">CT_Transformer</span><span class="p">(</span><span class="n">model_dir</span><span class="p">)</span>
  242. <span class="n">text_in</span><span class="o">=</span><span class="s2">&quot;跨境河流是养育沿岸人民的生命之源长期以来为帮助下游地区防灾减灾中方技术人员在上游地区极为恶劣的自然条件下克服巨大困难甚至冒着生命危险向印方提供汛期水文资料处理紧急事件中方重视印方在跨境河流问题上的关切愿意进一步完善双方联合工作机制凡是中方能做的我们都会去做而且会做得更好我请印度朋友们放心中国在上游的任何开发利用都会经过科学规划和论证兼顾上下游的利益&quot;</span>
  243. <span class="n">result</span> <span class="o">=</span> <span class="n">model</span><span class="p">(</span><span class="n">text_in</span><span class="p">)</span>
  244. <span class="nb">print</span><span class="p">(</span><span class="n">result</span><span class="p">[</span><span class="mi">0</span><span class="p">])</span>
  245. </pre></div>
  246. </div>
  247. <ul class="simple">
  248. <li><p><code class="docutils literal notranslate"><span class="pre">model_dir</span></code>: model_name in modelscope or local path downloaded from modelscope. If the local path is set, it should contain <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code>, <code class="docutils literal notranslate"><span class="pre">config.yaml</span></code>, <code class="docutils literal notranslate"><span class="pre">am.mvn</span></code></p></li>
  249. <li><p><code class="docutils literal notranslate"><span class="pre">device_id</span></code>: <code class="docutils literal notranslate"><span class="pre">-1</span></code> (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)</p></li>
  250. <li><p><code class="docutils literal notranslate"><span class="pre">quantize</span></code>: <code class="docutils literal notranslate"><span class="pre">False</span></code> (Default), load the model of <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code>. If set <code class="docutils literal notranslate"><span class="pre">True</span></code>, load the model of <code class="docutils literal notranslate"><span class="pre">model_quant.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code></p></li>
  251. <li><p><code class="docutils literal notranslate"><span class="pre">intra_op_num_threads</span></code>: <code class="docutils literal notranslate"><span class="pre">4</span></code> (Default), sets the number of threads used for intraop parallelism on CPU</p></li>
  252. </ul>
  253. <p>Input: <code class="docutils literal notranslate"><span class="pre">str</span></code>, raw text of asr result</p>
  254. <p>Output: <code class="docutils literal notranslate"><span class="pre">List[str]</span></code>: recognition result</p>
  255. </div>
  256. <div class="section" id="ct-transformer-online">
  257. <h4>CT-Transformer-online<a class="headerlink" href="#ct-transformer-online" title="Permalink to this headline"></a></h4>
  258. <div class="highlight-python notranslate"><div class="highlight"><pre><span></span><span class="kn">from</span> <span class="nn">funasr_onnx</span> <span class="kn">import</span> <span class="n">CT_Transformer_VadRealtime</span>
  259. <span class="n">model_dir</span> <span class="o">=</span> <span class="s2">&quot;damo/punc_ct-transformer_zh-cn-common-vad_realtime-vocab272727&quot;</span>
  260. <span class="n">model</span> <span class="o">=</span> <span class="n">CT_Transformer_VadRealtime</span><span class="p">(</span><span class="n">model_dir</span><span class="p">)</span>
  261. <span class="n">text_in</span> <span class="o">=</span> <span class="s2">&quot;跨境河流是养育沿岸|人民的生命之源长期以来为帮助下游地区防灾减灾中方技术人员|在上游地区极为恶劣的自然条件下克服巨大困难甚至冒着生命危险|向印方提供汛期水文资料处理紧急事件中方重视印方在跨境河流&gt;问题上的关切|愿意进一步完善双方联合工作机制|凡是|中方能做的我们|都会去做而且会做得更好我请印度朋友们放心中国在上游的|任何开发利用都会经过科学|规划和论证兼顾上下游的利益&quot;</span>
  262. <span class="n">vads</span> <span class="o">=</span> <span class="n">text_in</span><span class="o">.</span><span class="n">split</span><span class="p">(</span><span class="s2">&quot;|&quot;</span><span class="p">)</span>
  263. <span class="n">rec_result_all</span><span class="o">=</span><span class="s2">&quot;&quot;</span>
  264. <span class="n">param_dict</span> <span class="o">=</span> <span class="p">{</span><span class="s2">&quot;cache&quot;</span><span class="p">:</span> <span class="p">[]}</span>
  265. <span class="k">for</span> <span class="n">vad</span> <span class="ow">in</span> <span class="n">vads</span><span class="p">:</span>
  266. <span class="n">result</span> <span class="o">=</span> <span class="n">model</span><span class="p">(</span><span class="n">vad</span><span class="p">,</span> <span class="n">param_dict</span><span class="o">=</span><span class="n">param_dict</span><span class="p">)</span>
  267. <span class="n">rec_result_all</span> <span class="o">+=</span> <span class="n">result</span><span class="p">[</span><span class="mi">0</span><span class="p">]</span>
  268. <span class="nb">print</span><span class="p">(</span><span class="n">rec_result_all</span><span class="p">)</span>
  269. </pre></div>
  270. </div>
  271. <ul class="simple">
  272. <li><p><code class="docutils literal notranslate"><span class="pre">model_dir</span></code>: model_name in modelscope or local path downloaded from modelscope. If the local path is set, it should contain <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code>, <code class="docutils literal notranslate"><span class="pre">config.yaml</span></code>, <code class="docutils literal notranslate"><span class="pre">am.mvn</span></code></p></li>
  273. <li><p><code class="docutils literal notranslate"><span class="pre">device_id</span></code>: <code class="docutils literal notranslate"><span class="pre">-1</span></code> (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu)</p></li>
  274. <li><p><code class="docutils literal notranslate"><span class="pre">quantize</span></code>: <code class="docutils literal notranslate"><span class="pre">False</span></code> (Default), load the model of <code class="docutils literal notranslate"><span class="pre">model.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code>. If set <code class="docutils literal notranslate"><span class="pre">True</span></code>, load the model of <code class="docutils literal notranslate"><span class="pre">model_quant.onnx</span></code> in <code class="docutils literal notranslate"><span class="pre">model_dir</span></code></p></li>
  275. <li><p><code class="docutils literal notranslate"><span class="pre">intra_op_num_threads</span></code>: <code class="docutils literal notranslate"><span class="pre">4</span></code> (Default), sets the number of threads used for intraop parallelism on CPU</p></li>
  276. </ul>
  277. <p>Input: <code class="docutils literal notranslate"><span class="pre">str</span></code>, raw text of asr result</p>
  278. <p>Output: <code class="docutils literal notranslate"><span class="pre">List[str]</span></code>: recognition result</p>
  279. </div>
  280. </div>
  281. </div>
  282. <div class="section" id="performance-benchmark">
  283. <h2>Performance benchmark<a class="headerlink" href="#performance-benchmark" title="Permalink to this headline"></a></h2>
  284. <p>Please ref to <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR/blob/main/runtime/docs/benchmark_onnx.md">benchmark</a></p>
  285. </div>
  286. <div class="section" id="acknowledge">
  287. <h2>Acknowledge<a class="headerlink" href="#acknowledge" title="Permalink to this headline"></a></h2>
  288. <ol class="simple">
  289. <li><p>This project is maintained by <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR">FunASR community</a>.</p></li>
  290. <li><p>We partially refer <a class="reference external" href="https://github.com/RapidAI/RapidASR">SWHL</a> for onnxruntime (only for paraformer model).</p></li>
  291. </ol>
  292. </div>
  293. </div>
  294. </div>
  295. </div>
  296. <footer>
  297. <hr/>
  298. <div role="contentinfo">
  299. <p>&#169; Copyright 2022, Speech Lab, Alibaba Group.</p>
  300. </div>
  301. Built with <a href="https://www.sphinx-doc.org/">Sphinx</a> using a
  302. <a href="https://github.com/readthedocs/sphinx_rtd_theme">theme</a>
  303. provided by <a href="https://readthedocs.org">Read the Docs</a>.
  304. </footer>
  305. </div>
  306. </div>
  307. </section>
  308. </div>
  309. <script>
  310. jQuery(function () {
  311. SphinxRtdTheme.Navigation.enable(true);
  312. });
  313. </script>
  314. </body>
  315. </html>