|
|
3 years ago | |
|---|---|---|
| .. | ||
| inverse_text_normalization | 3 years ago | |
| text_normalization | 3 years ago | |
| README.md | 3 years ago | |
| __init__.py | 3 years ago | |
| install_pynini.sh | 3 years ago | |
| version.txt | 3 years ago | |
FunTextProcessing is a Python toolkit for fundamental text processing in ASR including text processing , inverse text processing, which is included in the FunASR.
Fun Text Processing, specifically (Inverse) Text Normalization, requires Pynini to be installed.
bash fun_text_processing/install_pynini.sh
Given text inputs, such as speech recognition results, use fun_text_processing/inverse_text_normalization/inverse_normalize.py to output ITN results. You may refer to the following example scripts.
ITN example for English
python fun_text_processing/inverse_text_normalization/inverse_normalize.py --text="one hundred twenty three" --language=en
ITN example for Indonesian
python fun_text_processing/inverse_text_normalization/inverse_normalize.py --text="ratus dua puluh tiga" --language=id --cache_dir ./model/ --output_file output.txt
cat output.txt
Arguments:
This project is licensed under the Apache-2.0 license. FunTextProcessing also contains various third-party components and some code modified from other repos under other open source licenses.