From 2868fe3df4e92a6ae3e327faf6e57ea492e04124 Mon Sep 17 00:00:00 2001 From: 志浩 <neo.dzh@alibaba-inc.com> Date: 星期四, 16 三月 2023 19:24:21 +0800 Subject: [PATCH] Merge branch 'main' into dev_dzh --- funasr/runtime/python/libtorch/README.md | 67 +++++++++++++++++++++++++++++++++ 1 files changed, 67 insertions(+), 0 deletions(-) diff --git a/funasr/runtime/python/libtorch/README.md b/funasr/runtime/python/libtorch/README.md new file mode 100644 index 0000000..1e2d919 --- /dev/null +++ b/funasr/runtime/python/libtorch/README.md @@ -0,0 +1,67 @@ +## Using paraformer with libtorch + + +### Introduction +- Model comes from [speech_paraformer](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary). + +### Steps: +1. Export the model. + - Command: (`Tips`: torch >= 1.11.0 is required.) + + ```shell + python -m funasr.export.export_model [model_name] [export_dir] false + ``` + `model_name`: the model is to export. + + `export_dir`: the dir where the onnx is export. + + More details ref to ([export docs](https://github.com/alibaba-damo-academy/FunASR/tree/main/funasr/export)) + + - `e.g.`, Export model from modelscope + ```shell + python -m funasr.export.export_model 'damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' "./export" false + ``` + - `e.g.`, Export model from local path, the model'name must be `model.pb`. + ```shell + python -m funasr.export.export_model '/mnt/workspace/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch' "./export" false + ``` + + +2. Install the `torch_paraformer`. + ```shell + git clone https://github.com/alibaba/FunASR.git && cd FunASR + cd funasr/runtime/python/libtorch + python setup.py install + ``` + + +3. Run the demo. + - Model_dir: the model path, which contains `model.torchscripts`, `config.yaml`, `am.mvn`. + - Input: wav formt file, support formats: `str, np.ndarray, List[str]` + - Output: `List[str]`: recognition result. + - Example: + ```python + from torch_paraformer import Paraformer + + model_dir = "/nfs/zhifu.gzf/export/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch" + model = Paraformer(model_dir, batch_size=1) + + wav_path = ['/nfs/zhifu.gzf/export/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/example/asr_example.wav'] + + result = model(wav_path) + print(result) + ``` + +## Speed + +Environment锛欼ntel(R) Xeon(R) Platinum 8163 CPU @ 2.50GHz + +Test [wav, 5.53s, 100 times avg.](https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/asr_example_zh.wav) + +| Backend | RTF (FP32) | +|:--------:|:----------:| +| Pytorch | 0.110 | +| Libtorch | 0.048 | +| Onnx | 0.038 | + +## Acknowledge -- Gitblit v1.9.1