From add315bdb35e09fe705d4eab39e4d2386734f4ae Mon Sep 17 00:00:00 2001
From: 游雁 <zhifu.gzf@alibaba-inc.com>
Date: Fri, 17 Mar 2023 15:50:51 +0800
Subject: [PATCH] Merge branch 'main' of github.com:alibaba-damo-academy/FunASR add

---
 funasr/runtime/python/benchmark_onnx.md |   70 ++++++++++++++++++++++++++--------
 1 files changed, 53 insertions(+), 17 deletions(-)

diff --git a/funasr/runtime/python/benchmark_onnx.md b/funasr/runtime/python/benchmark_onnx.md
index b45f9ed..d2c114e 100644
--- a/funasr/runtime/python/benchmark_onnx.md
+++ b/funasr/runtime/python/benchmark_onnx.md
@@ -1,23 +1,43 @@
-Benchmark [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) based on Aishell1 test set , the total audio duration is 36108.919 seconds.
+# Benchmark
 
-(Note: The service has been fully warm up.)
+### Data set:
+Aishell1 test set , the total audio duration is 36108.919 seconds. (Note: The service has been fully warm up.)
+
+### Tools
+- Install
+```shell
+git clone https://github.com/alibaba-damo-academy/FunASR.git && cd FunASR
+pip install --editable ./
+cd funasr/runtime/python/utils
+pip install -r requirements.txt
+```
+
+- recipe
+set the model, data path and output_dir
+
+```shell
+nohup bash test_rtf.sh &> log.txt &
+```
+
+
+## [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
 
 ### Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz 16core-32processor with avx512_vnni
 
-| concurrent-tasks | processing time(s) |  RTF   | Speedup Rate |
-|:----------------:|:------------------:|:------:|:------------:|
-|  1 (onnx fp32)   |        2806        | 0.0777 |     12.9     |
-|  1 (onnx int8)   |        1611        | 0.0446 |     22.4     |
-|  8 (onnx fp32)   |        538         | 0.0149 |     67.1     |
-|  8 (onnx int8)   |        210         | 0.0058 |    172.4     |
-|  16 (onnx fp32)  |        288         | 0.0080 |    125.2     |
-|  16 (onnx int8)  |        117         | 0.0032 |    309.9     |
-|  32 (onnx fp32)  |        167         | 0.0046 |    216.5     |
-|  32 (onnx int8)  |        107         | 0.0030 |    338.0     |
-|  64 (onnx fp32)  |        158         | 0.0044 |    228.1     |
-|  64 (onnx int8)  |         82         | 0.0023 |    442.8     |
-|  96 (onnx fp32)  |        151         | 0.0042 |    238.0     |
-|  96 (onnx int8)  |         80         | 0.0022 |    452.0     |
+| concurrent-tasks | processing time(s) |   RTF   | Speedup Rate |
+|:----------------:|:------------------:|:-------:|:------------:|
+|  1 (onnx fp32)   |        2806        | 0.0777  |     12.9     |
+|  1 (onnx int8)   |        1611        | 0.0446  |     22.4     |
+|  8 (onnx fp32)   |        538         | 0.0149  |     67.1     |
+|  8 (onnx int8)   |        210         | 0.0058  |    172.4     |
+|  16 (onnx fp32)  |        288         | 0.0080  |    125.2     |
+|  16 (onnx int8)  |        117         | 0.0032  |    309.9     |
+|  32 (onnx fp32)  |        167         | 0.0046  |    216.5     |
+|  32 (onnx int8)  |         86         | 0.0024  |    420.0     |
+|  64 (onnx fp32)  |        158         | 0.0044  |    228.1     |
+|  64 (onnx int8)  |         82         | 0.0023  |    442.8     |
+|  96 (onnx fp32)  |        151         | 0.0042  |    238.0     |
+|  96 (onnx int8)  |         80         | 0.0022  |    452.0     |
 
 ### Intel(R) Xeon(R) Platinum 8269CY CPU @ 2.50GHz 16core-32processor with avx512_vnni
 
@@ -32,4 +52,20 @@
 |  64 (onnx int8)  |         87         | 0.0024 |    414.7     |
 
 
-###
\ No newline at end of file
+### Intel(R) Xeon(R) Platinum 8163 CPU @ 2.50GHz 32core-64processor without avx512_vnni
+
+
+| concurrent-tasks | processing time(s) |  RTF   | Speedup Rate |
+|:----------------:|:------------------:|:------:|:------------:|
+|  1 (onnx fp32)   |        2959        | 0.0820 |     12.2     |
+|  1 (onnx int8)   |        2814        | 0.0778 |     12.8     |
+|  16 (onnx fp32)  |        373         | 0.0103 |     96.9     |
+|  16 (onnx int8)  |        331         | 0.0091 |    109.0     |
+|  32 (onnx fp32)  |        211         | 0.0058 |    171.4     |
+|  32 (onnx int8)  |        181         | 0.0050 |    200.0     |
+|  64 (onnx fp32)  |        153         | 0.0042 |    235.9     |
+|  64 (onnx int8)  |        103         | 0.0029 |    349.9     |
+|  96 (onnx fp32)  |        146         | 0.0041 |    247.0     |
+|  96 (onnx int8)  |        108         | 0.0030 |    334.1     |
+
+## [Paraformer](https://modelscope.cn/models/damo/speech_paraformer_asr_nat-zh-cn-16k-common-vocab8358-tensorflow1/summary)
--
Gitblit v1.9.1
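
Note on the benchmark tables in this patch: the RTF and Speedup Rate columns look consistent with RTF = processing time / total audio duration and Speedup Rate = total audio duration / processing time, using the 36108.919 s total stated for the Aishell1 test set. The patch itself does not spell out these formulas, so the Python sketch below is an assumption intended only for sanity-checking rows; the names `rtf` and `speedup_rate` are illustrative and are not part of FunASR.

```python
# Total audio duration of the Aishell1 test set, as stated in the patch.
TOTAL_AUDIO_SECONDS = 36108.919


def rtf(processing_seconds: float, audio_seconds: float = TOTAL_AUDIO_SECONDS) -> float:
    """Real-time factor: seconds of compute per second of audio (assumed definition)."""
    return processing_seconds / audio_seconds


def speedup_rate(processing_seconds: float, audio_seconds: float = TOTAL_AUDIO_SECONDS) -> float:
    """Inverse of RTF: seconds of audio processed per second of compute (assumed definition)."""
    return audio_seconds / processing_seconds


if __name__ == "__main__":
    # Two single-task rows from the 8369B table: (label, processing time in seconds).
    rows = [("1 (onnx fp32)", 2806), ("1 (onnx int8)", 1611)]
    for label, seconds in rows:
        print(f"{label}: RTF={rtf(seconds):.4f}, speedup={speedup_rate(seconds):.1f}")
```

For these two rows the sketch reproduces the published values (0.0777 / 12.9 and 0.0446 / 22.4). Some other rows differ in the last digit, presumably because the processing times in the tables are rounded to whole seconds.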