From 8dab6d184a034ca86eafa644ea0d2100aadfe27d Mon Sep 17 00:00:00 2001 From: jmwang66 <wangjiaming.wjm@alibaba-inc.com> Date: 星期二, 09 五月 2023 10:58:33 +0800 Subject: [PATCH] Merge pull request #473 from alibaba-damo-academy/dev_smohan --- funasr/runtime/python/benchmark_onnx.md | 92 ++++++++++++++++++++++++++++++++++++++++++++- 1 files changed, 89 insertions(+), 3 deletions(-) diff --git a/funasr/runtime/python/benchmark_onnx.md b/funasr/runtime/python/benchmark_onnx.md index e0584a5..2863ff8 100644 --- a/funasr/runtime/python/benchmark_onnx.md +++ b/funasr/runtime/python/benchmark_onnx.md @@ -1,6 +1,51 @@ -Benchmark [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) based on Aishell1 test set , the total audio duration is 36108.919 seconds. +# CPU Benchmark (ONNX-python) -(Note: The service has been fully warm up.) +## Configuration +### Data set: +Aishell1 [test set](https://www.openslr.org/33/) , the total audio duration is 36108.919 seconds. + +### Tools +#### Install Requirements +Install ModelScope and FunASR +```shell +pip install -U modelscope funasr +# For the users in China, you could install with the command: +#pip install -U funasr -i https://mirror.sjtu.edu.cn/pypi/web/simple +``` + +Install requirements +```shell +git clone https://github.com/alibaba-damo-academy/FunASR.git && cd FunASR +cd funasr/runtime/python/utils +pip install -r requirements.txt +``` + +#### Recipe + + +##### test_rtf +set the model, data path and output_dir +```shell +nohup bash test_rtf.sh &> log.txt & +``` + +##### test_cer +set the model, data path and output_dir +```shell +nohup bash test_cer.sh &> log.txt & +``` + +## [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) + +Number of Parameter: 220M + +Storage size: 880MB + +Storage size after int8-quant: 237MB + +CER: 1.95% + +CER after int8-quant: 1.95% ### Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz 16core-32processor with avx512_vnni @@ -32,4 +77,45 @@ | 64 (onnx int8) | 87 | 0.0024 | 414.7 | -### \ No newline at end of file +### Intel(R) Xeon(R) Platinum 8163 CPU @ 2.50GHz 32core-64processor without avx512_vnni + + +| concurrent-tasks | processing time(s) | RTF | Speedup Rate | +|:----------------:|:------------------:|:------:|:------------:| +| 1 (onnx fp32) | 2959 | 0.0820 | 12.2 | +| 1 (onnx int8) | 2814 | 0.0778 | 12.8 | +| 16 (onnx fp32) | 373 | 0.0103 | 96.9 | +| 16 (onnx int8) | 331 | 0.0091 | 109.0 | +| 32 (onnx fp32) | 211 | 0.0058 | 171.4 | +| 32 (onnx int8) | 181 | 0.0050 | 200.0 | +| 64 (onnx fp32) | 153 | 0.0042 | 235.9 | +| 64 (onnx int8) | 103 | 0.0029 | 349.9 | +| 96 (onnx fp32) | 146 | 0.0041 | 247.0 | +| 96 (onnx int8) | 108 | 0.0030 | 334.1 | + +## [Paraformer](https://modelscope.cn/models/damo/speech_paraformer_asr_nat-zh-cn-16k-common-vocab8358-tensorflow1/summary) + +Number of Parameter: 68M + +Storage size: 275MB + +Storage size after int8-quant: 81MB + +CER: 3.73% + +CER after int8-quant: 3.78% + + ### Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz 16core-32processor with avx512_vnni + +| concurrent-tasks | processing time(s) | RTF | Speedup Rate | +|:----------------:|:------------------:|:------:|:------------:| +| 1 (onnx fp32) | 1173 | 0.0325 | 30.8 | +| 1 (onnx int8) | 976 | 0.0270 | 37.0 | +| 16 (onnx fp32) | 91 | 0.0025 | 395.2 | +| 16 (onnx int8) | 78 | 0.0022 | 463.0 | +| 32 (onnx fp32) | 60 | 0.0017 | 598.8 | +| 32 (onnx int8) | 40 | 0.0011 | 892.9 | +| 64 (onnx fp32) | 55 | 0.0015 | 653.6 | +| 64 (onnx int8) | 31 | 0.0009 | 1162.8 | +| 96 (onnx fp32) | 57 | 0.0016 | 632.9 | +| 96 (onnx int8) | 33 | 0.0009 | 1098.9 | -- Gitblit v1.9.1