From 8f26a9acc2461ce0c77eacc3d36d3cef3457f520 Mon Sep 17 00:00:00 2001 From: speech_asr <wangjiaming.wjm@alibaba-inc.com> Date: 星期三, 29 三月 2023 15:49:46 +0800 Subject: [PATCH] Merge branch 'dev_wjm' of https://github.com/alibaba/FunASR into dev_wjm --- funasr/runtime/python/benchmark_onnx.md | 26 ++++++++++++++++++++++---- 1 files changed, 22 insertions(+), 4 deletions(-) diff --git a/funasr/runtime/python/benchmark_onnx.md b/funasr/runtime/python/benchmark_onnx.md index fe938ee..533798a 100644 --- a/funasr/runtime/python/benchmark_onnx.md +++ b/funasr/runtime/python/benchmark_onnx.md @@ -25,6 +25,16 @@ ## [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) +Number of Parameter: 220M + +Storage size: 880MB + +Storage size after int8-quant: 237MB + +CER: 1.95% + +CER after int8-quant: 1.95% + ### Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz 16core-32processor with avx512_vnni | concurrent-tasks | processing time(s) | RTF | Speedup Rate | @@ -73,10 +83,22 @@ ## [Paraformer](https://modelscope.cn/models/damo/speech_paraformer_asr_nat-zh-cn-16k-common-vocab8358-tensorflow1/summary) +Number of Parameter: 68M + +Storage size: 275MB + +Storage size after int8-quant: 81MB + +CER: 3.73% + +CER after int8-quant: 3.78% + ### Intel(R) Xeon(R) Platinum 8369B CPU @ 2.90GHz 16core-32processor with avx512_vnni | concurrent-tasks | processing time(s) | RTF | Speedup Rate | |:----------------:|:------------------:|:------:|:------------:| +| 1 (onnx fp32) | 1173 | 0.0325 | 30.8 | +| 1 (onnx int8) | 976 | 0.0270 | 37.0 | | 16 (onnx fp32) | 91 | 0.0025 | 395.2 | | 16 (onnx int8) | 78 | 0.0022 | 463.0 | | 32 (onnx fp32) | 60 | 0.0017 | 598.8 | @@ -85,7 +107,3 @@ | 64 (onnx int8) | 31 | 0.0009 | 1162.8 | | 96 (onnx fp32) | 57 | 0.0016 | 632.9 | | 96 (onnx int8) | 33 | 0.0009 | 1098.9 | - -[//]: # (| 1 (onnx fp32) | 2806 | 0.0777 | 12.9 |) - -[//]: # (| 1 (onnx int8) | 1611 | 0.0446 | 22.4 |) \ No newline at end of file -- Gitblit v1.9.1