python/FunASR-XL.git

			@@ -30,6 +30,16 @@

			`fallback-num`: the number of fallback layers to use for automatic mixed-precision quantization.

			## Runtime Performance Benchmarks

			### Paraformer on CPU

			[ONNX Runtime](https://github.com/alibaba-damo-academy/FunASR/blob/main/funasr/runtime/python/benchmark_onnx.md)

			[LibTorch runtime](https://github.com/alibaba-damo-academy/FunASR/blob/main/funasr/runtime/python/benchmark_libtorch.md)

			### Paraformer on GPU

			[nv-triton](https://github.com/alibaba-damo-academy/FunASR/tree/main/funasr/runtime/triton_gpu)

			## Examples
			### Export a model in ONNX format