python/FunASR-XL.git

			@@ -5,15 +5,21 @@

			```shell
			# pip3 install torch torchaudio
			pip install -U modelscope funasr
			pip3 install -U modelscope funasr
			# For the users in China, you could install with the command:
			# pip install -U modelscope funasr -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html -i https://mirror.sjtu.edu.cn/pypi/web/simple
			# pip3 install -U modelscope funasr -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html -i https://mirror.sjtu.edu.cn/pypi/web/simple
			```

			### Export [onnx model](https://github.com/alibaba-damo-academy/FunASR/tree/main/funasr/export)

			```shell
			python -m funasr.export.export_model --model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch --export-dir ./export --type onnx --quantize True
			python -m funasr.export.export_model \
			--export-dir ./export \
			--type onnx \
			--quantize True \
			--model-name damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch \
			--model-name damo/speech_fsmn_vad_zh-cn-16k-common-pytorch \
			--model-name damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch
			```

			## Building for Linux/Unix
			@@ -36,10 +42,11 @@
			required openssl lib

			```shell
			#install openssl lib first
			apt-get install libssl-dev
			apt-get install libssl-dev #ubuntu
			# yum install openssl-devel #centos

			git clone https://github.com/alibaba-damo-academy/FunASR.git && cd funasr/runtime/websocket

			git clone https://github.com/alibaba-damo-academy/FunASR.git && cd FunASR/funasr/runtime/websocket
			mkdir build && cd build
			cmake -DCMAKE_BUILD_TYPE=release .. -DONNXRUNTIME_DIR=/path/to/onnxruntime-linux-x64-1.14.0
			make
			@@ -48,56 +55,89 @@

			```shell
			cd bin
			./websocketmain [--model_thread_num <int>] [--decoder_thread_num <int>]
			[--io_thread_num <int>] [--port <int>] [--listen_ip
			./funasr-wss-server [--download-model-dir <string>]
			[--model-thread-num <int>] [--decoder-thread-num <int>]
			[--io-thread-num <int>] [--port <int>] [--listen_ip
			<string>] [--punc-quant <string>] [--punc-dir <string>]
			[--vad-quant <string>] [--vad-dir <string>] [--quantize
			<string>] --model-dir <string> [--keyfile <string>]
			[--certfile <string>] [--] [--version] [-h]
			Where:
			--download-model-dir <string>
			Download model from Modelscope to download_model_dir

			--model-dir <string>
			(required) the asr model path, which contains model.onnx, config.yaml, am.mvn
			default: /workspace/models/asr, the asr model path, which contains model_quant.onnx, config.yaml, am.mvn
			--quantize <string>
			false (Default), load the model of model.onnx in model_dir. If set true, load the model of model_quant.onnx in model_dir
			true (Default), load the model of model_quant.onnx in model_dir. If set false, load the model of model.onnx in model_dir

			--vad-dir <string>
			the vad model path, which contains model.onnx, vad.yaml, vad.mvn
			default: /workspace/models/vad, the vad model path, which contains model_quant.onnx, vad.yaml, vad.mvn
			--vad-quant <string>
			false (Default), load the model of model.onnx in vad_dir. If set true, load the model of model_quant.onnx in vad_dir
			true (Default), load the model of model_quant.onnx in vad_dir. If set false, load the model of model.onnx in vad_dir

			--punc-dir <string>
			the punc model path, which contains model.onnx, punc.yaml
			default: /workspace/models/punc, the punc model path, which contains model_quant.onnx, punc.yaml
			--punc-quant <string>
			false (Default), load the model of model.onnx in punc_dir. If set true, load the model of model_quant.onnx in punc_dir
			true (Default), load the model of model_quant.onnx in punc_dir. If set false, load the model of model.onnx in punc_dir

			--decoder_thread_num <int>
			--decoder-thread-num <int>
			number of threads for decoder, default:8
			--io_thread_num <int>
			--io-thread-num <int>
			number of threads for network io, default:8
			--port <int>
			listen port, default:8889
			listen port, default:10095
			--certfile <string>
			path of certficate for WSS connection. if it is empty, it will be in WS mode.
			default: ../../../ssl_key/server.crt, path of certficate for WSS connection. if it is empty, it will be in WS mode.
			--keyfile <string>
			path of keyfile for WSS connection
			default: ../../../ssl_key/server.key, path of keyfile for WSS connection

			Required: --model-dir <string>
			If use vad, please add: --vad-dir <string>
			If use punc, please add: --punc-dir <string>
			example:
			websocketmain --model-dir /FunASR/funasr/runtime/onnxruntime/export/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
			# you can use models downloaded from modelscope or local models:
			# download models from modelscope
			./funasr-wss-server \
			--download-model-dir /workspace/models \
			--model-dir damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-onnx \
			--vad-dir damo/speech_fsmn_vad_zh-cn-16k-common-onnx \
			--punc-dir damo/punc_ct-transformer_zh-cn-common-vocab272727-onnx

			# load models from local paths
			./funasr-wss-server \
			--model-dir /workspace/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-onnx \
			--vad-dir /workspace/models/damo/speech_fsmn_vad_zh-cn-16k-common-onnx \
			--punc-dir /workspace/models/damo/punc_ct-transformer_zh-cn-common-vocab272727-onnx

			```

			## Run websocket client test

			```shell
			Usage: ./websocketclient server_ip port wav_path threads_num is_ssl
			./funasr-wss-client --server-ip <string>
			--port <string>
			--wav-path <string>
			[--thread-num <int>]
			[--is-ssl <int>] [--]
			[--version] [-h]

			is_ssl is 1 means use wss connection, or use ws connection
			Where:
			--server-ip <string>
			(required) server-ip

			--port <string>
			(required) port

			--wav-path <string>
			(required) the input could be: wav_path, e.g.: asr_example.wav;
			pcm_path, e.g.: asr_example.pcm; wav.scp, kaldi style wav list (wav_id \t wav_path)

			--thread-num <int>
			thread-num

			--is-ssl <int>
			is-ssl is 1 means use wss connection, or use ws connection

			example:

			websocketclient 127.0.0.1 8889 funasr/runtime/websocket/test.pcm.wav 64 0
			./funasr-wss-client --server-ip 127.0.0.1 --port 10095 --wav-path test.wav --thread-num 1 --is-ssl 1

			result json, example like:
			{"mode":"offline","text":"欢迎大家来体验达摩院推出的语音识别模型","wav_name":"wav2"}