From c2e4e3c2e9be855277d9f4fa9cd0544892ff829a Mon Sep 17 00:00:00 2001
From: 游雁 <zhifu.gzf@alibaba-inc.com>
Date: Wed, 30 Aug 2023 09:57:30 +0800
Subject: [PATCH] Merge branch 'main' of github.com:alibaba-damo-academy/FunASR add

---
 funasr/runtime/python/grpc/Readme.md |   65 ++++++++------------------------
 1 files changed, 17 insertions(+), 48 deletions(-)

diff --git a/funasr/runtime/python/grpc/Readme.md b/funasr/runtime/python/grpc/Readme.md
index dc38b51..13723f2 100644
--- a/funasr/runtime/python/grpc/Readme.md
+++ b/funasr/runtime/python/grpc/Readme.md
@@ -1,58 +1,27 @@
-# Using paraformer with grpc
-We can send streaming audio data to server in real-time with grpc client every 10 ms e.g., and get transcribed text when stop speaking.
-The audio data is in streaming, the asr inference process is in offline.
+# GRPC python Client for 2pass decoding
+The client can send streaming or full audio data to server as you wish, and get transcribed text once the server respond (depends on mode)

+In the demo client, audio_chunk_duration is set to 1000ms, and send_interval is set to 100ms

-## Steps
-
-Step 1) Optional, prepare server environment (on server). Install modelscope and funasr with pip or with cuda-docker image.
-
- Option 1: Install modelscope and funasr with [pip](https://github.com/alibaba-damo-academy/FunASR#installation)
-
- Option 2: or install with cuda-docker image as:
-
-```
-CID=`docker run --network host -d -it --gpus '"device=0"' registry.cn-hangzhou.aliyuncs.com/modelscope-repo/modelscope:ubuntu20.04-cuda11.3.0-py37-torch1.11.0-tf1.15.5-1.2.0`
-echo $CID
-docker exec -it $CID /bin/bash
-```
- Get funasr source code and get into grpc directory.
-```
-git clone https://github.com/alibaba-damo-academy/FunASR
-cd FunASR/funasr/runtime/python/grpc/
+### 1. Install the requirements
+```shell
+git clone https://github.com/alibaba/FunASR.git && cd FunASR/funasr/runtime/python/grpc
+pip install -r requirements.txt
 ```

-
-Step 2) Optional, generate protobuf file (run on server, the two generated pb file are both used for server and client).
-```
-# paraformer_pb2.py and paraformer_pb2_grpc.py are already generated.
-python -m grpc_tools.protoc --proto_path=./proto -I ./proto --python_out=. --grpc_python_out=./ ./proto/paraformer.proto
+### 2. Generate protobuf file
+```shell
+# paraformer_pb2.py and paraformer_pb2_grpc.py are already generated,
+# regenerate it only when you make changes to ./proto/paraformer.proto file.
+python -m grpc_tools.protoc --proto_path=./proto -I ./proto --python_out=. --grpc_python_out=./ ./proto/paraformer.proto
 ```

-Step 3) Start grpc server (on server).
-```
-python grpc_main_server.py --port 10095
-```
-
-Step 4) Start grpc client (on client with microphone).
-
-```
-# Optional, Install dependency.
-python -m pip install pyaudio webrtcvad
-```
+### 3. Start grpc client
 ```
 # Start client.
-python grpc_main_client_mic.py --host 127.0.0.1 --port 10095
+python grpc_main_client.py --host 127.0.0.1 --port 10100 --wav_path /path/to/your_test_wav.wav
 ```

-
-
-## Workflow in desgin
-
-
-
-## Reference
-We borrow from or refer to some code as:
-
-1)https://github.com/wenet-e2e/wenet/tree/main/runtime/core/grpc
-
-2)https://github.com/Open-Speech-EkStep/inference_service/blob/main/realtime_inference_service.py
\ No newline at end of file
+## Acknowledge
+1. This project is maintained by [FunASR community](https://github.com/alibaba-damo-academy/FunASR).
+2. We acknowledge burkliu (刘柏基, liubaiji@xverse.cn) for contributing the grpc service.
-- 
Gitblit v1.9.1
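
Note on the chunking parameters: the new README text says the demo client uses an audio_chunk_duration of 1000ms and a send_interval of 100ms, but the patch does not show how `grpc_main_client.py` applies them. The sketch below is only an illustration of the general idea (split raw PCM into fixed-duration chunks, then hand them to a sender with a pause in between). The function names `split_pcm` and `send_paced`, the 16 kHz / 16-bit mono format, and the `send` callback are all assumptions made for this example; none of them come from the FunASR code.

```python
import time
from typing import Callable, Iterable, Iterator


def split_pcm(pcm: bytes, sample_rate: int = 16000, sample_width: int = 2,
              chunk_ms: int = 1000) -> Iterator[bytes]:
    """Yield consecutive slices of raw mono PCM, each holding chunk_ms of audio.

    The 16 kHz / 2-byte defaults are assumptions, not values from the patch.
    The final slice may be shorter than the others.
    """
    bytes_per_chunk = sample_rate * sample_width * chunk_ms // 1000
    for start in range(0, len(pcm), bytes_per_chunk):
        yield pcm[start:start + bytes_per_chunk]


def send_paced(chunks: Iterable[bytes], send: Callable[[bytes], None],
               send_interval_ms: int = 100,
               sleep: Callable[[float], None] = time.sleep) -> int:
    """Pass each chunk to send(), pausing send_interval_ms between chunks.

    `send` is a hypothetical stand-in for whatever writes to the gRPC stream.
    Returns the number of chunks sent.
    """
    count = 0
    for chunk in chunks:
        if count:  # no pause before the first chunk
            sleep(send_interval_ms / 1000)
        send(chunk)
        count += 1
    return count
```

For instance, `send_paced(split_pcm(pcm_bytes), my_stream_writer)` would forward one second of audio per call to `my_stream_writer`, waiting 100 ms between calls. Passing `sleep` as a parameter keeps the pacing testable without real delays.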