From 5602ebe208639ad8f91899adeddae0a2f1e39f09 Mon Sep 17 00:00:00 2001 From: shixian.shi <shixian.shi@alibaba-inc.com> Date: 星期三, 31 一月 2024 10:07:34 +0800 Subject: [PATCH] code update --- README.md | 4 ++-- 1 files changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 18d02c3..9f34553 100644 --- a/README.md +++ b/README.md @@ -144,7 +144,7 @@ ``` Note: `chunk_size` is the configuration for streaming latency.` [0,10,5]` indicates that the real-time display granularity is `10*60=600ms`, and the lookahead information is `5*60=300ms`. Each inference input is `600ms` (sample points are `16000*0.6=960`), and the output is the corresponding text. For the last speech segment input, `is_final=True` needs to be set to force the output of the last word. -### Voice Activity Detection (streaming) +### Voice Activity Detection (Non-Streaming) ```python from funasr import AutoModel @@ -153,7 +153,7 @@ res = model.generate(input=wav_file) print(res) ``` -### Voice Activity Detection (Non-streaming) +### Voice Activity Detection (Streaming) ```python from funasr import AutoModel -- Gitblit v1.9.1