From 5b2b979634b8cdf70aac0334feda99e6f5a779b8 Mon Sep 17 00:00:00 2001
From: Daniel <znsoft@163.com>
Date: Sun, 23 Apr 2023 08:56:31 +0800
Subject: [PATCH] Merge branch 'alibaba-damo-academy:main' into main

---
 egs_modelscope/vad/TEMPLATE/README.md | 12 ++++++------
 1 files changed, 6 insertions(+), 6 deletions(-)

diff --git a/egs_modelscope/vad/TEMPLATE/README.md b/egs_modelscope/vad/TEMPLATE/README.md
index df45b35..a4b5e79 100644
--- a/egs_modelscope/vad/TEMPLATE/README.md
+++ b/egs_modelscope/vad/TEMPLATE/README.md
@@ -1,7 +1,7 @@
 # Voice Activity Detection
 
 > **Note**:
-> The modelscope pipeline supports all the models in [model zoo](https://alibaba-damo-academy.github.io/FunASR/en/modelscope_models.html#pretrained-models-on-modelscope) to inference and finetine. Here we take model of FSMN-VAD as example to demonstrate the usage.
+> The modelscope pipeline supports all the models in [model zoo](https://alibaba-damo-academy.github.io/FunASR/en/modelscope_models.html#pretrained-models-on-modelscope) to inference and finetine. Here we take the model of FSMN-VAD as example to demonstrate the usage.
 
 ## Inference
 
@@ -47,10 +47,10 @@
 ##### Define pipeline
 - `task`: `Tasks.voice_activity_detection`
 - `model`: model name in [model zoo](https://alibaba-damo-academy.github.io/FunASR/en/modelscope_models.html#pretrained-models-on-modelscope), or model path in local disk
-- `ngpu`: `1` (Defalut), decoding on GPU. If ngpu=0, decoding on CPU
-- `ncpu`: `1` (Defalut), sets the number of threads used for intraop parallelism on CPU
-- `output_dir`: `None` (Defalut), the output path of results if set
-- `batch_size`: `1` (Defalut), batch size when decoding
+- `ngpu`: `1` (Default), decoding on GPU. If ngpu=0, decoding on CPU
+- `ncpu`: `1` (Default), sets the number of threads used for intraop parallelism on CPU
+- `output_dir`: `None` (Default), the output path of results if set
+- `batch_size`: `1` (Default), batch size when decoding
 ##### Infer pipeline
 - `audio_in`: the input to decode, which could be:
   - wav_path, `e.g.`: asr_example.wav,
@@ -64,7 +64,7 @@
   ```
   In this case of `wav.scp` input, `output_dir` must be set to save the output results
 - `audio_fs`: audio sampling rate, only set when audio_in is pcm audio
-- `output_dir`: None (Defalut), the output path of results if set
+- `output_dir`: None (Default), the output path of results if set
 
 ### Inference with multi-thread CPUs or multi GPUs
 FunASR also offer recipes [infer.sh](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs_modelscope/vad/TEMPLATE/infer.sh) to decode with multi-thread CPUs, or multi GPUs.
-- 
Gitblit v1.9.1