From d785c5881a95fdcb2397d98838e48c81864ea54e Mon Sep 17 00:00:00 2001 From: 嘉渊 <wangjiaming.wjm@alibaba-inc.com> Date: 星期三, 24 五月 2023 10:16:09 +0800 Subject: [PATCH] update repo --- docs/academic_recipe/asr_recipe.md | 18 +++++++++--------- 1 files changed, 9 insertions(+), 9 deletions(-) diff --git a/docs/academic_recipe/asr_recipe.md b/docs/academic_recipe/asr_recipe.md index ff3ac39..391d60b 100644 --- a/docs/academic_recipe/asr_recipe.md +++ b/docs/academic_recipe/asr_recipe.md @@ -44,18 +44,18 @@ ## Introduction We provide a recipe `egs/aishell/paraformer/run.sh` for training a paraformer model on AISHELL-1 dataset. This recipe consists of five stages, supporting training on multiple GPUs and decoding by CPU or GPU. Before introducing each stage in detail, we first explain several parameters which should be set by users. -- `CUDA_VISIBLE_DEVICES`: visible gpu list -- `gpu_num`: the number of GPUs used for training -- `gpu_inference`: whether to use GPUs for decoding -- `njob`: for CPU decoding, indicating the total number of CPU jobs; for GPU decoding, indicating the number of jobs on each GPU +- `CUDA_VISIBLE_DEVICES`: `0,1` (Default), visible gpu list +- `gpu_num`: `2` (Default), the number of GPUs used for training +- `gpu_inference`: `true` (Default), whether to use GPUs for decoding +- `njob`: `1` (Default), for CPU decoding, indicating the total number of CPU jobs; for GPU decoding, indicating the number of jobs on each GPU - `raw_data`: the raw path of AISHELL-1 dataset - `feats_dir`: the path for saving processed data -- `nj`: the number of jobs for data preparation -- `speed_perturb`: the range of speech perturbed +- `nj`: `64` (Default), the number of jobs for data preparation +- `speed_perturb`: `"0.9, 1.0 ,1.1"` (Default), the range of speech perturbed - `exp_dir`: the path for saving experimental results -- `tag`: the suffix of experimental result directory -- `stage` start the recipe from the specified stage -- `stop_stage` stop the recipe from the specified stage +- `tag`: `exp1` (Default), the suffix of experimental result directory +- `stage` `0` (Default), start the recipe from the specified stage +- `stop_stage` `5` (Default), stop the recipe from the specified stage ### Stage 0: Data preparation This stage processes raw AISHELL-1 dataset `$raw_data` and generates the corresponding `wav.scp` and `text` in `$feats_dir/data/xxx`. `xxx` means `train/dev/test`. Here we assume users have already downloaded AISHELL-1 dataset. If not, users can download data [here](https://www.openslr.org/33/) and set the path for `$raw_data`. The examples of `wav.scp` and `text` are as follows: -- Gitblit v1.9.1