From aaf9d88a79f182852b38c61babcd12132235420a Mon Sep 17 00:00:00 2001 From: speech_asr <wangjiaming.wjm@alibaba-inc.com> Date: 星期二, 14 二月 2023 19:32:48 +0800 Subject: [PATCH] update docs --- docs/get_started.md | 2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/docs/get_started.md b/docs/get_started.md index c62a610..b8b9996 100644 --- a/docs/get_started.md +++ b/docs/get_started.md @@ -56,7 +56,7 @@ These two files have two columns. The first column is wav ids and the second column is the corresponding speech feature shape and text length. ## Stage 2: Dictionary Preparation -This stage prepares a dictionary, which is used as a mapping between label characters and integer indices during ASR training. The output dictionary file is saved as `$feats_dir/data/$lang_toekn_list/$token_type/tokens.txt`. Here we show an example of `tokens.txt` as follows: +This stage processes the dictionary, which is used as a mapping between label characters and integer indices during ASR training. The processed dictionary file is saved as `$feats_dir/data/$lang_toekn_list/$token_type/tokens.txt`. An example of `tokens.txt` is as follows: * `tokens.txt` ``` <blank> -- Gitblit v1.9.1