From 1d1ef01b4e23630a99a3be7e9d1dce9550a793e9 Mon Sep 17 00:00:00 2001 From: yhliang <68215459+yhliang-aslp@users.noreply.github.com> Date: 星期四, 11 五月 2023 16:26:24 +0800 Subject: [PATCH] Merge branch 'main' into dev_smohan --- egs/alimeeting/sa-asr/README.md | 6 +++++- 1 files changed, 5 insertions(+), 1 deletions(-) diff --git a/egs/alimeeting/sa-asr/README.md b/egs/alimeeting/sa-asr/README.md index 5731c39..951670b 100644 --- a/egs/alimeeting/sa-asr/README.md +++ b/egs/alimeeting/sa-asr/README.md @@ -19,7 +19,7 @@ stage 7 - 9: Language model training (Optional). stage 10 - 11: ASR training (SA-ASR requires loading the pre-trained ASR model). stage 12: SA-ASR training. -stage 13 - 18: Inference and evaluation. +stage 13 - 16: Inference and evaluation. ``` Before running `run_m2met_2023_infer.sh`, you need to place the new test set `Test_2023_Ali_far` (to be released after the challenge starts) in the `./dataset` directory, which contains only raw audios. Then put the given `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` in the `./data/Test_2023_Ali_far` directory. ```shell @@ -37,6 +37,10 @@ stage 3: Inference. stage 4: Generation of SA-ASR results required for final submission. ``` + +The baseline model is available on [ModelScope](https://www.modelscope.cn/models/damo/speech_saasr_asr-zh-cn-16k-alimeeting/summary). +After generate stats of AliMeeting corpus(stage 10 in `run.sh`), you can set the `infer_with_pretrained_model=true` in `run.sh` to infer with our official baseline model released on ModelScope without training. + # Format of Final Submission Finally, you need to submit a file called `text_spk_merge` with the following format: ```shell -- Gitblit v1.9.1