From d29f201e3201bde6a984e436888a2aae877e449f Mon Sep 17 00:00:00 2001 From: 游雁 <zhifu.gzf@alibaba-inc.com> Date: 星期二, 19 三月 2024 12:04:50 +0800 Subject: [PATCH] vad conf --- examples/industrial_data_pretraining/paraformer/README_zh.md | 53 +++++++++++++++++++++++++++++++++++++++++++++++++++++ 1 files changed, 53 insertions(+), 0 deletions(-) diff --git a/examples/industrial_data_pretraining/paraformer/README_zh.md b/examples/industrial_data_pretraining/paraformer/README_zh.md index 8ddb202..38a4455 100644 --- a/examples/industrial_data_pretraining/paraformer/README_zh.md +++ b/examples/industrial_data_pretraining/paraformer/README_zh.md @@ -40,3 +40,56 @@ ```[audio_sample1, audio_sample2, ..., audio_sampleN]``` - fbank杈撳叆锛屾敮鎸佺粍batch銆俿hape涓篬batch, frames, dim]锛岀被鍨嬩负torch.Tensor锛屼緥濡� - `output_dir`: None 锛堥粯璁わ級锛屽鏋滆缃紝杈撳嚭缁撴灉鐨勮緭鍑鸿矾寰� + + +## 寰皟 + +#### 鍑嗗鏁版嵁 + +`train_text.txt` + +宸﹁竟涓烘暟鎹敮涓�ID锛岄渶涓巂train_wav.scp`涓殑`ID`涓�涓�瀵瑰簲 +鍙宠竟涓洪煶棰戞枃浠舵爣娉ㄦ枃鏈� + +```bash +ID0012W0013 褰撳鎴烽闄╂壙鍙楄兘鍔涜瘎浼颁緷鎹彂鐢熷彉鍖栨椂 +ID0012W0014 鏉ㄦ稕涓嶅緱涓嶅皢宸ュ巶鍏虫帀 +``` + + +`train_wav.scp` + +宸﹁竟涓烘暟鎹敮涓�ID锛岄渶涓巂train_text.txt`涓殑`ID`涓�涓�瀵瑰簲 +鍙宠竟涓洪煶棰戞枃浠剁殑缁濆璺緞 + +```bash +ID0012W0013 /Users/zhifu/funasr_github/test_local/aishell2_dev_ios/wav/D0012/ID0012W0013.wav +ID0012W0014 /Users/zhifu/funasr_github/test_local/aishell2_dev_ios/wav/D0012/ID0012W0014.wav +``` + +#### 璁粌 + +```bash +cd examples/industrial_data_pretraining/paraformer +sh finetune_from_local.sh +``` + +**鏌ョ湅璁粌鏃ュ織** + +```bash +tensorboard --logdir /xxxx/FunASR/examples/industrial_data_pretraining/paraformer/outputs/log/tensorboard +``` + + +## 瀵煎嚭onnx + +```python +from funasr import AutoModel +wav_file = "https://isv-data.oss-cn-hangzhou.aliyuncs.com/ics/MaaS/ASR/test_audio/vad_example.wav" + +model = AutoModel(model="iic/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch", + model_revision="v2.0.4") + +res = model.export(input=wav_file, type="onnx", quantize=False) +print(res) +``` \ No newline at end of file -- Gitblit v1.9.1