python/FunASR-XL.git


FunASR training



fix vad max_end_sil bug

hnluo

2023-09-06 602d3b5e2ec26c360648d585140d57b2ffecff5c

 egs/wenetspeech/conformer/local/process_opus.py

@@ -65,7 +65,7 @@
             start = int(start_time_list[i] * sample_rate)
             end = int(end_time_list[i] * sample_rate)
-            target_audio = waveform[:, start:end].transpose(0, 1).contiguous()
+            target_audio = waveform[:, start:end]
             torchaudio.save(seg_wav_path, target_audio, sample_rate)
 
             fout.write("{} {}\n".format(utt_id, seg_wav_path))
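Note on the hunk above: by default torchaudio.save expects a channels-first tensor of shape (channels, time), so a slice taken along the time axis can be passed to it without a transpose. The commit message does not give this rationale, so treat it as a likely reading rather than the author's stated reason. The sketch below reproduces the index arithmetic in plain Python, using nested lists as a stand-in for the tensor. The helper name slice_segment and the toy values are hypothetical and not part of process_opus.py.

```python
def slice_segment(waveform, start_time, end_time, sample_rate):
    """Cut [start_time, end_time) seconds out of a channels-first waveform.

    `waveform` is a list of channels, each a list of samples. It stands in
    for a tensor of shape (channels, time); this helper is illustrative only.
    """
    # Same conversion as in the diff: seconds -> sample index, truncated.
    start = int(start_time * sample_rate)
    end = int(end_time * sample_rate)
    # Slice along the time axis only, so the channel axis stays first.
    return [channel[start:end] for channel in waveform]


sample_rate = 4  # toy value so the indices are easy to follow
waveform = [list(range(8)), list(range(100, 108))]  # 2 channels, 8 samples

segment = slice_segment(waveform, 0.5, 1.5, sample_rate)
print(len(segment), len(segment[0]))  # channels, samples in the segment
```

The result keeps the (channels, time) layout of the input, which is the layout the save call in the diff receives after the change.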
