python/FunASR-XL.git

			@@ -43,15 +43,15 @@



			#### API-reference
			##### Define pipeline
			### API-reference
			#### Define pipeline
			- `task`: `Tasks.voice_activity_detection`
			- `model`: model name in [model zoo](https://alibaba-damo-academy.github.io/FunASR/en/modelscope_models.html#pretrained-models-on-modelscope), or model path in local disk
			- `ngpu`: `1` (Default), decoding on GPU. If ngpu=0, decoding on CPU
			- `ncpu`: `1` (Default), sets the number of threads used for intraop parallelism on CPU
			- `output_dir`: `None` (Default), the output path of results if set
			- `batch_size`: `1` (Default), batch size when decoding
			##### Infer pipeline
			#### Infer pipeline
			- `audio_in`: the input to decode, which could be:
			- wav_path, `e.g.`: asr_example.wav,
			- pcm_path, `e.g.`: asr_example.pcm,
			@@ -86,7 +86,7 @@
			--model "damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch" \
			--data_dir "./data/test" \
			--output_dir "./results" \
			--batch_size 64 \
			--batch_size 1 \
			--gpu_inference true \
			--gpuid_list "0,1"
			```
			@@ -97,7 +97,7 @@
			--data_dir "./data/test" \
			--output_dir "./results" \
			--gpu_inference false \
			--njob 64
			--njob 1
			```

			## Finetune with pipeline