| | |
| | | ``` |
| | | |
| | | |
| | | ## Install the `funasr_onnx` |
| | | ## Install `funasr_onnx` |
| | | |
| | | install from pip |
| | | ```shell |
| | |
| | | from funasr_onnx import Paraformer |
| | | |
| | | model_dir = "./export/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch" |
| | | model = Paraformer(model_dir, batch_size=1) |
| | | model = Paraformer(model_dir, batch_size=1, quantize=True) |
| | | |
| | | wav_path = ['./export/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/example/asr_example.wav'] |
| | | |
| | | result = model(wav_path) |
| | | print(result) |
| | | ``` |
| | | - Model_dir: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | - Output: `List[str]`: recognition result |
| | | - `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - `batch_size`: `1` (Default), the batch size duration inference |
| | | - `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu) |
| | | - `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir` |
| | | - `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU |
| | | |
| | | Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | |
| | | Output: `List[str]`: recognition result |
| | | |
| | | #### Paraformer-online |
| | | |
| | |
| | | result = model(wav_path) |
| | | print(result) |
| | | ``` |
| | | - Model_dir: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | - Output: `List[str]`: recognition result |
| | | - `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - `batch_size`: `1` (Default), the batch size duration inference |
| | | - `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu) |
| | | - `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir` |
| | | - `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU |
| | | |
| | | Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | |
| | | Output: `List[str]`: recognition result |
| | | |
| | | |
| | | #### FSMN-VAD-online |
| | | ```python |
| | |
| | | if segments_result: |
| | | print(segments_result) |
| | | ``` |
| | | - Model_dir: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | - Output: `List[str]`: recognition result |
| | | - `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - `batch_size`: `1` (Default), the batch size duration inference |
| | | - `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu) |
| | | - `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir` |
| | | - `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU |
| | | |
| | | Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | |
| | | Output: `List[str]`: recognition result |
| | | |
| | | |
| | | ### Punctuation Restoration |
| | | #### CT-Transformer |
| | |
| | | result = model(text_in) |
| | | print(result[0]) |
| | | ``` |
| | | - Model_dir: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | - Output: `List[str]`: recognition result |
| | | - `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu) |
| | | - `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir` |
| | | - `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU |
| | | |
| | | Input: `str`, raw text of asr result |
| | | |
| | | Output: `List[str]`: recognition result |
| | | |
| | | |
| | | #### CT-Transformer-online |
| | | ```python |
| | |
| | | |
| | | print(rec_result_all) |
| | | ``` |
| | | - Model_dir: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - Input: wav formt file, support formats: `str, np.ndarray, List[str]` |
| | | - Output: `List[str]`: recognition result |
| | | - `model_dir`: the model path, which contains `model.onnx`, `config.yaml`, `am.mvn` |
| | | - `device_id`: `-1` (Default), infer on CPU. If you want to infer with GPU, set it to gpu_id (Please make sure that you have install the onnxruntime-gpu) |
| | | - `quantize`: `False` (Default), load the model of `model.onnx` in `model_dir`. If set `True`, load the model of `model_quant.onnx` in `model_dir` |
| | | - `intra_op_num_threads`: `4` (Default), sets the number of threads used for intraop parallelism on CPU |
| | | |
| | | Input: `str`, raw text of asr result |
| | | |
| | | Output: `List[str]`: recognition result |
| | | |
| | | ## Performance benchmark |
| | | |