From ebc6b46c58ab13cbafb23c0926acc4e351f2d1d4 Mon Sep 17 00:00:00 2001
From: onlybetheone <iriszhangchong@gmail.com>
Date: Wed, 15 Feb 2023 20:37:57 +0800
Subject: [PATCH] add infer decoding model param
---
 README.md | 19 +++++++++++++++----
 1 files changed, 15 insertions(+), 4 deletions(-)
diff --git a/README.md b/README.md
index 1ac3f6e..8c4485f 100644
--- a/README.md
+++ b/README.md
@@ -1,10 +1,21 @@
-<div align="left"><img src="docs/images/funasr_logo.jpg" width="400"/></div>
+[//]: # (<div align="left"><img src="docs/images/funasr_logo.jpg" width="400"/></div>)
 # FunASR: A Fundamental End-to-End Speech Recognition Toolkit
 <strong>FunASR</strong> hopes to build a bridge between academic research and industrial applications on speech recognition. By supporting the training & finetuning of the industrial-grade speech recognition model released on [ModelScope](https://www.modelscope.cn/models?page=1&tasks=auto-speech-recognition), researchers and developers can conduct research and production of speech recognition models more conveniently, and promote the development of speech recognition ecology.
 ASR for Fun！
-## Release Notes:
+[**News**](https://github.com/alibaba-damo-academy/FunASR#whats-new)
+| [**Highlights**](#highlights)
+| [**Installation**](#installation)
+| [**Docs_CN**](https://alibaba-damo-academy.github.io/FunASR/cn/index.html)
+| [**Docs_EN**](https://alibaba-damo-academy.github.io/FunASR/en/index.html)
+| [**Tutorial**](https://github.com/alibaba-damo-academy/FunASR/wiki#funasr%E7%94%A8%E6%88%B7%E6%89%8B%E5%86%8C)
+| [**Papers**](https://github.com/alibaba-damo-academy/FunASR#citations)
+| [**Runtime**](https://github.com/alibaba-damo-academy/FunASR/tree/main/funasr/runtime)
+| [**Model Zoo**](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary)
+| [**Contact**](#contact)
+
+## What's new:
 ### 2023.1.16, funasr-0.1.6
 - We release a new version model [Paraformer-large-long](https://modelscope.cn/models/damo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary), which integrate the [VAD](https://modelscope.cn/models/damo/speech_fsmn_vad_zh-cn-16k-common-pytorch/summary) model, [ASR](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary), [Punctuation](https://www.modelscope.cn/models/damo/punc_ct-transformer_zh-cn-common-vocab272727-pytorch/summary) model and timestamp together. The model could take in several hours long inputs.
@@ -16,7 +27,7 @@
 - We improve the pipeline of modelscope to speedup the inference, by integrating the process of build model into build pipeline.
 - Various new types of audio input types are now supported by modelscope inference pipeline, including wav.scp, wav format, audio bytes, wave samples...
-## Key Features
+## Highlights
 - Many types of typical models are supported, e.g., [Tranformer](https://arxiv.org/abs/1706.03762), [Conformer](https://arxiv.org/abs/2005.08100), [Paraformer](https://arxiv.org/abs/2206.08317).
 - We have released large number of academic and industrial pretrained models on [ModelScope](https://www.modelscope.cn/models?page=1&tasks=auto-speech-recognition)
 - The pretrained model [Paraformer-large](https://www.modelscope.cn/models/damo/speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch/summary) obtains the best performance on many tasks in [SpeechIO leaderboard](https://github.com/SpeechColab/Leaderboard)
@@ -32,7 +43,7 @@
 For more details, please ref to [installation](https://github.com/alibaba-damo-academy/FunASR/wiki)
 ## Usage
-For users who are new to FunASR and ModelScope, please refer to [FunASR Docs](https://alibaba-damo-academy.github.io/FunASR/index.html).
+For users who are new to FunASR and ModelScope, please refer to FunASR Docs([CN](https://alibaba-damo-academy.github.io/FunASR/cn/index.html) / [EN](https://alibaba-damo-academy.github.io/FunASR/en/index.html))
 ## Contact
-- Gitblit v1.9.1