From 5853ebc98f51c79d0ae2955cefe1457cba78efe4 Mon Sep 17 00:00:00 2001 From: Yabin Li <wucong.lyb@alibaba-inc.com> Date: 星期四, 27 六月 2024 17:38:19 +0800 Subject: [PATCH] Merge Dev blade (#1856) --- runtime/readme.md | 16 +++++++++++++++- 1 files changed, 15 insertions(+), 1 deletions(-) diff --git a/runtime/readme.md b/runtime/readme.md index 28a063d..fb795c9 100644 --- a/runtime/readme.md +++ b/runtime/readme.md @@ -7,9 +7,23 @@ - File transcription service, Mandarin, CPU version, done - The real-time transcription service, Mandarin (CPU), done - File transcription service, English, CPU version, done -- File transcription service, Mandarin, GPU version, in progress +- File transcription service, Mandarin, GPU version, done - and more. +## File Transcription Service, Mandarin (GPU) + +Currently, the FunASR runtime-SDK supports the deployment of file transcription service, Mandarin (GPU version), with a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and supports recognition for more than a hundred concurrent streams. + +To meet the needs of different users, we have prepared different tutorials with text and images for both novice and advanced developers. + +### Whats-new +- 2024/06/27: File Transcription Service 1.0 GPU released, supporting dynamic batch processing and multi-threading concurrency. In the long audio test set, the single-thread RTF is 0.0076, and multi-threads' speedup is 1200+ (compared to 330+ on CPU), ref to([docs](./docs/benchmark_libtorch_cpp.md)) , docker image version funasr-runtime-sdk-gpu-0.1.0 (aa10f938da3b) + +### Advanced Development Guide + +The documentation mainly targets advanced developers who require modifications and customization of the service. It supports downloading model deployments from modelscope and also supports deploying models that users have fine-tuned. For detailed information, please refer to the documentation available by [docs](./docs/SDK_advanced_guide_offline_gpu.md) + + ## File Transcription Service, English (CPU) Currently, the FunASR runtime-SDK supports the deployment of file transcription service, English (CPU version), with a complete speech recognition chain that can transcribe tens of hours of audio into punctuated text, and supports recognition for more than a hundred concurrent streams. -- Gitblit v1.9.1