From 3cd3473bf7a3b41484baa86d9092248d78e7af39 Mon Sep 17 00:00:00 2001
From: 游雁 <zhifu.gzf@alibaba-inc.com>
Date: 星期五, 21 四月 2023 17:17:37 +0800
Subject: [PATCH] docs

---
 docs/papers.md |    5 +++--
 1 files changed, 3 insertions(+), 2 deletions(-)

diff --git a/docs/papers.md b/docs/papers.md
index 2fcfff1..33bf72f 100644
--- a/docs/papers.md
+++ b/docs/papers.md
@@ -2,7 +2,7 @@
 
 FunASR have implemented the following paper code
 
-### Speech Recognition Models
+### Speech Recognition
 - [Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition](https://arxiv.org/abs/2206.08317), INTERSPEECH 2022.
 - [Universal ASR: Unifying Streaming and Non-Streaming ASR Using a Single Encoder-Decoder Model](https://arxiv.org/abs/2010.14099), arXiv preprint arXiv:2010.14099, 2020.
 - [San-m: Memory equipped self-attention for end-to-end speech recognition](https://arxiv.org/pdf/2006.01713), INTERSPEECH 2020
@@ -11,7 +11,7 @@
 - [Sequence-to-sequence learning with Transducers](https://arxiv.org/pdf/1211.3711.pdf), NIPS 2016
 
 
-### Multi-talker Speech Recognition Models
+### Multi-talker Speech Recognition
 - [MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario](https://arxiv.org/abs/2210.05265), ICASSP 2022
 
 ### Voice Activity Detection
@@ -28,6 +28,7 @@
 
 ### Speaker diarization
 - [Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis](https://arxiv.org/abs/2211.10243), EMNLP 2022
+- [TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization](https://arxiv.org/abs/2303.05397), ICASSP 2023
 
 ### Timestamp Prediction
 - [Achieving Timestamp Prediction While Recognizing with Non-Autoregressive End-to-End ASR Model](https://arxiv.org/abs/2301.12343), arXiv:2301.12343

--
Gitblit v1.9.1