| | |
| | | |
| | | ## What's new: |
| | | ### Multi-Channel Multi-Party Meeting Transcription 2.0 (M2MET2.0) Challenge |
| | | We are pleased to announce that the M2MeT2.0 challenge will be held in the near future. The baseline system is built on FunASR and is provided as a recipe for the AliMeeting corpus. For more details, see the M2MET2.0 guidance ([CN](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/index.html)/[EN](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)). |
| | | We are pleased to announce that the M2MeT2.0 challenge has been accepted by the ASRU2023 challenge special session. Registration is now open. The baseline system is built on FunASR and is provided as a recipe for the AliMeeting corpus. For more details, see the M2MET2.0 guidance ([CN](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/index.html)/[EN](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)). |
| | | ### Release notes |
| | | For the release notes, please refer to [news](https://github.com/alibaba-damo-academy/FunASR/releases). |
| | | |
| | |
| | | # Baseline |
| | | ## Overview |
| | | We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline built on [FunASR](https://github.com/alibaba-damo-academy/FunASR) according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile. |
| | | We will release an E2E SA-ASR baseline built on [FunASR](https://github.com/alibaba-damo-academy/FunASR) according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile. |
| | | |
| | |  |
| | | |
| | | ## Quick start |
| | | #TODO: fill with the README.md of the baseline |
| | | To run the baseline, first you need to install FunASR and ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html)) |
| | | There are two startup scripts, `run.sh` for training and evaluating on the old eval and test sets, and `run_m2met_2023_infer.sh` for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 ([M2MET2.0](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)) Challenge. |
| | | Before running `run.sh`, you must manually download and unpack the [AliMeeting](http://www.openslr.org/119/) corpus and place it in the `./dataset` directory: |
| | | ```shell |
| | | dataset |
| | | |── Eval_Ali_far |
| | | |── Eval_Ali_near |
| | | |── Test_Ali_far |
| | | |── Test_Ali_near |
| | | |── Train_Ali_far |
| | | |── Train_Ali_near |
| | | ``` |
| | | Before running `run_m2met_2023_infer.sh`, you need to place the new test set `Test_2023_Ali_far` (to be released after the challenge starts), which contains only raw audio, in the `./dataset` directory. Then put the provided `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` in the `./data/Test_2023_Ali_far` directory: |
| | | ```shell |
| | | data/Test_2023_Ali_far |
| | | |── wav.scp |
| | | |── wav_raw.scp |
| | | |── segments |
| | | |── utt2spk |
| | | |── spk2utt |
| | | ``` |
| | | For more details, see the [baseline README](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md). |
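The directory layouts described in the quick start can be checked before launching either script. The following is a minimal sketch, not part of FunASR: the function name `check_layout`, the two variable names, and the output messages are hypothetical, and only the directory and file names come from the listings above.

```shell
#!/bin/sh
# Sketch: verify the layout the quick start expects before running the baseline.
# check_layout, ALIMEETING_DIRS and TEST_2023_FILES are illustrative names,
# not identifiers defined by FunASR.

# Subdirectories of ./dataset listed for run.sh.
ALIMEETING_DIRS="Eval_Ali_far Eval_Ali_near Test_Ali_far Test_Ali_near Train_Ali_far Train_Ali_near"
# Files of ./data/Test_2023_Ali_far listed for run_m2met_2023_infer.sh.
TEST_2023_FILES="wav.scp wav_raw.scp segments utt2spk spk2utt"

# check_layout ROOT NAME...
# Prints one line per missing entry and returns 1 if anything is missing.
check_layout() {
    root=$1
    shift
    missing=0
    for name in "$@"; do
        if [ ! -e "$root/$name" ]; then
            echo "missing: $root/$name"
            missing=1
        fi
    done
    return "$missing"
}

# Example usage (word splitting of the lists is intentional):
#   check_layout ./dataset $ALIMEETING_DIRS && echo "dataset layout OK"
#   check_layout ./data/Test_2023_Ali_far $TEST_2023_FILES && echo "test-set metadata OK"
```

The check only tests that each entry exists; it does not validate the contents of the corpus or of the metadata files.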
| | | |
| | | ## Baseline results |
| | | The results of the baseline system are shown in Table 3. During training, the speaker profile uses the oracle speaker embeddings. Because oracle speaker labels are not available during evaluation, the speaker profile is instead obtained from an additional spectral clustering step. Results with the oracle speaker profile on the Eval and Test sets are also provided to show the impact of speaker profile accuracy. |
| | |
| | | - $ May~8, 2023: $ Baseline release. |
| | | - $ May~15, 2023: $ Registration deadline, the due date for participants to join the Challenge. |
| | | - $ June~9, 2023: $ Test data release and leaderboard open. |
| | | - $ June~13, 2023: $ Final submission deadline. |
| | | - $ June~13, 2023: $ Final submission deadline and leaderboard close. |
| | | - $ June~19, 2023: $ Evaluation result and ranking release. |
| | | - $ July~3, 2023: $ Deadline for paper submission. |
| | | - $ July~10, 2023: $ Deadline for final paper submission. |
| | | - $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge session |
| | | - $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge Session |
| | | |
| | | ## Guidelines |
| | | |
| | | Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023. |
| | | Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the [WeChat group](https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html) of M2MET2.0 to keep up to date with the latest news about the challenge. |
| | | |
| | | [M2MET2.0 Registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link) |
| | | |
| | |
| | | # Sphinx build info version 1 |
| | | # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. |
| | | config: 9907eab6bf227ca0fc6db297f26919da |
| | | config: a62852d90c3e533904d811bbf85f977d |
| | | tags: 645f666f9bcd5a90fca523b33c5a78b7 |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Baseline — m2met2 documentation</title> |
| | | <title>Baseline — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Track_setting_and_evaluation.html" title="Track & Evaluation" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Baseline</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <h1>Baseline<a class="headerlink" href="#baseline" title="Permalink to this heading">¶</a></h1> |
| | | <section id="overview"> |
| | | <h2>Overview<a class="headerlink" href="#overview" title="Permalink to this heading">¶</a></h2> |
| | | <p>We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline built on <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR">FunASR</a> according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile.</p> |
| | | <p>We will release an E2E SA-ASR baseline built on <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR">FunASR</a> according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile.</p> |
| | | <p><img alt="model architecture" src="_images/sa_asr_arch.png" /></p> |
| | | </section> |
| | | <section id="quick-start"> |
| | | <h2>Quick start<a class="headerlink" href="#quick-start" title="Permalink to this heading">¶</a></h2> |
| | | <p>#TODO: fill with the README.md of the baseline</p> |
| | | <p>To run the baseline, first you need to install FunASR and ModelScope. (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/en/installation.html">installation</a>)<br /> |
| | | There are two startup scripts, <code class="docutils literal notranslate"><span class="pre">run.sh</span></code> for training and evaluating on the old eval and test sets, and <code class="docutils literal notranslate"><span class="pre">run_m2met_2023_infer.sh</span></code> for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html">M2MET2.0</a>) Challenge.<br /> |
| | | Before running <code class="docutils literal notranslate"><span class="pre">run.sh</span></code>, you must manually download and unpack the <a class="reference external" href="http://www.openslr.org/119/">AliMeeting</a> corpus and place it in the <code class="docutils literal notranslate"><span class="pre">./dataset</span></code> directory:</p> |
| | | <div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>dataset |
| | | <span class="p">|</span>──<span class="w"> </span>Eval_Ali_far |
| | | <span class="p">|</span>──<span class="w"> </span>Eval_Ali_near |
| | | <span class="p">|</span>──<span class="w"> </span>Test_Ali_far |
| | | <span class="p">|</span>──<span class="w"> </span>Test_Ali_near |
| | | <span class="p">|</span>──<span class="w"> </span>Train_Ali_far |
| | | <span class="p">|</span>──<span class="w"> </span>Train_Ali_near |
| | | Before<span class="w"> </span>running<span class="w"> </span><span class="sb">`</span>run_m2met_2023_infer.sh<span class="sb">`</span>,<span class="w"> </span>you<span class="w"> </span>need<span class="w"> </span>to<span class="w"> </span>place<span class="w"> </span>the<span class="w"> </span>new<span class="w"> </span><span class="nb">test</span><span class="w"> </span><span class="nb">set</span><span class="w"> </span><span class="sb">`</span>Test_2023_Ali_far<span class="sb">`</span><span class="w"> </span><span class="o">(</span>to<span class="w"> </span>be<span class="w"> </span>released<span class="w"> </span>after<span class="w"> </span>the<span class="w"> </span>challenge<span class="w"> </span>starts<span class="o">)</span><span class="w"> </span><span class="k">in</span><span class="w"> </span>the<span class="w"> </span><span class="sb">`</span>./dataset<span class="sb">`</span><span class="w"> </span>directory,<span class="w"> </span>which<span class="w"> </span>contains<span class="w"> </span>only<span class="w"> </span>raw<span class="w"> </span>audios.<span class="w"> </span>Then<span class="w"> </span>put<span class="w"> </span>the<span class="w"> </span>given<span class="w"> </span><span class="sb">`</span>wav.scp<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>wav_raw.scp<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>segments<span class="sb">`</span>,<span class="w"> </span><span class="sb">`</span>utt2spk<span class="sb">`</span><span class="w"> </span>and<span class="w"> </span><span class="sb">`</span>spk2utt<span class="sb">`</span><span class="w"> </span><span class="k">in</span><span class="w"> </span>the<span class="w"> </span><span class="sb">`</span>./data/Test_2023_Ali_far<span class="sb">`</span><span class="w"> </span>directory.<span class="w"> </span> |
| | | <span class="sb">```</span>shell |
| | | data/Test_2023_Ali_far |
| | | <span class="p">|</span>──<span class="w"> </span>wav.scp |
| | | <span class="p">|</span>──<span class="w"> </span>wav_raw.scp |
| | | <span class="p">|</span>──<span class="w"> </span>segments |
| | | <span class="p">|</span>──<span class="w"> </span>utt2spk |
| | | <span class="p">|</span>──<span class="w"> </span>spk2utt |
| | | </pre></div> |
| | | </div> |
| | | <p>For more details you can see <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md">here</a></p> |
| | | </section> |
| | | <section id="baseline-results"> |
| | | <h2>Baseline results<a class="headerlink" href="#baseline-results" title="Permalink to this heading">¶</a></h2> |
| | |
| | | <li class="right" > |
| | | <a href="Track_setting_and_evaluation.html" title="Track & Evaluation" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Baseline</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Contact — m2met2 documentation</title> |
| | | <title>Contact — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Organizers.html" title="Organizers" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Contact</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Organizers.html" title="Organizers" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Contact</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Datasets — m2met2 documentation</title> |
| | | <title>Datasets — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Introduction.html" title="Introduction" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Datasets</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Introduction.html" title="Introduction" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Datasets</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Introduction — m2met2 documentation</title> |
| | | <title>Introduction — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="index.html" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Introduction</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li><p><span class="math notranslate nohighlight">\( May~8, 2023: \)</span> Baseline release.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( May~15, 2023: \)</span> Registration deadline, the due date for participants to join the Challenge.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( June~9, 2023: \)</span> Test data release and leaderboard open.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( June~13, 2023: \)</span> Final submission deadline.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( June~13, 2023: \)</span> Final submission deadline and leaderboard close.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( June~19, 2023: \)</span> Evaluation result and ranking release.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( July~3, 2023: \)</span> Deadline for paper submission.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( July~10, 2023: \)</span> Deadline for final paper submission.</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( December~12\ to\ 16, 2023: \)</span> ASRU Workshop and challenge session</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( December~12\ to\ 16, 2023: \)</span> ASRU Workshop and challenge Session</p></li> |
| | | </ul> |
| | | </section> |
| | | <section id="guidelines"> |
| | | <h2>Guidelines<a class="headerlink" href="#guidelines" title="Permalink to this heading">¶</a></h2> |
| | | <p>Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023.</p> |
| | | <p>Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the <a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html">WeChat group</a> of M2MET2.0 to keep up to date with the latest news about the challenge.</p> |
| | | <p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MET2.0 Registration</a></p> |
| | | <p>Within three working days, the challenge organizer will send email invitations to eligible teams to participate in the challenge. All qualified teams are required to adhere to the challenge rules, which will be published on the challenge page. Prior to the ranking release time, each participant must submit a system description document detailing their approach and methods. The organizer will select the top ranking submissions to be included in the ASRU2023 Proceedings.</p> |
| | | </section> |
| | |
| | | <li class="right" > |
| | | <a href="index.html" title="ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Introduction</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Organizers — m2met2 documentation</title> |
| | | <title>Organizers — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Rules.html" title="Rules" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Organizers</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Rules.html" title="Rules" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Organizers</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Rules — m2met2 documentation</title> |
| | | <title>Rules — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Baseline.html" title="Baseline" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Rules</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Baseline.html" title="Baseline" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Rules</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Track & Evaluation — m2met2 documentation</title> |
| | | <title>Track & Evaluation — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Dataset.html" title="Datasets" |
| | | accesskey="P">previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Track & Evaluation</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Dataset.html" title="Datasets" |
| | | >previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Track & Evaluation</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | # Baseline |
| | | ## Overview |
| | | We will release an E2E SA-ASR~\cite{kanda21b_interspeech} baseline built on [FunASR](https://github.com/alibaba-damo-academy/FunASR) according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile. |
| | | We will release an E2E SA-ASR baseline built on [FunASR](https://github.com/alibaba-damo-academy/FunASR) according to the timeline. The model architecture is shown in Figure 3. The SpeakerEncoder is initialized with a pre-trained speaker verification model from ModelScope. This speaker verification model is also used to extract the speaker embeddings in the speaker profile. |
| | | |
| | |  |
| | | |
| | | ## Quick start |
| | | #TODO: fill with the README.md of the baseline |
| | | To run the baseline, first you need to install FunASR and ModelScope. ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html)) |
| | | There are two startup scripts, `run.sh` for training and evaluating on the old eval and test sets, and `run_m2met_2023_infer.sh` for inference on the new test set of the Multi-Channel Multi-Party Meeting Transcription 2.0 ([M2MET2.0](https://alibaba-damo-academy.github.io/FunASR/m2met2/index.html)) Challenge. |
| | | Before running `run.sh`, you must manually download and unpack the [AliMeeting](http://www.openslr.org/119/) corpus and place it in the `./dataset` directory: |
| | | ```shell |
| | | dataset |
| | | |── Eval_Ali_far |
| | | |── Eval_Ali_near |
| | | |── Test_Ali_far |
| | | |── Test_Ali_near |
| | | |── Train_Ali_far |
| | | |── Train_Ali_near |
| | | ``` |
| | | Before running `run_m2met_2023_infer.sh`, you need to place the new test set `Test_2023_Ali_far` (to be released after the challenge starts), which contains only raw audio, in the `./dataset` directory. Then put the provided `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` in the `./data/Test_2023_Ali_far` directory: |
| | | ```shell |
| | | data/Test_2023_Ali_far |
| | | |── wav.scp |
| | | |── wav_raw.scp |
| | | |── segments |
| | | |── utt2spk |
| | | |── spk2utt |
| | | ``` |
| | | For more details, see the [baseline README](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md). |
| | | |
| | | ## Baseline results |
| | | The results of the baseline system are shown in Table 3. During training, the speaker profile uses the oracle speaker embeddings. Because oracle speaker labels are not available during evaluation, the speaker profile is instead obtained from an additional spectral clustering step. Results with the oracle speaker profile on the Eval and Test sets are also provided to show the impact of speaker profile accuracy. |
| | |
| | | - $ May~8, 2023: $ Baseline release. |
| | | - $ May~15, 2023: $ Registration deadline, the due date for participants to join the Challenge. |
| | | - $ June~9, 2023: $ Test data release and leaderboard open. |
| | | - $ June~13, 2023: $ Final submission deadline. |
| | | - $ June~13, 2023: $ Final submission deadline and leaderboard close. |
| | | - $ June~19, 2023: $ Evaluation result and ranking release. |
| | | - $ July~3, 2023: $ Deadline for paper submission. |
| | | - $ July~10, 2023: $ Deadline for final paper submission. |
| | | - $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge session |
| | | - $ December~12\ to\ 16, 2023: $ ASRU Workshop and challenge Session |
| | | |
| | | ## Guidelines |
| | | |
| | | Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 15, 2023. |
| | | Interested participants, whether from academia or industry, must register for the challenge by completing the Google form below. The deadline for registration is May 22, 2023. Participants are also welcome to join the [WeChat group](https://alibaba-damo-academy.github.io/FunASR/m2met2/Contact.html) of M2MET2.0 to keep up to date with the latest news about the challenge. |
| | | |
| | | [M2MET2.0 Registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link) |
| | | |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Index — m2met2 documentation</title> |
| | | <title>Index — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="#" title="General Index" |
| | | accesskey="I">index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Index</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="#" title="General Index" |
| | | >index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Index</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0) — m2met2 documentation</title> |
| | | <title>ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0) — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="Introduction.html" title="Introduction" |
| | | accesskey="N">next</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="#">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | #" class="text-logo">m2met2 documentation</a> |
| | | #" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="Introduction.html" title="Introduction" |
| | | >next</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="#">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | # Sphinx inventory version 2 |
| | | # Project: m2met2 |
| | | # Project: MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 |
| | | # Version: |
| | | # The remainder of this file is compressed using zlib. |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Search — m2met2 documentation</title> |
| | | <title>Search — MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="genindex.html" title="General Index" |
| | | accesskey="I">index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Search</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-toc"> |
| | | |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="genindex.html" title="General Index" |
| | | >index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Search</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | Search.setIndex({"docnames": ["Baseline", "Contact", "Dataset", "Introduction", "Organizers", "Rules", "Track_setting_and_evaluation", "index"], "filenames": ["Baseline.md", "Contact.md", "Dataset.md", "Introduction.md", "Organizers.md", "Rules.md", "Track_setting_and_evaluation.md", "index.rst"], "titles": ["Baseline", "Contact", "Datasets", "Introduction", "Organizers", "Rules", "Track & Evaluation", "ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)"], "terms": {"we": [0, 2, 3, 7], "releas": [0, 2, 3, 6], "an": [0, 2, 3, 6], "e2": 0, "sa": 0, "asr": [0, 3, 7], "cite": 0, "kanda21b_interspeech": 0, "conduct": [0, 2], "funasr": 0, "time": [0, 6], "accord": [0, 3], "timelin": [0, 2], "The": [0, 2, 3, 5, 6], "model": [0, 2, 3, 5, 6], "architectur": 0, "i": [0, 2, 3, 5], "shown": [0, 2], "figur": [0, 6], "3": [0, 2, 3], "speakerencod": 0, "initi": 0, "pre": [0, 6], "train": [0, 3, 5, 7], "speaker": [0, 2, 3, 7], "verif": 0, "from": [0, 2, 3, 5, 6], "modelscop": [0, 6], "thi": [0, 3, 5, 6], "also": [0, 2, 6], "us": [0, 2, 5, 6], "extract": 0, "embed": 0, "profil": 0, "todo": 0, "fill": 0, "readm": 0, "md": 0, "system": [0, 3, 5, 6, 7], "ar": [0, 2, 3, 5, 6, 7], "tabl": [0, 2], "adopt": 0, "oracl": [0, 6], "dure": [0, 2, 6], "howev": [0, 3, 6], "due": [0, 3], "lack": 0, "label": [0, 5, 6], "evalu": [0, 2, 3, 7], "provid": [0, 2, 6, 7], "addit": [0, 6], "spectral": 0, "cluster": 0, "meanwhil": 0, "eval": [0, 2, 5, 6], "test": [0, 2, 3, 5, 6], "set": [0, 2, 3, 5, 6], "show": 0, "impact": 0, "accuraci": [0, 6], "If": [1, 5, 6], "you": 1, "have": [1, 3], "ani": [1, 5, 6], "question": 1, "about": 1, "m2met2": [1, 3], "0": [1, 2, 3], "challeng": [1, 3, 5, 6], "pleas": 1, "u": [1, 2], "email": [1, 3, 4], "m2met": [1, 3, 6, 7], "alimeet": [1, 6], "gmail": 1, "com": [1, 4], "wechat": 1, "group": [1, 2], "In": [2, 3, 5], "fix": [2, 3, 7], "condit": [2, 3, 7], "restrict": 2, "three": [2, 3, 6], "publicli": [2, 6], "avail": [2, 6], "corpora": 
2, "name": 2, "aishel": [2, 4, 6], "4": [2, 6], "cn": [2, 4, 6], "celeb": [2, 6], "To": [2, 3, 7], "perform": [2, 3], "new": [2, 3, 6], "call": 2, "2023": [2, 3, 5, 6], "score": [2, 6], "rank": [2, 3, 6], "describ": 2, "contain": [2, 6], "118": 2, "75": 2, "hour": [2, 3, 6], "speech": [2, 3, 6, 7], "total": [2, 6], "divid": [2, 6], "104": 2, "10": [2, 3, 6], "specif": [2, 6], "212": 2, "8": [2, 3], "20": 2, "session": [2, 3, 6, 7], "respect": 2, "each": [2, 3, 6], "consist": [2, 6], "15": [2, 3], "30": 2, "minut": 2, "discuss": 2, "particip": [2, 5, 6], "number": [2, 3, 6], "456": 2, "25": 2, "60": 2, "balanc": 2, "gender": 2, "coverag": 2, "collect": 2, "13": [2, 3], "meet": [2, 3, 6], "venu": 2, "which": [2, 3, 6], "categor": 2, "type": 2, "small": 2, "medium": 2, "larg": [2, 3], "room": [2, 3], "size": 2, "rang": 2, "m": 2, "2": [2, 6], "55": 2, "differ": [2, 3, 6], "give": 2, "varieti": 2, "acoust": [2, 3, 6], "properti": 2, "layout": 2, "paramet": [2, 5], "togeth": 2, "wall": 2, "materi": 2, "cover": 2, "cement": 2, "glass": 2, "etc": 2, "other": 2, "furnish": 2, "includ": [2, 3, 5, 6], "sofa": 2, "tv": 2, "blackboard": 2, "fan": 2, "air": 2, "condition": 2, "plant": 2, "record": [2, 6], "sit": 2, "around": 2, "microphon": [2, 3], "arrai": [2, 3], "place": 2, "natur": 2, "convers": 2, "distanc": 2, "5": 2, "all": [2, 3, 5, 6], "nativ": 2, "chines": 2, "speak": [2, 3], "mandarin": [2, 3], "without": 2, "strong": 2, "accent": 2, "variou": [2, 3], "kind": 2, "indoor": 2, "nois": [2, 3, 5], "limit": [2, 3, 5], "click": 2, "keyboard": 2, "door": 2, "open": [2, 3, 7], "close": 2, "bubbl": 2, "made": [2, 3], "For": 2, "both": [2, 6], "requir": [2, 3, 6], "remain": [2, 3], "same": [2, 5], "posit": 2, "There": 2, "overlap": [2, 3], "between": [2, 6], "exampl": 2, "fig": 2, "1": 2, "within": [2, 3], "one": [2, 5], "ensur": 2, "ratio": 2, "select": [2, 3, 5, 6], "topic": 2, "medic": 2, "treatment": 2, "educ": 2, "busi": 2, "organ": [2, 3, 5, 6, 7], "manag": 2, 
"industri": [2, 3], "product": 2, "daili": 2, "routin": 2, "averag": 2, "42": 2, "27": 2, "34": 2, "76": 2, "more": 2, "A": [2, 4], "distribut": 2, "were": 2, "ident": [2, 6], "compris": [2, 3, 7], "therebi": 2, "share": 2, "similar": 2, "configur": 2, "field": [2, 3, 6], "signal": [2, 3], "headset": 2, "onli": [2, 5, 6], "": [2, 6], "own": 2, "transcrib": [2, 3, 6], "It": [2, 6], "worth": [2, 6], "note": [2, 6], "far": [2, 3], "audio": [2, 3, 6], "synchron": 2, "common": 2, "transcript": [2, 3, 5, 6], "prepar": 2, "textgrid": 2, "format": 2, "inform": [2, 3], "durat": 2, "id": 2, "segment": [2, 6], "timestamp": [2, 6], "mention": 2, "abov": 2, "can": [2, 3, 5, 6], "download": 2, "openslr": 2, "via": 2, "follow": [2, 5], "link": 2, "particularli": 2, "baselin": [2, 3, 7], "conveni": 2, "script": 2, "automat": [3, 7], "recognit": [3, 7], "diariz": 3, "signific": 3, "stride": 3, "recent": 3, "year": 3, "result": 3, "surg": 3, "technologi": 3, "applic": 3, "across": 3, "domain": 3, "present": 3, "uniqu": [3, 6], "complex": [3, 5], "divers": 3, "style": 3, "variabl": 3, "confer": 3, "environment": 3, "reverber": [3, 5], "over": 3, "sever": 3, "been": 3, "advanc": [3, 7], "develop": [3, 6], "rich": 3, "comput": [3, 5], "hear": 3, "multisourc": 3, "environ": 3, "chime": 3, "latest": 3, "iter": 3, "ha": 3, "particular": 3, "focu": 3, "distant": 3, "gener": 3, "topologi": 3, "scenario": 3, "while": 3, "progress": 3, "english": 3, "languag": [3, 5], "barrier": 3, "achiev": 3, "compar": 3, "non": 3, "multimod": 3, "base": 3, "process": [3, 6], "misp": 3, "multi": [3, 6], "channel": 3, "parti": [3, 6], "instrument": 3, "seek": 3, "address": 3, "problem": 3, "visual": 3, "everydai": 3, "home": 3, "focus": 3, "tackl": 3, "issu": 3, "offlin": 3, "icassp2022": 3, "two": [3, 5, 7], "main": 3, "task": [3, 6, 7], "former": 3, "involv": [3, 6], "identifi": 3, "who": 3, "spoke": 3, "when": 3, "latter": 3, "aim": 3, "multipl": [3, 6], "simultan": 3, "pose": [3, 6], "technic": 3, 
"difficulti": 3, "interfer": 3, "build": [3, 6, 7], "success": [3, 7], "previou": 3, "excit": 3, "propos": [3, 7], "asru2023": [3, 7], "special": [3, 5, 7], "origin": [3, 5], "metric": [3, 7], "wa": [3, 6], "independ": 3, "meant": 3, "could": 3, "determin": 3, "correspond": [3, 5], "further": 3, "current": [3, 7], "talker": [3, 7], "toward": 3, "practic": 3, "attribut": [3, 7], "sub": [3, 5, 7], "track": [3, 5, 7], "what": 3, "facilit": [3, 7], "reproduc": [3, 7], "research": [3, 4, 7], "offer": 3, "comprehens": [3, 7], "overview": [3, 7], "dataset": [3, 5, 6, 7], "rule": [3, 7], "furthermor": 3, "carefulli": 3, "curat": 3, "approxim": [3, 6], "design": 3, "enabl": 3, "valid": 3, "state": [3, 6, 7], "art": [3, 7], "area": 3, "april": 3, "29": 3, "registr": 3, "mai": 3, "deadlin": 3, "date": 3, "join": 3, "june": 3, "9": 3, "data": [3, 5, 6], "leaderboard": 3, "final": [3, 5, 6], "submiss": 3, "19": 3, "juli": 3, "paper": [3, 6], "decemb": 3, "12": 3, "16": 3, "asru": 3, "workshop": 3, "interest": 3, "whether": 3, "academia": 3, "must": [3, 5, 6], "regist": 3, "complet": 3, "googl": 3, "form": 3, "below": 3, "work": 3, "dai": 3, "send": 3, "invit": 3, "elig": [3, 5], "team": 3, "qualifi": 3, "adher": [3, 5], "publish": 3, "page": 3, "prior": 3, "submit": 3, "descript": [3, 6], "document": 3, "detail": [3, 6], "approach": [3, 5], "method": 3, "top": 3, "proceed": 3, "lei": 4, "xie": 4, "professor": 4, "foundat": 4, "china": 4, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 4, "aik": 4, "lee": 4, "senior": 4, "scientist": 4, "institut": 4, "infocomm": 4, "star": 4, "singapor": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yan": 4, "princip": 4, "engin": 4, "alibaba": 4, "yzj": 4, "inc": 4, "shiliang": 4, "zhang": 4, "sly": 4, "zsl": 4, "yanmin": 4, "qian": 4, "shanghai": 4, "jiao": 4, "tong": 4, "univers": 4, "yanminqian": 4, "sjtu": 4, "zhuo": 4, "chen": 4, "appli": 4, "microsoft": 4, "usa": 4, "zhuc": 4, "jian": 4, "wu": 4, "wujian": 4, "hui": 4, "bu": 4, "ceo": 4, 
"buhui": 4, "aishelldata": 4, "should": 5, "augment": 5, "allow": [5, 6], "ad": 5, "speed": 5, "perturb": 5, "tone": 5, "chang": 5, "permit": 5, "purpos": 5, "instead": [5, 6], "util": [5, 6], "tune": 5, "violat": 5, "strictli": [5, 6], "prohibit": [5, 6], "fine": 5, "cpcer": [5, 6], "lower": 5, "judg": 5, "superior": 5, "forc": 5, "align": 5, "obtain": [5, 6], "frame": 5, "level": 5, "classif": 5, "basi": 5, "shallow": 5, "fusion": 5, "end": 5, "e": [5, 6], "g": 5, "la": 5, "rnnt": 5, "transform": [5, 6], "come": 5, "right": 5, "interpret": 5, "belong": 5, "case": 5, "circumst": 5, "coordin": 5, "assign": 6, "illustr": 6, "aishell4": 6, "constrain": 6, "sourc": 6, "addition": 6, "corpu": 6, "soon": 6, "simpl": 6, "voic": 6, "activ": 6, "detect": 6, "vad": 6, "concaten": 6, "minimum": 6, "permut": 6, "charact": 6, "error": 6, "rate": 6, "calcul": 6, "step": 6, "firstli": 6, "refer": 6, "hypothesi": 6, "chronolog": 6, "order": 6, "secondli": 6, "cer": 6, "repeat": 6, "possibl": 6, "lowest": 6, "tthe": 6, "insert": 6, "Ins": 6, "substitut": 6, "delet": 6, "del": 6, "output": 6, "text": 6, "frac": 6, "mathcal": 6, "n_": 6, "100": 6, "where": 6, "usag": 6, "third": 6, "hug": 6, "face": 6, "list": 6, "clearli": 6, "privat": 6, "manual": 6, "simul": 6, "thei": 6, "mandatori": 6, "clear": 6, "scheme": 6, "delight": 7, "introduct": 7, "contact": 7}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"baselin": 0, "overview": [0, 2], "quick": 0, "start": 0, "result": 0, "contact": 1, "dataset": 2, "train": [2, 6], "data": 2, "detail": 2, "alimeet": 2, "corpu": 2, "get": 2, "introduct": 3, "call": 3, "particip": 3, "timelin": 3, "aoe": 3, "time": 3, "guidelin": 3, "organ": 4, "rule": 5, "track": 6, "evalu": 6, "speaker": 6, "attribut": 6, "asr": 6, "metric": 6, "sub": 6, "arrang": 6, "i": 6, "fix": 6, "condit": 6, "ii": 6, "open": 6, "asru": 7, "2023": 7, "multi": 7, "channel": 7, "parti": 7, "meet": 7, "transcript": 7, "challeng": 7, "2": 7, "0": 7, "m2met2": 7, 
"content": 7}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"Baseline": [[0, "baseline"]], "Overview": [[0, "overview"]], "Quick start": [[0, "quick-start"]], "Baseline results": [[0, "baseline-results"]], "Contact": [[1, "contact"]], "Datasets": [[2, "datasets"]], "Overview of training data": [[2, "overview-of-training-data"]], "Detail of AliMeeting corpus": [[2, "detail-of-alimeeting-corpus"]], "Get the data": [[2, "get-the-data"]], "Introduction": [[3, "introduction"]], "Call for participation": [[3, "call-for-participation"]], "Timeline(AOE Time)": [[3, "timeline-aoe-time"]], "Guidelines": [[3, "guidelines"]], "Organizers": [[4, "organizers"]], "Rules": [[5, "rules"]], "Track & Evaluation": [[6, "track-evaluation"]], "Speaker-Attributed ASR": [[6, "speaker-attributed-asr"]], "Evaluation metric": [[6, "evaluation-metric"]], "Sub-track arrangement": [[6, "sub-track-arrangement"]], "Sub-track I (Fixed Training Condition):": [[6, "sub-track-i-fixed-training-condition"]], "Sub-track II (Open Training Condition):": [[6, "sub-track-ii-open-training-condition"]], "ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)": [[7, "asru-2023-multi-channel-multi-party-meeting-transcription-challenge-2-0-m2met2-0"]], "Contents:": [[7, null]]}, "indexentries": {}}) |
| | | Search.setIndex({"docnames": ["Baseline", "Contact", "Dataset", "Introduction", "Organizers", "Rules", "Track_setting_and_evaluation", "index"], "filenames": ["Baseline.md", "Contact.md", "Dataset.md", "Introduction.md", "Organizers.md", "Rules.md", "Track_setting_and_evaluation.md", "index.rst"], "titles": ["Baseline", "Contact", "Datasets", "Introduction", "Organizers", "Rules", "Track & Evaluation", "ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0 (M2MeT2.0)"], "terms": {"we": [0, 2, 3, 7], "releas": [0, 2, 3, 6], "an": [0, 2, 3, 6], "e2": 0, "sa": 0, "asr": [0, 3, 7], "conduct": [0, 2], "funasr": 0, "time": [0, 6], "accord": [0, 3], "timelin": [0, 2], "The": [0, 2, 3, 5, 6], "model": [0, 2, 3, 5, 6], "architectur": 0, "i": [0, 2, 3, 5], "shown": [0, 2], "figur": [0, 6], "3": [0, 2, 3], "speakerencod": 0, "initi": 0, "pre": [0, 6], "train": [0, 3, 5, 7], "speaker": [0, 2, 3, 7], "verif": 0, "from": [0, 2, 3, 5, 6], "modelscop": [0, 6], "thi": [0, 3, 5, 6], "also": [0, 2, 3, 6], "us": [0, 2, 5, 6], "extract": 0, "embed": 0, "profil": 0, "To": [0, 2, 3, 7], "run": 0, "first": 0, "you": [0, 1], "need": 0, "instal": 0, "There": [0, 2], "ar": [0, 2, 3, 5, 6, 7], "two": [0, 3, 5, 7], "startup": 0, "script": [0, 2], "sh": 0, "evalu": [0, 2, 3, 7], "old": 0, "eval": [0, 2, 5, 6], "test": [0, 2, 3, 5, 6], "set": [0, 2, 3, 5, 6], "run_m2met_2023_inf": 0, "infer": 0, "new": [0, 2, 3, 6], "multi": [0, 3, 6], "channel": [0, 3], "parti": [0, 3, 6], "meet": [0, 2, 3, 6], "transcript": [0, 2, 3, 5, 6], "2": [0, 2, 6], "0": [0, 1, 2, 3], "m2met2": [0, 1, 3], "challeng": [0, 1, 3, 5, 6], "befor": 0, "must": [0, 3, 5, 6], "manual": [0, 6], "download": [0, 2], "unpack": 0, "alimeet": [0, 1, 6], "corpu": [0, 6], "place": [0, 2], "dataset": [0, 3, 5, 6, 7], "directori": 0, "eval_ali_far": 0, "eval_ali_near": 0, "test_ali_far": 0, "test_ali_near": 0, "train_ali_far": 0, "train_ali_near": 0, "test_2023_ali_far": 0, "after": 0, "which": [0, 2, 3, 6], 
"contain": [0, 2, 6], "onli": [0, 2, 5, 6], "raw": 0, "audio": [0, 2, 3, 6], "Then": 0, "put": 0, "given": 0, "wav": 0, "scp": 0, "wav_raw": 0, "segment": [0, 2, 6], "utt2spk": 0, "spk2utt": 0, "data": [0, 3, 5, 6], "shell": 0, "For": [0, 2], "more": [0, 2], "detail": [0, 3, 6], "can": [0, 2, 3, 5, 6], "see": 0, "here": 0, "system": [0, 3, 5, 6, 7], "tabl": [0, 2], "adopt": 0, "oracl": [0, 6], "dure": [0, 2, 6], "howev": [0, 3, 6], "due": [0, 3], "lack": 0, "label": [0, 5, 6], "provid": [0, 2, 6, 7], "addit": [0, 6], "spectral": 0, "cluster": 0, "meanwhil": 0, "show": 0, "impact": 0, "accuraci": [0, 6], "If": [1, 5, 6], "have": [1, 3], "ani": [1, 5, 6], "question": 1, "about": [1, 3], "pleas": 1, "u": [1, 2], "email": [1, 3, 4], "m2met": [1, 3, 6, 7], "gmail": 1, "com": [1, 4], "wechat": [1, 3], "group": [1, 2, 3], "In": [2, 3, 5], "fix": [2, 3, 7], "condit": [2, 3, 7], "restrict": 2, "three": [2, 3, 6], "publicli": [2, 6], "avail": [2, 6], "corpora": 2, "name": 2, "aishel": [2, 4, 6], "4": [2, 6], "cn": [2, 4, 6], "celeb": [2, 6], "perform": [2, 3], "call": 2, "2023": [2, 3, 5, 6], "score": [2, 6], "rank": [2, 3, 6], "describ": 2, "118": 2, "75": 2, "hour": [2, 3, 6], "speech": [2, 3, 6, 7], "total": [2, 6], "divid": [2, 6], "104": 2, "10": [2, 3, 6], "specif": [2, 6], "212": 2, "8": [2, 3], "20": 2, "session": [2, 3, 6, 7], "respect": 2, "each": [2, 3, 6], "consist": [2, 6], "15": [2, 3], "30": 2, "minut": 2, "discuss": 2, "particip": [2, 5, 6], "number": [2, 3, 6], "456": 2, "25": 2, "60": 2, "balanc": 2, "gender": 2, "coverag": 2, "collect": 2, "13": [2, 3], "venu": 2, "categor": 2, "type": 2, "small": 2, "medium": 2, "larg": [2, 3], "room": [2, 3], "size": 2, "rang": 2, "m": 2, "55": 2, "differ": [2, 3, 6], "give": 2, "varieti": 2, "acoust": [2, 3, 6], "properti": 2, "layout": 2, "paramet": [2, 5], "togeth": 2, "wall": 2, "materi": 2, "cover": 2, "cement": 2, "glass": 2, "etc": 2, "other": 2, "furnish": 2, "includ": [2, 3, 5, 6], "sofa": 2, "tv": 2, 
"blackboard": 2, "fan": 2, "air": 2, "condition": 2, "plant": 2, "record": [2, 6], "sit": 2, "around": 2, "microphon": [2, 3], "arrai": [2, 3], "natur": 2, "convers": 2, "distanc": 2, "5": 2, "all": [2, 3, 5, 6], "nativ": 2, "chines": 2, "speak": [2, 3], "mandarin": [2, 3], "without": 2, "strong": 2, "accent": 2, "variou": [2, 3], "kind": 2, "indoor": 2, "nois": [2, 3, 5], "limit": [2, 3, 5], "click": 2, "keyboard": 2, "door": 2, "open": [2, 3, 7], "close": [2, 3], "bubbl": 2, "made": [2, 3], "both": [2, 6], "requir": [2, 3, 6], "remain": [2, 3], "same": [2, 5], "posit": 2, "overlap": [2, 3], "between": [2, 6], "exampl": 2, "fig": 2, "1": 2, "within": [2, 3], "one": [2, 5], "ensur": 2, "ratio": 2, "select": [2, 3, 5, 6], "topic": 2, "medic": 2, "treatment": 2, "educ": 2, "busi": 2, "organ": [2, 3, 5, 6, 7], "manag": 2, "industri": [2, 3], "product": 2, "daili": 2, "routin": 2, "averag": 2, "42": 2, "27": 2, "34": 2, "76": 2, "A": [2, 4], "distribut": 2, "were": 2, "ident": [2, 6], "compris": [2, 3, 7], "therebi": 2, "share": 2, "similar": 2, "configur": 2, "field": [2, 3, 6], "signal": [2, 3], "headset": 2, "": [2, 6], "own": 2, "transcrib": [2, 3, 6], "It": [2, 6], "worth": [2, 6], "note": [2, 6], "far": [2, 3], "synchron": 2, "common": 2, "prepar": 2, "textgrid": 2, "format": 2, "inform": [2, 3], "durat": 2, "id": 2, "timestamp": [2, 6], "mention": 2, "abov": 2, "openslr": 2, "via": 2, "follow": [2, 5], "link": 2, "particularli": 2, "baselin": [2, 3, 7], "conveni": 2, "automat": [3, 7], "recognit": [3, 7], "diariz": 3, "signific": 3, "stride": 3, "recent": 3, "year": 3, "result": 3, "surg": 3, "technologi": 3, "applic": 3, "across": 3, "domain": 3, "present": 3, "uniqu": [3, 6], "complex": [3, 5], "divers": 3, "style": 3, "variabl": 3, "confer": 3, "environment": 3, "reverber": [3, 5], "over": 3, "sever": 3, "been": 3, "advanc": [3, 7], "develop": [3, 6], "rich": 3, "comput": [3, 5], "hear": 3, "multisourc": 3, "environ": 3, "chime": 3, "latest": 3, "iter": 3, 
"ha": 3, "particular": 3, "focu": 3, "distant": 3, "gener": 3, "topologi": 3, "scenario": 3, "while": 3, "progress": 3, "english": 3, "languag": [3, 5], "barrier": 3, "achiev": 3, "compar": 3, "non": 3, "multimod": 3, "base": 3, "process": [3, 6], "misp": 3, "instrument": 3, "seek": 3, "address": 3, "problem": 3, "visual": 3, "everydai": 3, "home": 3, "focus": 3, "tackl": 3, "issu": 3, "offlin": 3, "icassp2022": 3, "main": 3, "task": [3, 6, 7], "former": 3, "involv": [3, 6], "identifi": 3, "who": 3, "spoke": 3, "when": 3, "latter": 3, "aim": 3, "multipl": [3, 6], "simultan": 3, "pose": [3, 6], "technic": 3, "difficulti": 3, "interfer": 3, "build": [3, 6, 7], "success": [3, 7], "previou": 3, "excit": 3, "propos": [3, 7], "asru2023": [3, 7], "special": [3, 5, 7], "origin": [3, 5], "metric": [3, 7], "wa": [3, 6], "independ": 3, "meant": 3, "could": 3, "determin": 3, "correspond": [3, 5], "further": 3, "current": [3, 7], "talker": [3, 7], "toward": 3, "practic": 3, "attribut": [3, 7], "sub": [3, 5, 7], "track": [3, 5, 7], "what": 3, "facilit": [3, 7], "reproduc": [3, 7], "research": [3, 4, 7], "offer": 3, "comprehens": [3, 7], "overview": [3, 7], "rule": [3, 7], "furthermor": 3, "carefulli": 3, "curat": 3, "approxim": [3, 6], "design": 3, "enabl": 3, "valid": 3, "state": [3, 6, 7], "art": [3, 7], "area": 3, "april": 3, "29": 3, "registr": 3, "mai": 3, "deadlin": 3, "date": 3, "join": 3, "june": 3, "9": 3, "leaderboard": 3, "final": [3, 5, 6], "submiss": 3, "leaderboar": 3, "19": 3, "juli": 3, "paper": [3, 6], "decemb": 3, "12": 3, "16": 3, "asru": 3, "workshop": 3, "interest": 3, "whether": 3, "academia": 3, "regist": 3, "complet": 3, "googl": 3, "form": 3, "below": 3, "22": 3, "welcom": 3, "keep": 3, "up": 3, "updat": 3, "work": 3, "dai": 3, "send": 3, "invit": 3, "elig": [3, 5], "team": 3, "qualifi": 3, "adher": [3, 5], "publish": 3, "page": 3, "prior": 3, "submit": 3, "descript": [3, 6], "document": 3, "approach": [3, 5], "method": 3, "top": 3, "proceed": 3, "lei": 
4, "xie": 4, "professor": 4, "foundat": 4, "china": 4, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 4, "aik": 4, "lee": 4, "senior": 4, "scientist": 4, "institut": 4, "infocomm": 4, "star": 4, "singapor": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yan": 4, "princip": 4, "engin": 4, "alibaba": 4, "yzj": 4, "inc": 4, "shiliang": 4, "zhang": 4, "sly": 4, "zsl": 4, "yanmin": 4, "qian": 4, "shanghai": 4, "jiao": 4, "tong": 4, "univers": 4, "yanminqian": 4, "sjtu": 4, "zhuo": 4, "chen": 4, "appli": 4, "microsoft": 4, "usa": 4, "zhuc": 4, "jian": 4, "wu": 4, "wujian": 4, "hui": 4, "bu": 4, "ceo": 4, "buhui": 4, "aishelldata": 4, "should": 5, "augment": 5, "allow": [5, 6], "ad": 5, "speed": 5, "perturb": 5, "tone": 5, "chang": 5, "permit": 5, "purpos": 5, "instead": [5, 6], "util": [5, 6], "tune": 5, "violat": 5, "strictli": [5, 6], "prohibit": [5, 6], "fine": 5, "cpcer": [5, 6], "lower": 5, "judg": 5, "superior": 5, "forc": 5, "align": 5, "obtain": [5, 6], "frame": 5, "level": 5, "classif": 5, "basi": 5, "shallow": 5, "fusion": 5, "end": 5, "e": [5, 6], "g": 5, "la": 5, "rnnt": 5, "transform": [5, 6], "come": 5, "right": 5, "interpret": 5, "belong": 5, "case": 5, "circumst": 5, "coordin": 5, "assign": 6, "illustr": 6, "aishell4": 6, "constrain": 6, "sourc": 6, "addition": 6, "soon": 6, "simpl": 6, "voic": 6, "activ": 6, "detect": 6, "vad": 6, "concaten": 6, "minimum": 6, "permut": 6, "charact": 6, "error": 6, "rate": 6, "calcul": 6, "step": 6, "firstli": 6, "refer": 6, "hypothesi": 6, "chronolog": 6, "order": 6, "secondli": 6, "cer": 6, "repeat": 6, "possibl": 6, "lowest": 6, "tthe": 6, "insert": 6, "Ins": 6, "substitut": 6, "delet": 6, "del": 6, "output": 6, "text": 6, "frac": 6, "mathcal": 6, "n_": 6, "100": 6, "where": 6, "usag": 6, "third": 6, "hug": 6, "face": 6, "list": 6, "clearli": 6, "privat": 6, "simul": 6, "thei": 6, "mandatori": 6, "clear": 6, "scheme": 6, "delight": 7, "introduct": 7, "contact": 7}, "objects": {}, "objtypes": {}, "objnames": {}, 
"titleterms": {"baselin": 0, "overview": [0, 2], "quick": 0, "start": 0, "result": 0, "contact": 1, "dataset": 2, "train": [2, 6], "data": 2, "detail": 2, "alimeet": 2, "corpu": 2, "get": 2, "introduct": 3, "call": 3, "particip": 3, "timelin": 3, "aoe": 3, "time": 3, "guidelin": 3, "organ": 4, "rule": 5, "track": 6, "evalu": 6, "speaker": 6, "attribut": 6, "asr": 6, "metric": 6, "sub": 6, "arrang": 6, "i": 6, "fix": 6, "condit": 6, "ii": 6, "open": 6, "asru": 7, "2023": 7, "multi": 7, "channel": 7, "parti": 7, "meet": 7, "transcript": 7, "challeng": 7, "2": 7, "0": 7, "m2met2": 7, "content": 7}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"Baseline": [[0, "baseline"]], "Overview": [[0, "overview"]], "Quick start": [[0, "quick-start"]], "Baseline results": [[0, "baseline-results"]], "Contact": [[1, "contact"]], "Datasets": [[2, "datasets"]], "Overview of training data": [[2, "overview-of-training-data"]], "Detail of AliMeeting corpus": [[2, "detail-of-alimeeting-corpus"]], "Get the data": [[2, "get-the-data"]], "Introduction": [[3, "introduction"]], "Call for participation": [[3, "call-for-participation"]], "Timeline(AOE Time)": [[3, "timeline-aoe-time"]], "Guidelines": [[3, "guidelines"]], "Organizers": [[4, "organizers"]], "Rules": [[5, "rules"]], "Track & Evaluation": [[6, "track-evaluation"]], "Speaker-Attributed ASR": [[6, "speaker-attributed-asr"]], "Evaluation metric": [[6, "evaluation-metric"]], "Sub-track arrangement": [[6, "sub-track-arrangement"]], "Sub-track I (Fixed Training Condition):": [[6, "sub-track-i-fixed-training-condition"]], "Sub-track II (Open Training Condition):": [[6, "sub-track-ii-open-training-condition"]], "ASRU 2023 MULTI-CHANNEL MULTI-PARTY MEETING TRANSCRIPTION 
CHALLENGE 2.0 (M2MeT2.0)": [[7, "asru-2023-multi-channel-multi-party-meeting-transcription-challenge-2-0-m2met2-0"]], "Contents:": [[7, null]]}, "indexentries": {}}) |
| | |
| | | # -- Project information ----------------------------------------------------- |
| | | # https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information |
| | | |
| | | project = 'm2met2' |
| | | project = 'MULTI-PARTY MEETING TRANSCRIPTION CHALLENGE 2.0' |
| | | copyright = '2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University' |
| | | author = 'Speech Lab, Alibaba Group; Audio, Speech and Language Processing Group, Northwestern Polytechnical University' |
| | | |
| | |
| | | # Sphinx build info version 1 |
| | | # This file hashes the configuration used when building these files. When it is not found, a full rebuild will be done. |
| | | config: 5462207d1656a9ae4ca43c2890d094be |
| | | config: 06d9c1d4093817b45b9d4df7ab350eaf |
| | | tags: 645f666f9bcd5a90fca523b33c5a78b7 |
| | |
| | |  |
| | | |
| | | ## Quick start |
| | | #TODO: fill with the README.md of the baseline |
| | | |
| | | First, install FunASR and ModelScope ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html)). |
| | | The baseline system provides a training script and an inference script: `run.sh` trains the baseline system and evaluates it on the M2MET Eval and Test sets, while `run_m2met_2023_infer.sh` runs inference on the new test set released for this challenge and generates files in the final submission format. |
| | | Before running `run.sh`, download and unpack the [AliMeeting](http://www.openslr.org/119/) corpus yourself and place it under the `./dataset` directory: |
| | | ```shell |
| | | dataset |
| | | |-- Eval_Ali_far |
| | | |-- Eval_Ali_near |
| | | |-- Test_Ali_far |
| | | |-- Test_Ali_near |
| | | |-- Train_Ali_far |
| | | |-- Train_Ali_near |
| | | ``` |
| | | Before running `run_m2met_2023_infer.sh`, place the test set `Test_2023_Ali_far` (audio only, to be released on June 16) under the `./dataset` directory. Then place the `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk` and `spk2utt` files provided by the organizers under the `./data/Test_2023_Ali_far` directory. |
| | | ```shell |
| | | data/Test_2023_Ali_far |
| | | |-- wav.scp |
| | | |-- wav_raw.scp |
| | | |-- segments |
| | | |-- utt2spk |
| | | |-- spk2utt |
| | | ``` |
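The `utt2spk` and `spk2utt` files listed above look like Kaldi-style mappings, where `utt2spk` holds one `utterance_id speaker_id` pair per line and `spk2utt` is its inverse. Assuming that format applies here, the following sketch shows how one file can be derived from the other, which can help when checking that the two provided files agree. The helper name and the example IDs are hypothetical.

```python
from collections import defaultdict


def utt2spk_to_spk2utt(utt2spk_lines):
    """Invert 'utt_id spk_id' lines into sorted 'spk_id utt_id ...' lines."""
    spk2utt = defaultdict(list)
    for line in utt2spk_lines:
        parts = line.split()
        if not parts:
            continue  # skip blank lines
        if len(parts) != 2:
            raise ValueError(f"expected 'utt_id spk_id', got: {line!r}")
        utt_id, spk_id = parts
        spk2utt[spk_id].append(utt_id)
    # Sort speakers and utterances so the output is deterministic.
    return [
        f"{spk} {' '.join(sorted(utts))}"
        for spk, utts in sorted(spk2utt.items())
    ]


# Hypothetical IDs, for illustration only.
example = ["R1_S1_0001 R1_S1", "R1_S2_0001 R1_S2", "R1_S1_0002 R1_S1"]
print("\n".join(utt2spk_to_spk2utt(example)))
```

Comparing the result with the provided `spk2utt` (after sorting both) is a cheap sanity check before launching inference.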
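| | | For more details on the baseline system, see [here](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md). |
| | | ## Baseline results |
| | | The results of the baseline system are shown in Table 3. During training, the speaker profile uses oracle speaker embeddings. Because oracle speaker labels are not available during evaluation, the speaker profile is instead provided by an additional spectral clustering step. We also report results obtained with the oracle speaker profile on the Eval and Test sets to show the impact of speaker profile accuracy. |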
| | |  |
| | |
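| | | # Introduction |
| | | ## Challenge introduction |
| | | |
| | | Recent advances in speech processing technologies such as automatic speech recognition (ASR) and speaker diarization have enabled a wide range of intelligent speech applications. However, meeting scenarios remain highly challenging because of their complex acoustic conditions and diverse speaking styles, including overlapping speech, a variable number of speakers, far-field signals in large conference rooms, and environmental noise and reverberation. |
| | | |
| | | To advance speech recognition in meeting scenarios, many related challenges have been held, such as the Rich Transcription evaluation and the CHIME (Computational Hearing in Multisource Environments) challenges. The latest CHIME challenge focuses on distant automatic speech recognition and on developing systems that generalize across arrays of different topologies and application scenarios. However, differences between languages have limited progress in non-English meeting transcription. The MISP (Multimodal Information Based Speech Processing) and M2MeT (Multi-Channel Multi-Party Meeting Transcription) challenges have contributed to advancing speech recognition in Mandarin meeting scenarios. The MISP challenge focuses on audio-visual multimodal approaches to distant multi-microphone signal processing in everyday home environments, while the M2MeT challenge focuses on the speech overlap problem of meeting transcription in offline meeting rooms. |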
| | |
| | | ## Timeline (AoE time) |
| | | |
| | | - $ 2023.4.29: $ Registration opens |
| | | - $ 2023.5.8: $ Baseline release |
| | | - $ 2023.5.15: $ Registration deadline |
| | | - $ 2023.6.9: $ Test data release |
| | | - $ 2023.6.13: $ Final results submission deadline |
| | | - $ 2023.6.19: $ Evaluation results and ranking release |
| | | - $ 2023.7.3: $ Paper submission deadline |
| | | - $ 2023.7.10: $ Final-version paper submission deadline |
| | | - $ 2023.12.12: $ ASRU Workshop & challenge session |
| | | - $ 2023.5.11: $ Baseline release |
| | | - $ 2023.5.22: $ Registration deadline |
| | | - $ 2023.6.16: $ Test data release; leaderboard opens |
| | | - $ 2023.6.20: $ Final results submission deadline; leaderboard closes |
| | | - $ 2023.6.26: $ Evaluation results and ranking release |
| | | - $ 2023.7.3: $ Paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session) |
| | | - $ 2023.7.10: $ Final-version paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session) |
| | | - $ 2023.12.12: $ ASRU Workshop & Challenge Session |
| | | |
| | | ## Registration |
| | | |
| | | Prospective participants from academia and industry should fill in the Google form below on or before May 15, 2023: |
| | | Prospective participants from academia and industry should fill in the Google form below on or before May 22, 2023. Participants are also welcome to join the [official WeChat group](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html) to discuss the challenge and receive the latest updates: |
| | | |
| | | [M2MET2.0 registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link) |
| | | |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Index &#8212; m2met2 documentation</title> |
| | | <title>Index &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="#" title="General Index" |
| | | accesskey="I">Index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Index</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="#" title="General Index" |
| | | >Index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Index</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
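| | | <title>ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0 &#8212; m2met2 documentation</title> |
| | | <title>ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0 &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |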
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%AE%80%E4%BB%8B.html" title="Introduction" |
| | | accesskey="N">Next</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="#">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | #" class="text-logo">m2met2 documentation</a> |
| | | #" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%AE%80%E4%BB%8B.html" title="Introduction" |
| | | >Next</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="#">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="#">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Search &#8212; m2met2 documentation</title> |
| | | <title>Search &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="genindex.html" title="General Index" |
| | | accesskey="I">Index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Search</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-toc"> |
| | | |
| | |
| | | <li class="right" style="margin-right: 10px"> |
| | | <a href="genindex.html" title="General Index" |
| | | >Index</a></li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Search</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | Search.setIndex({"docnames": ["index", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "filenames": ["index.rst", "\u57fa\u7ebf.md", "\u6570\u636e\u96c6.md", "\u7b80\u4ecb.md", "\u7ec4\u59d4\u4f1a.md", "\u8054\u7cfb\u65b9\u5f0f.md", "\u89c4\u5219.md", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30.md"], "titles": ["ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u7ade\u8d5b\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "terms": {"m2met": [0, 3, 5, 7], "asru2023": [0, 3], "m2met2": [0, 3, 5, 7], "funasr": 1, "sa": 1, "asr": [1, 3, 7], "speakerencod": 1, "modelscop": [1, 7], "todo": 1, "fill": 1, "with": 1, "the": 1, "readm": 1, "md": 1, "of": 1, "baselin": [1, 2], "aishel": [2, 7], "cn": [2, 4, 7], "celeb": [2, 7], "test": [2, 6, 7], "2023": [2, 3, 6, 7], "118": 2, "75": 2, "104": 2, "train": 2, "eval": [2, 6], "10": [2, 3, 7], "212": 2, "15": [2, 3], "30": 2, "456": 2, "25": 2, "13": [2, 3], "55": 2, "42": 2, "27": 2, "34": 2, "76": 2, "20": 2, "textgrid": 2, "id": 2, "openslr": 2, "automat": 3, "speech": 3, "recognit": 3, "speaker": 3, "diariz": 3, "rich": 3, "transcript": 3, "evalu": 3, "chime": 3, "comput": 3, "hear": 3, "in": 3, "multisourc": 3, "environ": 3, "misp": 3, "multimod": 3, "inform": 3, "base": 3, "process": 3, "multi": 3, "channel": 3, "parti": 3, "meet": 3, "iassp2022": 3, "29": 3, "19": 3, "12": 3, "asru": 3, "workshop": 3, "challeng": 3, "session": 3, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 4, "aik": 4, "lee": 4, "star": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yzj": 4, "alibaba": 4, "inc": 4, "com": [4, 5], "sli": 4, "zsl": 4, "yanminqian": 4, "sjtu": 4, "zhuc": 4, "microsoft": 4, "wujian": 4, "ceo": 4, "buhui": 4, "aishelldata": 4, "alimeet": [5, 7], "gmail": 5, 
"cpcer": [6, 7], "las": 6, "rnnt": 6, "transform": 6, "aishell4": 7, "vad": 7, "cer": 7, "ins": 7, "sub": 7, "del": 7, "text": 7, "frac": 7, "mathcal": 7, "n_": 7, "total": 7, "time": 7, "100": 7, "hug": 7, "face": 7}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"asru": 0, "2023": 0, "alimeet": 2, "aoe": 3}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0": [[0, "asru-2023-2-0"]], "\u76ee\u5f55:": [[0, null]], "\u57fa\u7ebf": [[1, "id1"]], "\u57fa\u7ebf\u6982\u8ff0": [[1, "id2"]], "\u5feb\u901f\u5f00\u59cb": [[1, "id3"]], "\u57fa\u7ebf\u7ed3\u679c": [[1, "id4"]], "\u6570\u636e\u96c6": [[2, "id1"]], "\u6570\u636e\u96c6\u6982\u8ff0": [[2, "id2"]], "Alimeeting\u6570\u636e\u96c6\u4ecb\u7ecd": [[2, "alimeeting"]], "\u83b7\u53d6\u6570\u636e": [[2, "id3"]], "\u7b80\u4ecb": [[3, "id1"]], "\u7ade\u8d5b\u4ecb\u7ecd": [[3, "id2"]], "\u65f6\u95f4\u5b89\u6392(AOE\u65f6\u95f4)": [[3, "aoe"]], "\u7ade\u8d5b\u62a5\u540d": [[3, "id3"]], "\u7ec4\u59d4\u4f1a": [[4, "id1"]], "\u8054\u7cfb\u65b9\u5f0f": [[5, "id1"]], "\u7ade\u8d5b\u89c4\u5219": [[6, "id1"]], "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30": [[7, "id1"]], "\u8bf4\u8bdd\u4eba\u76f8\u5173\u7684\u8bed\u97f3\u8bc6\u522b": [[7, "id2"]], "\u8bc4\u4f30\u65b9\u6cd5": [[7, "id3"]], "\u5b50\u8d5b\u9053\u8bbe\u7f6e": [[7, "id4"]], "\u5b50\u8d5b\u9053\u4e00 (\u9650\u5b9a\u8bad\u7ec3\u6570\u636e):": [[7, "id5"]], "\u5b50\u8d5b\u9053\u4e8c (\u5f00\u653e\u8bad\u7ec3\u6570\u636e):": [[7, "id6"]]}, "indexentries": {}}) |
| | | Search.setIndex({"docnames": ["index", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "filenames": ["index.rst", "\u57fa\u7ebf.md", "\u6570\u636e\u96c6.md", "\u7b80\u4ecb.md", "\u7ec4\u59d4\u4f1a.md", "\u8054\u7cfb\u65b9\u5f0f.md", "\u89c4\u5219.md", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30.md"], "titles": ["ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0", "\u57fa\u7ebf", "\u6570\u636e\u96c6", "\u7b80\u4ecb", "\u7ec4\u59d4\u4f1a", "\u8054\u7cfb\u65b9\u5f0f", "\u7ade\u8d5b\u89c4\u5219", "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30"], "terms": {"m2met": [0, 1, 3, 5, 7], "asru2023": [0, 3], "m2met2": [0, 3, 5, 7], "funasr": 1, "sa": 1, "asr": [1, 3, 7], "speakerencod": 1, "modelscop": [1, 7], "instal": 1, "run": 1, "sh": 1, "run_m2met_2023_inf": 1, "alimeet": [1, 5, 7], "dataset": 1, "eval_ali_far": 1, "eval_ali_near": 1, "test_ali_far": 1, "test_ali_near": 1, "train_ali_far": 1, "train_ali_near": 1, "test_2023_ali_far": 1, "16": [1, 3], "wav": 1, "scp": 1, "wav_raw": 1, "segment": 1, "utt2spk": 1, "spk2utt": 1, "data": 1, "shell": 1, "aishel": [2, 7], "cn": [2, 4, 7], "celeb": [2, 7], "test": [2, 6, 7], "2023": [2, 3, 6, 7], "118": 2, "75": 2, "104": 2, "train": 2, "eval": [2, 6], "10": [2, 3, 7], "212": 2, "15": 2, "30": 2, "456": 2, "25": 2, "13": 2, "55": 2, "42": 2, "27": 2, "34": 2, "76": 2, "20": [2, 3], "textgrid": 2, "id": 2, "openslr": 2, "baselin": 2, "automat": 3, "speech": 3, "recognit": 3, "speaker": 3, "diariz": 3, "rich": 3, "transcript": 3, "evalu": 3, "chime": 3, "comput": 3, "hear": 3, "in": 3, "multisourc": 3, "environ": 3, "misp": 3, "multimod": 3, "inform": 3, "base": 3, "process": 3, "multi": 3, "channel": 3, "parti": 3, "meet": 3, "iassp2022": 3, "29": 3, "11": 3, "22": 3, "26": 3, "session": 3, "12": 3, "asru": 3, "workshop": 3, "challeng": 3, "lxie": 4, "nwpu": 4, "edu": 4, "kong": 
4, "aik": 4, "lee": 4, "star": 4, "kongaik": 4, "ieee": 4, "org": 4, "zhiji": 4, "yzj": 4, "alibaba": 4, "inc": 4, "com": [4, 5], "sli": 4, "zsl": 4, "yanminqian": 4, "sjtu": 4, "zhuc": 4, "microsoft": 4, "wujian": 4, "ceo": 4, "buhui": 4, "aishelldata": 4, "gmail": 5, "cpcer": [6, 7], "las": 6, "rnnt": 6, "transform": 6, "aishell4": 7, "vad": 7, "cer": 7, "ins": 7, "sub": 7, "del": 7, "text": 7, "frac": 7, "mathcal": 7, "n_": 7, "total": 7, "time": 7, "100": 7, "hug": 7, "face": 7}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"asru": 0, "2023": 0, "alimeet": 2, "aoe": 3}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 8, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx": 57}, "alltitles": {"ASRU 2023 \u591a\u901a\u9053\u591a\u65b9\u4f1a\u8bae\u8f6c\u5f55\u6311\u6218 2.0": [[0, "asru-2023-2-0"]], "\u76ee\u5f55:": [[0, null]], "\u57fa\u7ebf": [[1, "id1"]], "\u57fa\u7ebf\u6982\u8ff0": [[1, "id2"]], "\u5feb\u901f\u5f00\u59cb": [[1, "id3"]], "\u57fa\u7ebf\u7ed3\u679c": [[1, "id4"]], "\u6570\u636e\u96c6": [[2, "id1"]], "\u6570\u636e\u96c6\u6982\u8ff0": [[2, "id2"]], "Alimeeting\u6570\u636e\u96c6\u4ecb\u7ecd": [[2, "alimeeting"]], "\u83b7\u53d6\u6570\u636e": [[2, "id3"]], "\u7b80\u4ecb": [[3, "id1"]], "\u7ade\u8d5b\u4ecb\u7ecd": [[3, "id2"]], "\u65f6\u95f4\u5b89\u6392(AOE\u65f6\u95f4)": [[3, "aoe"]], "\u7ade\u8d5b\u62a5\u540d": [[3, "id3"]], "\u7ec4\u59d4\u4f1a": [[4, "id1"]], "\u8054\u7cfb\u65b9\u5f0f": [[5, "id1"]], "\u7ade\u8d5b\u89c4\u5219": [[6, "id1"]], "\u8d5b\u9053\u8bbe\u7f6e\u4e0e\u8bc4\u4f30": [[7, "id1"]], "\u8bf4\u8bdd\u4eba\u76f8\u5173\u7684\u8bed\u97f3\u8bc6\u522b": [[7, "id2"]], "\u8bc4\u4f30\u65b9\u6cd5": [[7, "id3"]], "\u5b50\u8d5b\u9053\u8bbe\u7f6e": [[7, "id4"]], "\u5b50\u8d5b\u9053\u4e00 (\u9650\u5b9a\u8bad\u7ec3\u6570\u636e):": [[7, 
"id5"]], "\u5b50\u8d5b\u9053\u4e8c (\u5f00\u653e\u8bad\u7ec3\u6570\u636e):": [[7, "id6"]]}, "indexentries": {}}) |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Baseline &#8212; m2met2 documentation</title> |
| | | <title>Baseline &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html" title="Track setting and evaluation" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Baseline</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | </section> |
| | | <section id="id3"> |
| | | <h2>Quick start<a class="headerlink" href="#id3" title="Permalink to this heading">¶</a></h2> |
| | | <p>#TODO: fill with the README.md of the baseline</p> |
| | | <p>First, install FunASR and ModelScope (<a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/en/installation.html">installation</a>).<br /> |
| | | The baseline system provides two scripts, one for training and one for testing: <code class="docutils literal notranslate"><span class="pre">run.sh</span></code> trains the baseline system and evaluates it on the M2MET Eval and Test sets, while <code class="docutils literal notranslate"><span class="pre">run_m2met_2023_infer.sh</span></code> runs inference on the new test set released for this challenge and generates files in the final submission format. |
| | | Before running <code class="docutils literal notranslate"><span class="pre">run.sh</span></code>, download and extract the <a class="reference external" href="http://www.openslr.org/119/">AliMeeting</a> dataset yourself and place it under <code class="docutils literal notranslate"><span class="pre">./dataset</span></code>:</p> |
| | | <div class="highlight-shell notranslate"><div class="highlight"><pre><span></span>dataset |
| | | <span class="p">|</span>--<span class="w"> </span>Eval_Ali_far |
| | | <span class="p">|</span>--<span class="w"> </span>Eval_Ali_near |
| | | <span class="p">|</span>--<span class="w"> </span>Test_Ali_far |
| | | <span class="p">|</span>--<span class="w"> </span>Test_Ali_near |
| | | <span class="p">|</span>--<span class="w"> </span>Train_Ali_far |
| | | <span class="p">|</span>--<span class="w"> </span>Train_Ali_near |
| | | Before running<span class="w"> </span><span class="sb">`</span>run_m2met_2023_infer.sh<span class="sb">`</span>,<span class="w"> </span>place the test set <span class="sb">`</span>Test_2023_Ali_far<span class="sb">`</span> (audio only; to be released on June 16) under <span class="sb">`</span>./dataset<span class="sb">`</span>. Then place the organizer-provided <span class="sb">`</span>wav.scp<span class="sb">`</span>, <span class="sb">`</span>wav_raw.scp<span class="sb">`</span>, <span class="sb">`</span>segments<span class="sb">`</span>, <span class="sb">`</span>utt2spk<span class="sb">`</span>, and <span class="sb">`</span>spk2utt<span class="sb">`</span> under <span class="sb">`</span>./data/Test_2023_Ali_far<span class="sb">`</span>. |
| | | <span class="sb">```</span>shell |
| | | data/Test_2023_Ali_far |
| | | <span class="p">|</span>--<span class="w"> </span>wav.scp |
| | | <span class="p">|</span>--<span class="w"> </span>wav_raw.scp |
| | | <span class="p">|</span>--<span class="w"> </span>segments |
| | | <span class="p">|</span>--<span class="w"> </span>utt2spk |
| | | <span class="p">|</span>--<span class="w"> </span>spk2utt |
| | | </pre></div> |
| | | </div> |
| | | <p>For more details on the baseline system, see the <a class="reference external" href="https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md">SA-ASR recipe README</a>.</p> |
| | | </section> |
| | | <section id="id4"> |
| | | <h2>Baseline results<a class="headerlink" href="#id4" title="Permalink to this heading">¶</a></h2> |
| | |
| | | <li class="right" > |
| | | <a href="%E8%B5%9B%E9%81%93%E8%AE%BE%E7%BD%AE%E4%B8%8E%E8%AF%84%E4%BC%B0.html" title="Track setting and evaluation" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Baseline</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Datasets &#8212; m2met2 documentation</title> |
| | | <title>Datasets &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%AE%80%E4%BB%8B.html" title="Introduction" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Datasets</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%AE%80%E4%BB%8B.html" title="Introduction" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Datasets</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Introduction &#8212; m2met2 documentation</title> |
| | | <title>Introduction &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="index.html" title="ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Introduction</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <h2>Timeline (AoE time)<a class="headerlink" href="#aoe" title="Permalink to this heading">¶</a></h2> |
| | | <ul class="simple"> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.4.29: \)</span> Registration opens</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.5.8: \)</span> Baseline release</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.5.15: \)</span> Registration deadline</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.9: \)</span> Test data release</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.13: \)</span> Final results submission deadline</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.19: \)</span> Evaluation results and ranking release</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.7.3: \)</span> Paper submission deadline</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.7.10: \)</span> Final-version paper submission deadline</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.12.12: \)</span> ASRU Workshop & challenge session</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.5.11: \)</span> Baseline release</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.5.22: \)</span> Registration deadline</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.16: \)</span> Test data release; leaderboard opens</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.20: \)</span> Final results submission deadline; leaderboard closes</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.6.26: \)</span> Evaluation results and ranking release</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.7.3: \)</span> Paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session)</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.7.10: \)</span> Final-version paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session)</p></li> |
| | | <li><p><span class="math notranslate nohighlight">\( 2023.12.12: \)</span> ASRU Workshop & Challenge Session</p></li> |
| | | </ul> |
| | | </section> |
| | | <section id="id3"> |
| | | <h2>Registration<a class="headerlink" href="#id3" title="Permalink to this heading">¶</a></h2> |
| | | <p>Prospective participants from academia and industry should fill in the Google form below on or before May 15, 2023:</p> |
| | | <p>Prospective participants from academia and industry should fill in the Google form below on or before May 22, 2023. Participants are also welcome to join the <a class="reference external" href="https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html">official WeChat group</a> to discuss the challenge and receive the latest updates:</p> |
| | | <p><a class="reference external" href="https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link">M2MET2.0 registration</a></p> |
| | | <p>The organizers will notify eligible teams by email within three working days. Teams must follow the challenge rules, which will be published on the challenge website. Before the ranking is released, each participant must submit a system description document detailing the methods and models used. The organizers will include the top-ranking teams in the ASRU2023 proceedings.</p> |
| | | </section> |
| | |
| | | <li class="right" > |
| | | <a href="index.html" title="ASRU 2023 Multi-Channel Multi-Party Meeting Transcription Challenge 2.0" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Introduction</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Organizers &#8212; m2met2 documentation</title> |
| | | <title>Organizers &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E8%A7%84%E5%88%99.html" title="Challenge rules" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Organizers</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E8%A7%84%E5%88%99.html" title="Challenge rules" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Organizers</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Contact &#8212; m2met2 documentation</title> |
| | | <title>Contact &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%BB%84%E5%A7%94%E4%BC%9A.html" title="Organizers" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Contact</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E7%BB%84%E5%A7%94%E4%BC%9A.html" title="Organizers" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Contact</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Challenge rules &#8212; m2met2 documentation</title> |
| | | <title>Challenge rules &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E5%9F%BA%E7%BA%BF.html" title="Baseline" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Challenge rules</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E5%9F%BA%E7%BA%BF.html" title="Baseline" |
| | | >Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Challenge rules</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | <link rel="stylesheet" type="text/css" href="_static/css/bootstrap-theme.min.css" /> |
| | | <meta name="viewport" content="width=device-width, initial-scale=1.0"> |
| | | |
| | | <title>Track setting and evaluation &#8212; m2met2 documentation</title> |
| | | <title>Track setting and evaluation &#8212; Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</title> |
| | | <link rel="stylesheet" type="text/css" href="_static/pygments.css" /> |
| | | <link rel="stylesheet" type="text/css" href="_static/guzzle.css" /> |
| | | <script data-url_root="./" id="documentation_options" src="_static/documentation_options.js"></script> |
| | |
| | | <li class="right" > |
| | | <a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="Datasets" |
| | | accesskey="P">Previous</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Track setting and evaluation</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | </div> |
| | | <div id="left-column"> |
| | | <div class="sphinxsidebar"><a href=" |
| | | index.html" class="text-logo">m2met2 documentation</a> |
| | | index.html" class="text-logo">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> |
| | | <div class="sidebar-block"> |
| | | <div class="sidebar-wrapper"> |
| | | <div id="main-search"> |
| | |
| | | <li class="right" > |
| | | <a href="%E6%95%B0%E6%8D%AE%E9%9B%86.html" title="Dataset" |
| | | >Previous page</a> |</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">m2met2 documentation</a> »</li> |
| | | <li class="nav-item nav-item-0"><a href="index.html">Multi-Channel Multi-Party Meeting Transcription Challenge 2.0</a> »</li> |
| | | <li class="nav-item nav-item-this"><a href="">Track setting and evaluation</a></li> |
| | | </ul> |
| | | </div> |
| | |
| | | # -- Project information ----------------------------------------------------- |
| | | # https://www.sphinx-doc.org/en/master/usage/configuration.html#project-information |
| | | |
| | | project = 'm2met2' |
| | | project = 'Multi-Channel Multi-Party Meeting Transcription Challenge 2.0' |
| | | copyright = '2023, Speech Lab, Alibaba Group; ASLP Group, Northwestern Polytechnical University' |
| | | author = 'Speech Lab, Alibaba Group; Audio, Speech and Language Processing Group, Northwestern Polytechnical University' |
| | | |
| | |
| | |  |
| | | |
| | | ## Quick start |
| | | #TODO: fill with the README.md of the baseline |
| | | |
| | | First, install FunASR and ModelScope ([installation](https://alibaba-damo-academy.github.io/FunASR/en/installation.html)). |
| | | The baseline system provides two scripts, one for training and one for testing. `run.sh` trains the baseline system and evaluates it on the M2MeT Eval and Test sets, while `run_m2met_2023_infer.sh` runs inference on the new test set released for this challenge and generates files in the final submission format. |
| | | Before running `run.sh`, download and extract the [AliMeeting](http://www.openslr.org/119/) corpus yourself and place it under the `./dataset` directory: |
| | | ```shell |
| | | dataset |
| | | |-- Eval_Ali_far |
| | | |-- Eval_Ali_near |
| | | |-- Test_Ali_far |
| | | |-- Test_Ali_near |
| | | |-- Train_Ali_far |
| | | |-- Train_Ali_near |
| | | ``` |
| | | Before running `run_m2met_2023_infer.sh`, place the test set `Test_2023_Ali_far` (audio only, to be released on June 16) under the `./dataset` directory. Then place the `wav.scp`, `wav_raw.scp`, `segments`, `utt2spk`, and `spk2utt` files provided by the organizers under the `./data/Test_2023_Ali_far` directory. |
| | | ```shell |
| | | data/Test_2023_Ali_far |
| | | |-- wav.scp |
| | | |-- wav_raw.scp |
| | | |-- segments |
| | | |-- utt2spk |
| | | |-- spk2utt |
| | | ``` |
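The two layouts above can be checked before launching either script. The following is a minimal sketch and is not part of the baseline recipe: the function name `missing_entries` and the variables `ALIMEETING_DIRS` and `TEST_2023_FILES` are hypothetical, and only the directory and file names are taken from the listings above.

```shell
# Hypothetical helper (not part of the FunASR recipe): list expected entries
# that are missing under a base directory.
# Usage: missing_entries BASE NAME [NAME...]
# Prints one missing path per line and returns 1 if anything is missing.
missing_entries() {
    me_base=$1
    shift
    me_status=0
    for me_name in "$@"; do
        if [ ! -e "$me_base/$me_name" ]; then
            printf '%s\n' "$me_base/$me_name"
            me_status=1
        fi
    done
    return "$me_status"
}

# Names copied from the two directory listings above.
ALIMEETING_DIRS="Eval_Ali_far Eval_Ali_near Test_Ali_far Test_Ali_near Train_Ali_far Train_Ali_near"
TEST_2023_FILES="wav.scp wav_raw.scp segments utt2spk spk2utt"
```

Assuming the working directory is the recipe directory, `missing_entries ./dataset $ALIMEETING_DIRS` could be run before `run.sh`, and `missing_entries ./data/Test_2023_Ali_far $TEST_2023_FILES` before `run_m2met_2023_infer.sh`. Each call prints the missing paths and returns a non-zero status if any are absent.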
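| | | For more details on the baseline system, see [here](https://github.com/alibaba-damo-academy/FunASR/blob/main/egs/alimeeting/sa-asr/README.md). |
| | | ## Baseline results |
| | | The results of the baseline system are shown in Table 3. During training, the speaker profile uses oracle speaker embeddings. Because oracle speaker labels are not available during evaluation, the speaker profile is instead provided by an additional spectral clustering step. We also report results obtained with the oracle speaker profile on the Eval and Test sets to show the impact of speaker profile accuracy. |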
| | |  |
| | |
| | | # Introduction |
| | | ## Challenge introduction |
| | | |
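| | | Recent advances in speech processing technologies such as automatic speech recognition (ASR) and speaker diarization have enabled a wide range of intelligent speech applications. However, meeting scenarios remain highly challenging because of their complex acoustic conditions and diverse speaking styles, including overlapping speech, a variable number of speakers, far-field signals in large conference rooms, and ambient noise and reverberation. |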
| | | |
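| | | To advance speech recognition in meeting scenarios, many related challenges have been organized, such as the Rich Transcription evaluation and the CHiME (Computational Hearing in Multisource Environments) challenges. The latest CHiME challenge focuses on distant automatic speech recognition and on developing systems that generalize across array topologies and application scenarios. However, differences between languages have limited progress in non-English meeting transcription. The MISP (Multimodal Information Based Speech Processing) and M2MeT (Multi-Channel Multi-Party Meeting Transcription) challenges have contributed to speech recognition in Mandarin meeting scenarios. The MISP challenge uses audio-visual multimodal methods to address distant multi-microphone signal processing in everyday home environments, while the M2MeT challenge focuses on the speech overlap problem of meeting transcription in offline meeting rooms. |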
| | |
| | | ## Timeline (AoE time) |
| | | |
| | | - $ 2023.4.29: $ Registration opens |
| | | - $ 2023.5.8: $ Baseline release |
| | | - $ 2023.5.15: $ Registration deadline |
| | | - $ 2023.6.9: $ Test set data release |
| | | - $ 2023.6.13: $ Final submission deadline |
| | | - $ 2023.6.19: $ Evaluation results and ranking release |
| | | - $ 2023.7.3: $ Paper submission deadline |
| | | - $ 2023.7.10: $ Final version paper submission deadline |
| | | - $ 2023.12.12: $ ASRU Workshop & challenge session |
| | | - $ 2023.5.11: $ Baseline release |
| | | - $ 2023.5.22: $ Registration deadline |
| | | - $ 2023.6.16: $ Test set data release; leaderboard opens |
| | | - $ 2023.6.20: $ Final submission deadline; leaderboard closes |
| | | - $ 2023.6.26: $ Evaluation results and ranking release |
| | | - $ 2023.7.3: $ Paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session) |
| | | - $ 2023.7.10: $ Final version paper submission deadline (submit through the official ASRU2023 submission system and select the challenge session) |
| | | - $ 2023.12.12: $ ASRU Workshop & Challenge Session |
| | | |
| | | ## Challenge registration |
| | | |
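| | | Prospective participants from academia and industry should fill out the Google form below on or before May 15, 2023: |
| | | Prospective participants from academia and industry should fill out the Google form below on or before May 22, 2023. Participants are also welcome to join the [official WeChat group](https://alibaba-damo-academy.github.io/FunASR/m2met2_cn/%E8%81%94%E7%B3%BB%E6%96%B9%E5%BC%8F.html) to discuss the challenge and receive the latest news: |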
| | | |
| | | [M2MET2.0 registration](https://docs.google.com/forms/d/e/1FAIpQLSf77T9vAl7Ym-u5g8gXu18SBofoWRaFShBo26Ym0-HDxHW9PQ/viewform?usp=sf_link) |
| | | |