| | |
| | | |
| | | ä¸ºäºæ¨å¨ä¼è®®åºæ¯è¯é³è¯å«çåå±ï¼å·²ç»æå¾å¤ç¸å
³çææèµï¼å¦ Rich Transcription evaluation å CHIMEï¼Computational Hearing in Multisource Environmentsï¼ ææèµãææ°çCHIMEææèµå
³æ³¨äºè¿è·ç¦»èªå¨è¯é³è¯å«åå¼åè½å¨åç§ä¸åææç»æçéµåååºç¨åºæ¯ä¸éç¨çç³»ç»ãç¶èä¸åè¯è¨ä¹é´çå·®å¼éå¶äºéè±è¯ä¼è®®è½¬å½çè¿å±ãMISPï¼Multimodal Information Based Speech Processingï¼åM2MeTï¼Multi-Channel Multi-Party Meeting Transcriptionï¼ææèµä¸ºæ¨å¨æ®éè¯ä¼è®®åºæ¯è¯é³è¯å«ååºäºè´¡ç®ãMISPææèµä¾§éäºç¨è§å¬å¤æ¨¡æçæ¹æ³è§£å³æ¥å¸¸å®¶åºç¯å¢ä¸çè¿è·ç¦»å¤éº¦å
é£ä¿¡å·å¤çé®é¢ï¼èM2MeTææåä¾§éäºè§£å³ç¦»çº¿ä¼è®®å®¤ä¸ä¼è®®è½¬å½çè¯é³éå é®é¢ã |
| | | |
| | | ASSP2022 M2MeTææçä¾§éç¹æ¯ä¼è®®åºæ¯ï¼å®å
æ¬ä¸¤ä¸ªèµéï¼è¯´è¯äººæ¥è®°åå¤è¯´è¯äººèªå¨è¯é³è¯å«ãåè
æ¶åè¯å«âè°å¨ä»ä¹æ¶å说äºè¯âï¼èåè
æ¨å¨åæ¶è¯å«æ¥èªå¤ä¸ªè¯´è¯äººçè¯é³ï¼è¯é³éå ååç§åªå£°å¸¦æ¥äºå·¨å¤§çææ¯å°é¾ã |
| | | IASSP2022 M2MeTææçä¾§éç¹æ¯ä¼è®®åºæ¯ï¼å®å
æ¬ä¸¤ä¸ªèµéï¼è¯´è¯äººæ¥è®°åå¤è¯´è¯äººèªå¨è¯é³è¯å«ãåè
æ¶åè¯å«âè°å¨ä»ä¹æ¶å说äºè¯âï¼èåè
æ¨å¨åæ¶è¯å«æ¥èªå¤ä¸ªè¯´è¯äººçè¯é³ï¼è¯é³éå ååç§åªå£°å¸¦æ¥äºå·¨å¤§çææ¯å°é¾ã |
| | | |
| | | å¨ä¸ä¸å±M2METæå举åçåºç¡ä¸ï¼æä»¬å°å¨ASRU2023ä¸ç»§ç»ä¸¾åM2MET2.0ææèµãå¨ä¸ä¸å±M2METææèµä¸ï¼è¯ä¼°ææ æ¯è¯´è¯äººæ å
³çï¼æä»¬åªè½å¾å°è¯å«ææ¬ï¼èä¸è½ç¡®å®ç¸åºç说è¯äººã |
| | | 为äºè§£å³è¿ä¸å±éæ§å¹¶å°ç°å¨çå¤è¯´è¯äººè¯é³è¯å«ç³»ç»æ¨åå®ç¨åï¼M2MET2.0ææèµå°å¨è¯´è¯äººç¸å
³ç人ç©ä¸è¯ä¼°ï¼å¹¶ä¸åæ¶è®¾ç«é宿°æ®ä¸ä¸é宿°æ®ä¸¤ä¸ªåèµéãéè¿å°è¯é³å½å±äºç¹å®ç说è¯äººï¼è¿é¡¹ä»»å¡æ¨å¨æé«å¤è¯´è¯äººASRç³»ç»å¨çå®ä¸çç¯å¢ä¸çåç¡®æ§åéç¨æ§ã |