Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ... arXiv preprint arXiv:2102.01547, 2021 | 294 | 2021 |
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 121* | 2022 |
The accented english speech recognition challenge 2020: open datasets, tracks, baselines, results and methods X Shi, F Yu, Y Lu, Y Liang, Q Feng, D Wang, Y Qian, L Xie ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 82 | 2021 |
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 31 | 2022 |
An embarrassingly simple approach for LLM with strong ASR capacity Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ... arXiv preprint arXiv:2402.08846, 2024 | 30 | 2024 |
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao 2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021 | 23 | 2021 |
Boundary and context aware training for cif-based non-autoregressive end-to-end asr F Yu, H Luo, P Guo, Y Liang, Z Yao, L Xie, Y Gao, L Hou, S Zhang 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 16 | 2021 |
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie 2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023 | 14* | 2023 |
A comparative study on speaker-attributed automatic speech recognition in multi-party meetings F Yu, Z Du, S Zhang, Y Lin, L Xie arXiv preprint arXiv:2203.16834, 2022 | 14 | 2022 |
Cosyvoice 2: Scalable streaming speech synthesis with large language models Z Du, Y Wang, Q Chen, X Shi, X Lv, T Zhao, Z Gao, Y Yang, C Gao, ... arXiv preprint arXiv:2412.10117, 2024 | 12 | 2024 |
Mala-asr: Multimedia-assisted llm-based asr G Yang, Z Ma, F Yu, Z Gao, S Zhang, X Chen arXiv preprint arXiv:2406.05839, 2024 | 8 | 2024 |
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus H Wang, F Yu, X Shi, Y Wang, S Zhang, M Li ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 8 | 2024 |
Ba-moe: Boundary-aware mixture-of-experts adapter for code-switching speech recognition P Chen, F Yu, Y Liang, H Xue, X Wan, N Zheng, H Zhou, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 8 | 2023 |
Ba-sot: Boundary-aware serialized output training for multi-talker asr Y Liang, F Yu, Y Li, P Guo, S Zhang, Q Chen, L Xie arXiv preprint arXiv:2305.13716, 2023 | 8 | 2023 |
Casa-asr: Context-aware speaker-attributed asr M Shi, Z Du, Q Chen, F Yu, Y Li, S Zhang, J Zhang, LR Dai arXiv preprint arXiv:2305.12459, 2023 | 7 | 2023 |
The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR Y Liang, M Shi, F Yu, Y Li, S Zhang, Z Du, Q Chen, L Xie, Y Qian, J Wu, ... 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023 | 5 | 2023 |
A comparative study on multichannel speaker-attributed automatic speech recognition in multi-party meetings M Shi, J Zhang, Z Du, F Yu, Q Chen, S Zhang, LR Dai 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 5 | 2023 |
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ... 2022 13th International Symposium on Chinese Spoken Language Processing …, 2022 | 5 | 2022 |
Sa-Paraformer: Non-autoregressive end-to-end speaker-attributed ASR Y Li, F Yu, Y Liang, P Guo, M Shi, Z Du, S Zhang, L Xie 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 4 | 2023 |
Hourglass-avsr: Down-up sampling-based computational efficiency model for audio-visual speech recognition F Yu, H Wang, Z Ma, S Zhang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 3 | 2024 |