Spremljaj
Fan Yu
Naslov
Navedeno
Navedeno
Leto
Wenet: Production oriented streaming and non-streaming end-to-end speech recognition toolkit
Z Yao, D Wu, X Wang, B Zhang, F Yu, C Yang, Z Peng, X Chen, L Xie, ...
arXiv preprint arXiv:2102.01547, 2021
2942021
M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge
F Yu, S Zhang, Y Fu, L Xie, S Zheng, Z Du, W Huang, P Guo, Z Yan, B Ma, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
121*2022
The accented english speech recognition challenge 2020: open datasets, tracks, baselines, results and methods
X Shi, F Yu, Y Lu, Y Liang, Q Feng, D Wang, Y Qian, L Xie
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
822021
Summary on the ICASSP 2022 multi-channel multi-party meeting transcription grand challenge
F Yu, S Zhang, P Guo, Y Fu, Z Du, S Zheng, W Huang, L Xie, ZH Tan, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
312022
An embarrassingly simple approach for LLM with strong ASR capacity
Z Ma, G Yang, Y Yang, Z Gao, J Wang, Z Du, F Yu, Q Chen, S Zheng, ...
arXiv preprint arXiv:2402.08846, 2024
302024
The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines
F Yu, Z Yao, X Wang, K An, L Xie, Z Ou, B Liu, X Li, G Miao
2021 IEEE Spoken Language Technology Workshop (SLT), 1117-1123, 2021
232021
Boundary and context aware training for cif-based non-autoregressive end-to-end asr
F Yu, H Luo, P Guo, Y Liang, Z Yao, L Xie, Y Gao, L Hou, S Zhang
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
162021
MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario
F Yu, S Zhang, P Guo, Y Liang, Z Du, Y Lin, L Xie
2022 IEEE Spoken Language Technology Workshop (SLT), 144-151, 2023
14*2023
A comparative study on speaker-attributed automatic speech recognition in multi-party meetings
F Yu, Z Du, S Zhang, Y Lin, L Xie
arXiv preprint arXiv:2203.16834, 2022
142022
Cosyvoice 2: Scalable streaming speech synthesis with large language models
Z Du, Y Wang, Q Chen, X Shi, X Lv, T Zhao, Z Gao, Y Yang, C Gao, ...
arXiv preprint arXiv:2412.10117, 2024
122024
Mala-asr: Multimedia-assisted llm-based asr
G Yang, Z Ma, F Yu, Z Gao, S Zhang, X Chen
arXiv preprint arXiv:2406.05839, 2024
82024
SlideSpeech: A Large Scale Slide-Enriched Audio-Visual Corpus
H Wang, F Yu, X Shi, Y Wang, S Zhang, M Li
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
82024
Ba-moe: Boundary-aware mixture-of-experts adapter for code-switching speech recognition
P Chen, F Yu, Y Liang, H Xue, X Wan, N Zheng, H Zhou, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
82023
Ba-sot: Boundary-aware serialized output training for multi-talker asr
Y Liang, F Yu, Y Li, P Guo, S Zhang, Q Chen, L Xie
arXiv preprint arXiv:2305.13716, 2023
82023
Casa-asr: Context-aware speaker-attributed asr
M Shi, Z Du, Q Chen, F Yu, Y Li, S Zhang, J Zhang, LR Dai
arXiv preprint arXiv:2305.12459, 2023
72023
The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR
Y Liang, M Shi, F Yu, Y Li, S Zhang, Z Du, Q Chen, L Xie, Y Qian, J Wu, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
52023
A comparative study on multichannel speaker-attributed automatic speech recognition in multi-party meetings
M Shi, J Zhang, Z Du, F Yu, Q Chen, S Zhang, LR Dai
2023 Asia Pacific Signal and Information Processing Association Annual …, 2023
52023
The iscslp 2022 intelligent cockpit speech recognition challenge (icsrc): Dataset, tracks, baseline and results
A Zhang, F Yu, K Huang, L Xie, L Wang, ES Chng, H Bu, B Zhang, ...
2022 13th International Symposium on Chinese Spoken Language Processing …, 2022
52022
Sa-Paraformer: Non-autoregressive end-to-end speaker-attributed ASR
Y Li, F Yu, Y Liang, P Guo, M Shi, Z Du, S Zhang, L Xie
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
42023
Hourglass-avsr: Down-up sampling-based computational efficiency model for audio-visual speech recognition
F Yu, H Wang, Z Ma, S Zhang
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
32024
Sistem trenutno ne more izvesti postopka. Poskusite znova pozneje.
Članki 1–20