Obserwuj
Yuchen Hu
Tytuł
Cytowane przez
Cytowane przez
Rok
Hyporadise: An open baseline for generative speech recognition with large language models
C Chen*, Y Hu*, CHH Yang, SM Siniscalchi, PY Chen, ES Chng
NeurIPS 2023, 2023
562023
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Y Hu, N Hou, C Chen, ES Chng
ICASSP 2022, 2022
552022
Noise-Robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
C Chen, N Hou, Y Hu, S Shirol, ES Chng
ICASSP 2022, 2022
482022
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning
QS Zhu, L Zhou, J Zhang, SJ Liu, YC Hu, LR Dai
ICASSP 2023, 2023
352023
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
C Chen, Y Hu, Q Zhang, H Zou, B Zhu, ES Chng
AAAI 2023, 2023
312023
Interactive audio-text representation for automated audio captioning with contrastive learning
C Chen, N Hou, Y Hu, H Zou, X Qi, ES Chng
Interspeech 2022, 2022
302022
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition
Y Hu, C Chen, R Li, Q Zhu, ES Chng
ICASSP 2023, 2023
282023
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
Y Hu, C Chen, CHH Yang, R Li, C Zhang, PY Chen, ES Chng
ICLR 2024, 2024
262024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
C Chen, R Li, Y Hu, SM Siniscalchi, PY Chen, E Chng, CHH Yang
ICLR 2024, 2024
252024
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model
C Chen, Y Hu, W Weng, ES Chng
ICASSP 2023, 2023
222023
Dual-path style learning for end-to-end noise-robust speech recognition
Y Hu, N Hou, C Chen, ES Chng
Interspeech 2023, 2023
212023
Self-Critical Sequence Training for Automatic Speech Recognition
C Chen, Y Hu, N Hou, X Qi, H Zou, ES Chng
ICASSP 2022, 2022
202022
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
H Zou, M Shen, C Chen, Y Hu, D Rajan, ES Chng
ACL 2023, 2023
182023
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation
Y Hu, C Chen, H Zou, X Zhong, ES Chng
ICASSP 2023, 2023
172023
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
Y Hu, C Chen, CHH Yang, R Li, D Zhang, Z Chen, ES Chng
ACL 2024, 2024
162024
A Neural State-Space Model Approach to Efficient Speech Separation
C Chen, CHH Yang, K Li, Y Hu, PJ Ku, ES Chng
Interspeech 2023, 2023
152023
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
D Liu, M Du, X Li, Y Hu, L Dai
IWSLT 2021, 2021
152021
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR
Y Hu, C Chen, Q Zhu, ES Chng
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023
132023
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
Y Hu, C Chen, R Li, Q Zhu, ES Chng
Interspeech 2024, 2023
122023
Unsupervised Noise Adaptation using Data Simulation
C Chen, Y Hu, H Zou, L Sun, ES Chng
ICASSP 2023, 2023
122023
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20