Följ
Ruibin Yuan
Ruibin Yuan
HKUST
Verifierad e-postadress på andrew.cmu.edu
Titel
Citeras av
Citeras av
År
Mmmu: A massive multi-discipline multimodal understanding and reasoning benchmark for expert agi
X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ...
CVPR 2024 Best Paper Nomination, 2023
3672023
Mert: Acoustic music understanding model with large-scale self-supervised training
Y Li, R Yuan, G Zhang, Y Ma, X Chen, H Yin, C Lin, A Ragni, E Benetos, ...
ICLR 2024, 2023
862023
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
J Zhan, J Dai, J Ye, Y Zhou, D Zhang, Z Liu, X Zhang, R Yuan, G Zhang, ...
ACL 2024 (main), 2024
632024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
R Yuan, H Lin, Y Wang, Z Tian, S Wu, T Shen, G Zhang, Y Wu, C Liu, ...
ACL 2024 (Findings), 2024
292024
Rq-rag: Learning to refine queries for retrieval augmented generation
CM Chan, C Xu, R Yuan, H Luo, W Xue, Y Guo, J Fu
COLM 2024, 2024
262024
Chinese open instruction generalist: A preliminary release
G Zhang, Y Shi, R Liu, R Yuan, Y Li, S Dong, Y Shu, Z Li, Z Wang, C Lin, ...
arXiv preprint arXiv:2304.07987, 2023
242023
Map-music2vec: A simple and effective baseline for self-supervised music audio representation learning
Y Li, R Yuan, G Zhang, Y Ma, C Lin, X Chen, A Ragni, H Yin, Z Hu, H He, ...
ISMIR 2023 (LBD), 2022
222022
Map-neo: Highly capable and transparent bilingual large language model series
G Zhang, S Qu, J Liu, C Zhang, C Lin, CL Yu, D Pan, E Cheng, J Liu, ...
arXiv preprint arXiv:2405.19327, 2024
202024
Marble: Music audio representation benchmark for universal evaluation
R Yuan, Y Ma, Y Li, G Zhang, X Chen, H Yin, Y Liu, J Huang, Z Tian, ...
NeurIPS 2023, 2024
192024
Lyricwhiz: Robust multilingual lyrics transcription by whispering to chatgpt
L Zhuo, R Yuan, J Pan, Y Ma, Y Li, G Zhang, S Liu, R Dannenberg, J Fu, ...
ISMIR 2023, 2023
18*2023
ComposerX: Multi-Agent Symbolic Music Composition with LLMs
Q Deng, Q Yang, R Yuan, Y Huang, Y Wang, X Liu, Z Tian, J Pan, ...
ISMIR 2024, 2024
112024
LLMs Meet Multimodal Generation and Editing: A Survey
Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu, R Yuan, Y Xing, W Wang, ...
arXiv preprint arXiv:2405.19334, 2024
102024
Chinese tiny llm: Pretraining a chinese-centric large language model
X Du, Z Yu, S Gao, D Pan, Y Cheng, Z Ma, R Yuan, X Qu, J Liu, T Zheng, ...
COLM 2024, 2024
102024
On the effectiveness of speech self-supervised learning for music
Y Ma, R Yuan, Y Li, G Zhang, X Chen, H Yin, C Lin, E Benetos, A Ragni, ...
ISMIR 2023, 2023
102023
Foundation models for music: A survey
Y Ma, A Øland, A Ragni, BMS Del Sette, C Saitis, C Donahue, C Lin, ...
arXiv preprint arXiv:2408.14340, 2024
82024
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models
Y Li, G Zhang, X Qu, J Li, Z Li, Z Wang, H Li, R Yuan, Y Ma, K Zhang, ...
ACL 2024 (Findings), 2024
72024
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation
X Qi, J Pan, P Li, R Yuan, X Chi, M Li, W Luo, W Xue, S Zhang, Q Liu, ...
CVPR 2024, 2023
62023
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning
Y Bai, X Du, Y Liang, Y Jin, Z Liu, J Zhou, T Zheng, X Zhang, N Ma, ...
arXiv preprint arXiv:2403.18058, 2024
52024
Cmmmu: A chinese massive multi-discipline multimodal understanding benchmark
G Zhang, X Du, B Chen, Y Liang, T Luo, T Zheng, K Zhu, Y Cheng, C Xu, ...
arXiv preprint arXiv:2401.11944, 2024
52024
Deid-vc: Speaker de-identification via zero-shot pseudo voice conversion
R Yuan, Y Wu, J Li, J Kim
INTERSPEECH 2022, 2022
52022
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–20