Artikel dengan mandat akses publik - Jie Liu (刘杰)Pelajari lebih lanjut
Tersedia di suatu tempat: 4
Beyond one-preference-fits-all alignment: Multi-objective direct preference optimization
Z Zhou, J Liu, J Shao, X Yue, C Yang, W Ouyang, Y Qiao
arXiv preprint arXiv:2310.03708, 2023
Mandat: National Natural Science Foundation of China
Inception convolution with efficient dilation search
J Liu, C Li, F Liang, C Lin, M Sun, J Yan, W Ouyang, D Xu
CVPR 2021 (Oral), 2021
Mandat: Australian Research Council, National Natural Science Foundation of China …
ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency
C Li*, J Liu*, Y Zhang, Y Wei, Y Niu, Y Yang, Y Liu, W Ouyang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2023), 2023
Mandat: Australian Research Council, Medical Research Future Fund, Australia
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
Y Zhang*, J Liu*, C Li, Y Niu, Y Yang, Y Liu, W Ouyang
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2024), 2023
Mandat: National Natural Science Foundation of China
Informasi terbitan dan pendanaan ditentukan secara otomatis oleh program komputer