Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3417 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1305 | 2024 |
Slic-hf: Sequence likelihood calibration with human feedback Y Zhao, R Joshi, T Liu, M Khalman, M Saleh, PJ Liu arXiv preprint arXiv:2305.10425, 2023 | 253 | 2023 |
Statistical rejection sampling improves preference optimization T Liu, Y Zhao, R Joshi, M Khalman, M Saleh, PJ Liu, J Liu arXiv preprint arXiv:2309.06657, 2023 | 176 | 2023 |
Calibrating sequence likelihood improves conditional language generation Y Zhao, M Khalman, R Joshi, S Narayan, M Saleh, PJ Liu arXiv preprint arXiv:2210.00045, 2022 | 129 | 2022 |
Direct language model alignment from online ai feedback S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ... arXiv preprint arXiv:2402.04792, 2024 | 113 | 2024 |
Gemini: A family of highly capable multimodal models. CoRR, abs/2312.11805, 2023. doi: 10.48550 R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint ARXIV.2312.11805, 24-28, 0 | 40 | |
Lipo: Listwise preference optimization through learning-to-rank T Liu, Z Qin, J Wu, J Shen, M Khalman, R Joshi, Y Zhao, M Saleh, ... arXiv preprint arXiv:2402.01878, 2024 | 32 | 2024 |
ForumSum: A multi-speaker conversation summarization dataset M Khalman, Y Zhao, M Saleh Findings of the association for computational linguistics: EMNLP 2021, 4592-4599, 2021 | 25 | 2021 |
Building math agents with multi-turn iterative preference learning W Xiong, C Shi, J Shen, A Rosenberg, Z Qin, D Calandriello, M Khalman, ... arXiv preprint arXiv:2409.02392, 2024 | 15 | 2024 |
Calibrating likelihoods towards consistency in summarization models P Zablotskaia, M Khalman, R Joshi, LB Soares, S Jakobovits, J Maynez, ... arXiv preprint arXiv:2310.08764, 2023 | 4 | 2023 |