Takip et
Misha Khalman
Misha Khalman
Anthropic PBC
Doğrulanmış e-posta adresi yok
Başlık
Alıntı yapanlar
Alıntı yapanlar
Yıl
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
34172023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
13052024
Slic-hf: Sequence likelihood calibration with human feedback
Y Zhao, R Joshi, T Liu, M Khalman, M Saleh, PJ Liu
arXiv preprint arXiv:2305.10425, 2023
2532023
Statistical rejection sampling improves preference optimization
T Liu, Y Zhao, R Joshi, M Khalman, M Saleh, PJ Liu, J Liu
arXiv preprint arXiv:2309.06657, 2023
1762023
Calibrating sequence likelihood improves conditional language generation
Y Zhao, M Khalman, R Joshi, S Narayan, M Saleh, PJ Liu
arXiv preprint arXiv:2210.00045, 2022
1292022
Direct language model alignment from online ai feedback
S Guo, B Zhang, T Liu, T Liu, M Khalman, F Llinares, A Rame, T Mesnard, ...
arXiv preprint arXiv:2402.04792, 2024
1132024
Gemini: A family of highly capable multimodal models. CoRR, abs/2312.11805, 2023. doi: 10.48550
R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint ARXIV.2312.11805, 24-28, 0
40
Lipo: Listwise preference optimization through learning-to-rank
T Liu, Z Qin, J Wu, J Shen, M Khalman, R Joshi, Y Zhao, M Saleh, ...
arXiv preprint arXiv:2402.01878, 2024
322024
ForumSum: A multi-speaker conversation summarization dataset
M Khalman, Y Zhao, M Saleh
Findings of the association for computational linguistics: EMNLP 2021, 4592-4599, 2021
252021
Building math agents with multi-turn iterative preference learning
W Xiong, C Shi, J Shen, A Rosenberg, Z Qin, D Calandriello, M Khalman, ...
arXiv preprint arXiv:2409.02392, 2024
152024
Calibrating likelihoods towards consistency in summarization models
P Zablotskaia, M Khalman, R Joshi, LB Soares, S Jakobovits, J Maynez, ...
arXiv preprint arXiv:2310.08764, 2023
42023
Sistem, işlemi şu anda gerçekleştiremiyor. Daha sonra yeniden deneyin.
Makaleler 1–11