Fahim Tajwar
PhD Student, Machine Learning, Carnegie Mellon University
Verified email at andrew.cmu.edu
Title
Cited by
Year
Surgical Fine-Tuning Improves Adaptation to Distribution Shifts
Y Lee, AS Chen, F Tajwar, A Kumar, H Yao, P Liang, C Finn
International Conference on Learning Representations (ICLR), 2023, 2022
217
2022
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
F Tajwar, A Singh, A Sharma, R Rafailov, J Schneider, T Xie, S Ermon, ...
International Conference on Machine Learning (ICML), 2024
81
2024
Scalable deep learning to identify brick kilns and aid regulatory capacity
J Lee, NR Brooks, F Tajwar, M Burke, S Ermon, DB Lobell, D Biswas, ...
Proceedings of the National Academy of Sciences 118 (17), e2018863118, 2021
48
2021
No True State-of-the-Art? OOD Detection Methods are Inconsistent across Datasets
F Tajwar, A Kumar, SM Xie, P Liang
ICML Workshop on Uncertainty & Robustness in Deep Learning, 2021
27*
2021
When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning
A Xie, F Tajwar, A Sharma, C Finn
Neural Information Processing Systems (NeurIPS), 2022
17
2022
Do Deep Networks Transfer Invariances Across Classes?
A Zhou, F Tajwar, A Robey, T Knowles, GJ Pappas, H Hassani, C Finn
International Conference on Learning Representations (ICLR), 2022
14
2022
Conservative Prediction via Data-Driven Confidence Minimization
C Choi, F Tajwar, Y Lee, H Yao, A Kumar, C Finn
Transactions of Machine Learning Research (TMLR), 2024
8
2024
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias
MS Mark, A Sharma, F Tajwar, R Rafailov, S Levine, C Finn
arXiv preprint arXiv:2310.08558, 2023
4*
2023
Training a Generally Curious Agent
F Tajwar, Y Jiang, A Thankaraj, SS Rahman, JZ Kolter, J Schneider, ...
arXiv preprint arXiv:2502.17543, 2025
2025
Self-Regulation and Requesting Interventions
SY Min, Y Wu, J Sun, M Kaufmann, F Tajwar, Y Bisk, R Salakhutdinov
arXiv preprint arXiv:2502.04576, 2025
2025
State Combinatorial Generalization In Decision Making With Conditional Diffusion Models
X Duan, Y He, F Tajwar, WT Chen, R Salakhutdinov, J Schneider
arXiv preprint arXiv:2501.13241, 2025
2025
Fine-tuning LLM Agents with Retrospective In-Context Online Learning
W Chen, J Chen, F Tajwar, H Zhu, X Duan, R Salakhutdinov, J Schneider
Adaptive Foundation Models: Evolving AI for Personalized and Efficient Learning, 2024
2024