Follow
Gokul Swamy
Gokul Swamy
Verified email at andrew.cmu.edu - Homepage
Title
Cited by
Cited by
Year
Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap
G Swamy, S Choudhury, JA Bagnell, ZS Wu
38th International Conference on Machine Learning (ICML), 2021
80*2021
On the Utility of Model Learning in HRI
G Swamy, J Schulz, R Choudhury, D Hadfield-Menell, A Dragan
arXiv preprint arXiv:1901.01291, 2019
65*2019
A Minimaximalist Approach to Reinforcement Learning from Human Feedback
G Swamy, C Dann, R Kidambi, ZS Wu, A Agarwal
arXiv preprint arXiv:2401.04056, 2024
522024
Scaled autonomy: Enabling human operators to control robot fleets
G Swamy, S Reddy, S Levine, AD Dragan
2020 IEEE International Conference on Robotics and Automation (ICRA), 5942-5948, 2020
492020
Sequence model imitation learning with unobserved contexts
G Swamy, S Choudhury, J Bagnell, SZ Wu
Advances in Neural Information Processing Systems 35, 17665-17676, 2022
272022
Causal imitation learning under temporally correlated noise
G Swamy, S Choudhury, D Bagnell, S Wu
International Conference on Machine Learning, 20877-20890, 2022
272022
Inverse Reinforcement Learning without Reinforcement Learning
G Swamy, S Choudhury, D Bagnell, S Wu
International Conference on Machine Learning, 33299-33318, 2023
202023
Minimax Optimal Online Imitation Learning via Replay Estimation
G Swamy, N Rajaraman, M Peng, S Choudhury, J Bagnell, SZ Wu, J Jiao, ...
Advances in Neural Information Processing Systems 35, 7077-7088, 2022
172022
REBEL: Reinforcement Learning via Regressing Relative Rewards
Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ...
arXiv preprint arXiv:2404.16767, 2024
152024
Hybrid Inverse Reinforcement Learning
J Ren, G Swamy, ZS Wu, JA Bagnell, S Choudhury
arXiv preprint arXiv:2402.08848, 2024
92024
Learning Shared Safety Constraints from Multi-task Demonstrations
K Kim, G Swamy, Z Liu, D Zhao, S Choudhury, SZ Wu
Advances in Neural Information Processing Systems 36, 2024
72024
Understanding Preference Fine-Tuning Through the Lens of Coverage
Y Song, G Swamy, A Singh, JA Bagnell, W Sun
arXiv preprint arXiv:2406.01462, 2024
4*2024
A Critique of Strictly Batch Imitation Learning
G Swamy, S Choudhury, JA Bagnell, ZS Wu
arXiv preprint arXiv:2110.02063, 2021
32021
Generative Models for Pose Transfer
P Chao, A Li, G Swamy
arXiv preprint arXiv:1806.09070, 2018
32018
EvIL: Evolution Strategies for Generalisable Imitation Learning
S Sapora, G Swamy, C Lu, YW Teh, JN Foerster
arXiv preprint arXiv:2406.11905, 2024
22024
Multi-Agent Imitation Learning: Value is Easy, Regret is Hard
J Tang, G Swamy, F Fang, ZS Wu
arXiv preprint arXiv:2406.04219, 2024
12024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
R Wu, Y Chen, G Swamy, K Brantley, W Sun
arXiv preprint arXiv:2410.13855, 2024
2024
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Z Gao, W Zhan, JD Chang, G Swamy, K Brantley, JD Lee, W Sun
arXiv preprint arXiv:2410.04612, 2024
2024
Efficient Inverse Reinforcement Learning without Compounding Errors
NE Dice, G Swamy, S Choudhury, W Sun
First Reinforcement Learning Safety Workshop, 2024
2024
Game-Theoretic Algorithms for Conditional Moment Matching
G Swamy, S Choudhury, JA Bagnell, ZS Wu
arXiv preprint arXiv:2208.09551, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20