Alborz Geramifard

Citée par

	Toutes	Depuis 2019
Citations	2084	1237
indice h	22	17
indice i10	39	27

280

140

210

20072008200920102011201220132014201520162017201820192020202120222023202412 12 21 15 47 88 85 102 107 91 107 138 139 159 228 242 265 196

Coauteurs

Jonathan P. HowRichard C. Maclaurin Professor of Aerospace Engineering, Massachusetts Institute of TechnologyAdresse e-mail validée de mit.edu
Nicholas RoyMITAdresse e-mail validée de csail.mit.edu
Satwik KotturResearch Scientist, Facebook AIAdresse e-mail validée de fb.com
Seungwhan MoonFacebook, Carnegie Mellon UniversityAdresse e-mail validée de fb.com
Ahmad BeiramiGoogle DeepMindAdresse e-mail validée de google.com
Michael BowlingAmii, University of AlbertaAdresse e-mail validée de ualberta.ca
Paul A CrookResearch Scientist, Meta Platforms, Inc.Adresse e-mail validée de fb.com
Nazim Kemal UreIstanbul Technical UniversityAdresse e-mail validée de itu.edu.tr
Richard S. SuttonKeen, Amii, and University of AlbertaAdresse e-mail validée de richsutton.com
Rajen SubbaGoogleAdresse e-mail validée de google.com
Girish ChowdharyAssociate ProfessorAdresse e-mail validée de illinois.edu
Chinnadhurai SankarResearch Lead, SliceX AI | ex-Meta AIAdresse e-mail validée de fb.com
Ankita DeFacebookAdresse e-mail validée de fb.com
Thomas J. WalshSony AIAdresse e-mail validée de sony.com
Babak DamavandiMeta Reality LabsAdresse e-mail validée de fb.com
Csaba SzepesvariDeepMind & University of AlbertaAdresse e-mail validée de cs.ualberta.ca
David WhitneyMetaAdresse e-mail validée de meta.com
Christoph DannResearch Scientist, GoogleAdresse e-mail validée de google.com
Stefanie TellexBrown UniversityAdresse e-mail validée de cs.brown.edu
Will DabneyDeepMindAdresse e-mail validée de google.com

Suivre

Alborz Geramifard

Research Scientist Director at Meta

Adresse e-mail validée de meta.com - Page d'accueil

Reinforcement Learning Conversational AI Planning Brain and Cognitive Sciences


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Dyna-style planning with linear function approximation and prioritized sweeping RS Sutton, C Szepesvári, A Geramifard, MP Bowling arXiv preprint arXiv:1206.3285, 2012	240	2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013	166	2013
Decentralized control of partially observable Markov decision processes C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer 52nd IEEE Conference on Decision and Control, 2398-2405, 2013	159	2013
Cooperative mission planning for multi-UAV teams SS Ponda, LB Johnson, A Geramifard, JP How Handbook of unmanned aerial vehicles 2, 1447-1490, 2015	99	2015
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations S Kottur, S Moon, A Geramifard, B Damavandi arXiv preprint arXiv:2104.08667, 2021	96	2021
RLPy: a value-function-based reinforcement learning framework for education and research. A Geramifard, C Dann, RH Klein, W Dabney, JP How J. Mach. Learn. Res. 16 (1), 1573-1578, 2015	94	2015
Incremental least-squares temporal difference learning A Geramifard, M Bowling, RS Sutton Proceedings of the 21st national conference on Artificial intelligence …, 2006	94	2006
Situated and interactive multimodal conversations S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ... arXiv preprint arXiv:2006.01460, 2020	85	2020
Online Discovery of Feature Dependencies. A Geramifard, F Doshi, J Redding, N Roy, JP How ICML, 881-888, 2011	82	2011
Overview of the ninth dialog system technology challenge: Dstc9 C Gunasekara, S Kim, LF D'haro, A Rastogi, YN Chen, M Eric, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024	74	2024
iLSTD: Eligibility traces and convergence analysis A Geramifard, M Bowling, M Zinkevich, RS Sutton Advances in Neural Information Processing Systems 19, 2006	66	2006
Intelligent cooperative control architecture: a framework for performance improvement using safe learning A Geramifard, J Redding, JP How Journal of Intelligent & Robotic Systems 72, 83-103, 2013	58	2013
On the design and use of a micro air vehicle to track and avoid adversaries R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ... The International Journal of Robotics Research 29 (5), 529-546, 2010	57	2010
Customized movie trailers A Geramifard US Patent App. 14/105,428, 2015	51	2015
Reinforcement learning with misspecified model classes J Joseph, A Geramifard, JW Roberts, JP How, N Roy 2013 IEEE International Conference on Robotics and Automation, 939-946, 2013	47	2013
UAV cooperative control with stochastic risk models A Geramifard, J Redding, N Roy, JP How Proceedings of the 2011 american control conference, 3393-3398, 2011	45	2011
Biased cost pathfinding A Geramifard, P Chubak, V Bulitko Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006	45	2006
Memformer: A memory-augmented transformer for sequence modeling Q Wu, Z Lan, K Qian, J Gu, A Geramifard, Z Yu arXiv preprint arXiv:2010.06891, 2020	39	2020
An intelligent cooperative control architecture J Redding, A Geramifard, A Undurti, HL Choi, JP How Proceedings of the 2010 American control conference, 57-62, 2010	37	2010
Annotation inconsistency and entity bias in MultiWOZ K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar arXiv preprint arXiv:2105.14150, 2021	35	2021

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs