Shipra Agrawal

Cited by

	All	Since 2019
Citations	7127	5195
h-index	29	23
i10-index	40	31

Citations per year

Year	Citations
2006	32
2007	24
2008	39
2009	52
2010	62
2011	67
2012	67
2013	95
2014	145
2015	184
2016	261
2017	336
2018	443
2019	611
2020	803
2021	912
2022	940
2023	932
2024	995

Public access

7 articles available
1 article not available

Based on funding mandates

Co-authors

Name	Affiliation	Verified email domain
Yinyu Ye	K.T. Li Professor of Engineering, Stanford University	stanford.edu
Nikhil R. Devanur	Amazon	nikhildevanur.com
Jayant Haritsa	Indian Institute of Science	iisc.ac.in
Zizhuo Wang	The Chinese University of Hong Kong, Shenzhen / Cardinal Operations	cuhk.edu.cn
Rajeev Rastogi	Amazon	amazon.com
Amin Saberi	Professor, Stanford University	stanford.edu
Vijay Krishnan	IIT Bombay, Stanford University, https://turing.com	cs.stanford.edu
Yichuan Ding	Desautels Faculty of Management, McGill University	mcgill.ca
Lihong Li (李力鸿)	Amazon	amazon.com
Tomáš Kocák	University of Potsdam	uni-potsdam.de
Michal Valko	Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind	meta.com
Rémi Munos	Google DeepMind	inria.fr
Erick Delage	Professor, Department of Decision Sciences, HEC Montréal	hec.ca
Supratim Deb	Technical Lead @ Meta	fb.com
B. Aditya Prakash	Associate Professor, Georgia Institute of Technology	cs.cmu.edu
Nimrod Megiddo	Distinguished Research Staff Member, IBM Almaden Research Center	us.ibm.com

Shipra Agrawal

Columbia University

Verified email at columbia.edu

multi-armed bandits, reinforcement learning, online and stochastic optimization


Title	Cited by	Year
Analysis of Thompson sampling for the multi-armed bandit problem. S Agrawal, N Goyal. Conference on Learning Theory, 39.1-39.26, 2012	1618	2012
Thompson sampling for contextual bandits with linear payoffs. S Agrawal, N Goyal. International Conference on Machine Learning, 127-135, 2013	1249	2013
Near-optimal regret bounds for Thompson sampling. S Agrawal, N Goyal. Journal of the ACM (JACM) 64 (5), 1-24, 2017	698	2017
A dynamic near-optimal algorithm for online linear programming. S Agrawal, Z Wang, Y Ye. Operations Research 62 (4), 876-890, 2014	356	2014
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds. S Agrawal, R Jia. Advances in Neural Information Processing Systems 30, 2017	251	2017
A framework for high-accuracy privacy-preserving mining. S Agrawal, JR Haritsa. 21st International Conference on Data Engineering (ICDE'05), 193-204, 2005	244	2005
A near-optimal exploration-exploitation approach for assortment selection. S Agrawal, V Avadhanula, V Goyal, A Zeevi. Proceedings of the 2016 ACM Conference on Economics and Computation, 599-600, 2016	240*	2016
Bandits with concave rewards and convex knapsacks. S Agrawal, NR Devanur. Proceedings of the Fifteenth ACM Conference on Economics and Computation …, 2014	238	2014
Reinforcement learning for integer programming: Learning to cut. Y Tang, S Agrawal, Y Faenza. International Conference on Machine Learning, 9367-9376, 2020	232	2020
Fast algorithms for online stochastic convex programming. S Agrawal, NR Devanur. SODA 2015, 2015	201	2015
Price of correlations in stochastic optimization. S Agrawal, Y Ding, A Saberi, Y Ye. Operations Research 60 (1), 150-162, 2012	175*	2012
Linear contextual bandits with knapsacks. S Agrawal, N Devanur. Advances in Neural Information Processing Systems 29, 2016	174	2016
Bandits with delayed, aggregated anonymous feedback. C Pike-Burke, S Agrawal, C Szepesvari, S Grunewalder. International Conference on Machine Learning, 4105-4113, 2018	140	2018
Discretizing continuous action space for on-policy optimization. Y Tang, S Agrawal. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5981-5988, 2020	139	2020
Thompson sampling for the MNL-bandit. S Agrawal, V Avadhanula, V Goyal, A Zeevi. Conference on Learning Theory, 76-78, 2017	129	2017
An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives. S Agrawal, NR Devanur, L Li. Conference on Learning Theory, 4-18, 2016	112	2016
On addressing efficiency concerns in privacy-preserving mining. S Agrawal, V Krishnan, JR Haritsa. Database Systems for Advanced Applications: 9th International Conference …, 2004	111	2004
Learning in structured MDPs with convex cost functions: Improved regret bounds for inventory management. S Agrawal, R Jia. Proceedings of the 2019 ACM Conference on Economics and Computation, 743-744, 2019	85	2019
Efficient detection of distributed constraint violations. S Agrawal, S Deb, KVM Naidu, R Rastogi. 2007 IEEE 23rd International Conference on Data Engineering, 1320-1324, 2006	76	2006
A unified framework for dynamic prediction market design. S Agrawal, E Delage, M Peters, Z Wang, Y Ye. Operations Research 59 (3), 550-568, 2011	68*	2011

Articles 1–20
