Suriya Gunasekar
MSR Redmond
Verified email at ttic.edu
Title
Cited by
Year
The Implicit Bias of Gradient Descent on Separable Data
D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro
The Journal of Machine Learning Research, 2018
Cited by: 1024, Year: 2018
Implicit regularization in matrix factorization
S Gunasekar, BE Woodworth, S Bhojanapalli, B Neyshabur, N Srebro
Advances in Neural Information Processing Systems, 6151-6159, 2017
Cited by: 556, Year: 2017
Characterizing Implicit Bias in Terms of Optimization Geometry
S Gunasekar, J Lee, D Soudry, N Srebro
Proceedings of the 35th International Conference on Machine Learning 80 …, 2018
Cited by: 476, Year: 2018
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 474, Year: 2024
Textbooks Are All You Need
S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ...
arXiv preprint arXiv:2306.11644, 2023
Cited by: 468, Year: 2023
Implicit bias of gradient descent on linear convolutional networks
S Gunasekar, JD Lee, D Soudry, N Srebro
Advances in Neural Information Processing Systems, 9461-9471, 2018
Cited by: 462, Year: 2018
Learning Non-Discriminatory Predictors
B Woodworth, S Gunasekar, MI Ohannessian, N Srebro
Proceedings of the 2017 Conference on Learning Theory (COLT) 35, 2017
Cited by: 449, Year: 2017
Kernel and rich regimes in overparametrized models
B Woodworth, S Gunasekar, JD Lee, E Moroshko, P Savarese, I Golan, ...
Conference on Learning Theory, 3635-3673, 2020
Cited by: 390, Year: 2020
Textbooks Are All You Need II: phi-1.5 technical report
Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee
arXiv preprint arXiv:2309.05463, 2023
Cited by: 334, Year: 2023
Convergence of gradient descent on separable data
MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry
Proceedings of the 22nd International Conference on Artificial Intelligence …, 2019
Cited by: 174, Year: 2019
Implicit bias in deep linear classification: Initialization scale vs training accuracy
E Moroshko, BE Woodworth, S Gunasekar, JD Lee, N Srebro, D Soudry
Advances in Neural Information Processing Systems 33, 22182-22193, 2020
Cited by: 92, Year: 2020
Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning
Y Jiang, P Natekar, M Sharma, SK Aithal, D Kashyap, N Subramanyam, ...
NeurIPS 2020 Competition and Demonstration Track, 170-190, 2021
Cited by: 83*, Year: 2021
Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models
MS Nacson, S Gunasekar, JD Lee, N Srebro, D Soudry
Proceedings of the 36th International Conference on Machine Learning 97, 2019
Cited by: 77, Year: 2019
Review quality aware collaborative filtering
S Raghavan, S Gunasekar, J Ghosh
Proceedings of the Sixth ACM Conference on Recommender Systems, 123-130, 2012
Cited by: 72, Year: 2012
Unveiling Transformers with LEGO: a synthetic reasoning task
Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner
arXiv preprint arXiv:2206.04301, 2022
Cited by: 68, Year: 2022
Data Augmentation as Feature Manipulation
R Shen, S Bubeck, S Gunasekar
International Conference on Machine Learning, 19773-19808, 2022
Cited by: 63, Year: 2022
Exponential family matrix completion under structural constraints
S Gunasekar, P Ravikumar, J Ghosh
International Conference on Machine Learning, 1917-1925, 2014
Cited by: 58, Year: 2014
Mirrorless Mirror Descent: A Natural Derivation of Mirror Descent
S Gunasekar, B Woodworth, N Srebro
International Conference on Artificial Intelligence and Statistics, 2305-2313, 2021
Cited by: 48*, Year: 2021
Noisy matrix completion using alternating minimization
S Gunasekar, A Acharya, N Gaur, J Ghosh
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2013
Cited by: 45, Year: 2013
Face detection on distorted images augmented by perceptual quality-aware features
S Gunasekar, J Ghosh, AC Bovik
IEEE Transactions on Information Forensics and Security 9 (12), 2119-2131, 2014
Cited by: 39, Year: 2014