Suriya Gunasekar
MSR Redmond
Verified email at ttic.edu
Title
Cited by
Year
The Implicit Bias of Gradient Descent on Separable Data
D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro
The Journal of Machine Learning Research, 2018
Cited by: 1057 | Year: 2018
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 952 | Year: 2024
Textbooks Are All You Need
S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ...
arXiv preprint arXiv:2306.11644, 2023
Cited by: 585 | Year: 2023
Implicit regularization in matrix factorization
S Gunasekar, BE Woodworth, S Bhojanapalli, B Neyshabur, N Srebro
Advances in Neural Information Processing Systems, 6151-6159, 2017
Cited by: 578 | Year: 2017
Characterizing Implicit Bias in Terms of Optimization Geometry
S Gunasekar, J Lee, D Soudry, N Srebro
Proceedings of the 35th International Conference on Machine Learning 80 …, 2018
Cited by: 505 | Year: 2018
Implicit bias of gradient descent on linear convolutional networks
S Gunasekar, JD Lee, D Soudry, N Srebro
Advances in Neural Information Processing Systems, 9461-9471, 2018
Cited by: 481 | Year: 2018
Learning Non-Discriminatory Predictors
B Woodworth, S Gunasekar, MI Ohannessian, N Srebro
Proceedings of the 2017 Conference on Learning Theory (COLT) 35, 2017
Cited by: 461 | Year: 2017
Textbooks Are All You Need II: phi-1.5 technical report
Y Li, S Bubeck, R Eldan, A Del Giorno, S Gunasekar, YT Lee
arXiv preprint arXiv:2309.05463, 2023
Cited by: 435 | Year: 2023
Kernel and rich regimes in overparametrized models
B Woodworth, S Gunasekar, JD Lee, E Moroshko, P Savarese, I Golan, ...
Conference on Learning Theory, 3635-3673, 2020
Cited by: 429 | Year: 2020
Convergence of gradient descent on separable data
MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry
Proceedings of the 22nd International Conference on Artificial Intelligence …, 2019
Cited by: 181 | Year: 2019
Implicit bias in deep linear classification: Initialization scale vs training accuracy
E Moroshko, BE Woodworth, S Gunasekar, JD Lee, N Srebro, D Soudry
Advances in Neural Information Processing Systems 33, 22182-22193, 2020
Cited by: 98 | Year: 2020
Methods and Analysis of The First Competition in Predicting Generalization of Deep Learning
Y Jiang, P Natekar, M Sharma, SK Aithal, D Kashyap, N Subramanyam, ...
NeurIPS 2020 Competition and Demonstration Track, 170-190, 2021
Cited by: 86* | Year: 2021
Lexicographic and Depth-Sensitive Margins in Homogeneous and Non-Homogeneous Deep Models
MS Nacson, S Gunasekar, JD Lee, N Srebro, D Soudry
Proceedings of the 36th International Conference on Machine Learning 97, 2019
Cited by: 84 | Year: 2019
Unveiling Transformers with LEGO: a synthetic reasoning task
Y Zhang, A Backurs, S Bubeck, R Eldan, S Gunasekar, T Wagner
arXiv preprint arXiv:2206.04301, 2022
Cited by: 76 | Year: 2022
Review quality aware collaborative filtering
S Raghavan, S Gunasekar, J Ghosh
Proceedings of the sixth ACM conference on Recommender systems, 123-130, 2012
Cited by: 72 | Year: 2012
Data Augmentation as Feature Manipulation
R Shen, S Bubeck, S Gunasekar
International Conference on Machine Learning, 19773-19808, 2022
Cited by: 61 | Year: 2022
Exponential family matrix completion under structural constraints
S Gunasekar, P Ravikumar, J Ghosh
International Conference on Machine Learning, 1917-1925, 2014
Cited by: 61 | Year: 2014
Phi-4 technical report
M Abdin, J Aneja, H Behl, S Bubeck, R Eldan, S Gunasekar, M Harrison, ...
arXiv preprint arXiv:2412.08905, 2024
Cited by: 54 | Year: 2024
Mirrorless Mirror Descent: A Natural Derivation of Mirror Descent
S Gunasekar, B Woodworth, N Srebro
International Conference on Artificial Intelligence and Statistics, 2305-2313, 2021
Cited by: 49* | Year: 2021
Noisy matrix completion using alternating minimization
S Gunasekar, A Acharya, N Gaur, J Ghosh
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2013
Cited by: 44 | Year: 2013