Understanding the Dynamics of Gradient Flow in Overparameterized Linear models S Tarmoun, G Franca, BD Haeffele, R Vidal International Conference on Machine Learning, 10153-10161, 2021 | 58 | 2021 |
On the explicit role of initialization on the convergence and implicit bias of overparametrized linear networks H Min, S Tarmoun, R Vidal, E Mallada International Conference on Machine Learning, 7760-7768, 2021 | 52 | 2021 |
Linear Convergence of Gradient Descent for Finite Width Over-parametrized Linear Networks with General Initialization Z Xu, H Min, S Tarmoun, E Mallada, R Vidal International Conference on Artificial Intelligence and Statistics, 2262-2284, 2023 | 6 | 2023 |
A LOCAL POLYAK-ŁOJASIEWICZ AND DESCENT LEMMA OF GRADIENT DESCENT FOR OVERPARAMETERIZED LINEAR MODELS Z Xu, H Min, S Tarmoun, E Mallada, R Vidal | | 2023 |
Gradient Preconditioning for Non-Lipschitz smooth Nonconvex Optimization S Tarmoun, S Slocum, BD Haeffele, R Vidal | | 2022 |
Implicit Acceleration of Gradient Flow in Overparameterized Linear Models S Tarmoun, G França, BD Haeffele, R Vidal | | 2020 |
On the Explicit Role of Initialization on the Convergence and Generalization Properties of Overparametrized Linear Networks H Min, S Tarmoun, R Vidal, E Mallada | | 2020 |
Learning Dynamics and Implicit Bias of Gradient Flow in Overparameterized Linear Models R Vidal, S Tarmoun, H Min, B Haeffele, E Mallada, G Franca 2023 Joint Mathematics Meetings (JMM 2023), 0 | | |