フォロー
Adil Salim
Adil Salim
Microsoft Research
確認したメール アドレス: microsoft.com - ホームページ
タイトル
引用先
引用先
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
arXiv preprint arXiv:2404.14219, 2024
9522024
Textbooks are all you need
S Gunasekar, Y Zhang, J Aneja, CCT Mendes, A Del Giorno, S Gopi, ...
arXiv preprint arXiv:2306.11644, 2023
5852023
Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions
S Chen, S Chewi, J Li, Y Li, A Salim, AR Zhang
The Eleventh International Conference on Learning Representations, 2022
3172022
Phi-2: The surprising power of small language models
M Javaheripi, S Bubeck, M Abdin, J Aneja, S Bubeck, CCT Mendes, ...
Microsoft Research Blog 1 (3), 3, 2023
2282023
Maximum mean discrepancy gradient flow
M Arbel, A Korba, A Salim, A Gretton
Advances in Neural Information Processing Systems 32, 2019
1702019
The probability flow ODE is provably fast
S Chen, S Chewi, H Lee, Y Li, J Lu, A Salim
Advances in Neural Information Processing Systems 36, 2023
1052023
A non-asymptotic analysis for Stein variational gradient descent
A Korba, A Salim, M Arbel, G Luise, A Gretton
Advances in Neural Information Processing Systems 33, 4672-4682, 2020
952020
Optimal and practical algorithms for smooth and strongly convex decentralized optimization
D Kovalev, A Salim, P Richtárik
Advances in Neural Information Processing Systems 33, 18342-18352, 2020
942020
Towards a theory of non-log-concave sampling: first-order stationarity guarantees for Langevin Monte Carlo
K Balasubramanian, S Chewi, MA Erdogdu, A Salim, S Zhang
Conference on Learning Theory, 2896-2923, 2022
762022
Improved analysis for a proximal algorithm for sampling
Y Chen, S Chewi, A Salim, A Wibisono
Conference on Learning Theory, 2984-3014, 2022
702022
The Wasserstein proximal gradient algorithm
A Salim, A Korba, G Luise
Advances in Neural Information Processing Systems 33, 12356-12366, 2020
642020
Phi-4 technical report
M Abdin, J Aneja, H Behl, S Bubeck, R Eldan, S Gunasekar, M Harrison, ...
arXiv preprint arXiv:2412.08905, 2024
542024
Dualize, split, randomize: Toward fast nonsmooth optimization algorithms
A Salim, L Condat, K Mishchenko, P Richtárik
Journal of Optimization Theory and Applications 195 (1), 102-130, 2022
442022
Primal dual interpretation of the proximal stochastic gradient Langevin algorithm
A Salim, P Richtarik
Advances in Neural Information Processing Systems 33, 3786-3796, 2020
432020
Forward-backward Gaussian variational inference via JKO in the Bures-Wasserstein space
MZ Diao, K Balasubramanian, S Chewi, A Salim
International Conference on Machine Learning, 7960-7991, 2023
372023
Phi-3 technical report: A highly capable language model locally on your phone, 2024
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
URL https://arxiv. org/abs/2404.14219, 2024
362024
An optimal algorithm for strongly convex minimization under affine constraints
A Salim, L Condat, D Kovalev, P Richtárik
International conference on artificial intelligence and statistics, 4482-4498, 2022
312022
Stochastic proximal Langevin algorithm: Potential splitting and nonasymptotic rates
A Salim, D Kovalev, P Richtárik
Advances in Neural Information Processing Systems 32, 2019
312019
A convergence theory for SVGD in the population limit under Talagrand’s inequality T1
A Salim, L Sun, P Richtarik
International Conference on Machine Learning, 19139-19152, 2022
30*2022
A constant step Forward-Backward algorithm involving random maximal monotone operators
P Bianchi, W Hachem, A Salim
Journal of Convex Analysis 26 (2), 387-436, 2019
302019
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20