Obserwuj
Alexander William Bukharin
Alexander William Bukharin
Zweryfikowany adres z gatech.edu - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Adalora: Adaptive budget allocation for parameter-efficient fine-tuning
Q Zhang, M Chen, A Bukharin, N Karampatziakis, P He, Y Cheng, ...
arXiv preprint arXiv:2303.10512, 2023
5212023
Platon: Pruning large transformer models with upper confidence bound of weight importance
Q Zhang, S Zuo, C Liang, A Bukharin, P He, W Chen, T Zhao
International conference on machine learning, 26809-26823, 2022
962022
High-resolution spatio-temporal model for county-level COVID-19 activity in the US
S Zhu, A Bukharin, L Xie, M Santillana, S Yang, Y Xie
ACM Transactions on Management Information Systems (TMIS) 12 (4), 1-20, 2021
282021
Helpsteer2-preference: Complementing ratings with preferences
Z Wang, A Bukharin, O Delalleau, D Egert, G Shen, J Zeng, O Kuchaiev, ...
arXiv preprint arXiv:2410.01257, 2024
212024
Data diversity matters for robust instruction tuning
A Bukharin, T Zhao
EMNLP 2024, 2023
172023
Robust multi-agent reinforcement learning via adversarial regularization: Theoretical foundation and stable algorithms
A Bukharin, Y Li, Y Yu, Q Zhang, Z Chen, S Zuo, C Zhang, S Zhang, ...
Advances in Neural Information Processing Systems 36, 68121-68133, 2023
132023
Early detection of COVID-19 hotspots using spatio-temporal data
S Zhu, A Bukharin, L Xie, K Yamin, S Yang, P Keskinocak, Y Xie
IEEE Journal of Selected Topics in Signal Processing 16 (2), 250-260, 2022
122022
Five-year project-level statewide pavement performance forecasting using a two-stage machine learning approach based on long short-term memory
AW Bukharin, Z Yang, Y Tsai
Transportation Research Record 2675 (11), 280-290, 2021
102021
Adalora: adaptive budget allocation for parameter-efficient fine-tuning (2023)
Q Zhang, M Chen, A Bukharin, N Karampatziakis, P He, Y Cheng, ...
URL https://arxiv. org/abs/2303.10512, 0
8
Ambient noise-based weakly supervised manhole localization methods over deployed fiber networks
A Bukharin, S Han, Y Chen, MF Huang, YK Huang, Y Xie, T Wang
Optics Express 31 (6), 9591-9607, 2023
52023
Data-driven optimization for police beat design in south fulton, georgia
S Zhu, AW Bukharin, L Lu, H Wang, Y Xie
arXiv preprint arXiv:2004.09660, 2020
52020
Deep reinforcement learning from hierarchical weak preference feedback
A Bukharin, Y Li, P He, W Chen, T Zhao
arXiv preprint arXiv:2309.02632, 2023
32023
RNR: Teaching large language models to follow roles and rules
K Wang, A Bukharin, H Jiang, Q Yin, Z Wang, T Zhao, J Shang, C Zhang, ...
arXiv preprint arXiv:2409.13733, 2024
22024
Robust Reinforcement Learning from Corrupted Human Feedback
A Bukharin, I Hong, H Jiang, Z Li, Q Zhang, Z Zhang, T Zhao
NEURIPS 2024, 2024
22024
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
I Hong, Z Li, A Bukharin, Y Li, H Jiang, T Yang, T Zhao
NEURIPS 2024, 2024
22024
Machine learning force fields with data cost aware training
A Bukharin, T Liu, S Wang, S Zuo, W Gao, W Yan, T Zhao
International Conference on Machine Learning, 3219-3232, 2023
22023
Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment
S Sun, Y Zhang, A Bukharin, D Mosallanezhad, J Zeng, S Singhal, ...
arXiv preprint arXiv:2502.00203, 2025
2025
Deep Reinforcement Learning from Hierarchical Preference Design
A Bukharin, Y Li, P He, T Zhao
arXiv preprint arXiv:2309.02632, 2023
2023
Data-Driven Optimization for Police Districting in South Fulton, Georgia
S Zhu, A Bukharin, L Lu, H Wang, Y Xie
KDD Workshop on Data Science for Social Good, 2021
2021
Overparameterization and Efficient Adversarial Training of Neural Networks
K Acharya, A Bukharin, T LaBonte
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20