Esha Choukse
Microsoft Research
Verified email at utexas.edu - Homepage
Title
Cited by
Year
Bit-plane compression: Transforming data for better compression in many-core architectures
J Kim, M Sullivan, E Choukse, M Erez
ACM SIGARCH Computer Architecture News 44 (3), 329-340, 2016
Cited by: 121, Year: 2016
Splitwise: Efficient generative LLM inference using phase splitting
P Patel, E Choukse, C Zhang, A Shah, Í Goiri, S Maleki, R Bianchini
2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024
Cited by: 118, Year: 2024
PruneTrain: fast neural network training by dynamic sparse model reconfiguration
S Lym, E Choukse, S Zangeneh, W Wen, S Sanghavi, M Erez
Proceedings of the International Conference for High Performance Computing …, 2019
Cited by: 100, Year: 2019
Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs
E Choukse, M Sullivan, M O'Connor, M Erez, J Pool, D Nellans, S Keckler
47th International Symposium on Computer Architecture (ISCA 2020), 2020
Cited by: 58, Year: 2020
Towards greener LLMs: Bringing energy-efficiency to the forefront of LLM inference
J Stojkovic, E Choukse, C Zhang, Í Goiri, J Torrellas
arXiv preprint arXiv:2403.20306, 2024
Cited by: 45, Year: 2024
Compresso: Pragmatic main memory compression
E Choukse, M Erez, AR Alameldeen
(MICRO) 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018
Cited by: 39, Year: 2018
Splitwise: Efficient generative LLM inference using phase splitting. In 2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture (ISCA)
P Patel, E Choukse, C Zhang, A Shah, Í Goiri, S Maleki, R Bianchini
IEEE Computer Society, Los Alamitos, CA, USA, 118-132, 2024
Cited by: 35, Year: 2024
Characterizing power management opportunities for LLMs in the cloud
P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ...
Proceedings of the 29th ACM International Conference on Architectural …, 2024
Cited by: 33, Year: 2024
DynamoLLM: Designing LLM inference clusters for performance and energy efficiency
J Stojkovic, C Zhang, Í Goiri, J Torrellas, E Choukse
arXiv preprint arXiv:2408.00741, 2024
Cited by: 25, Year: 2024
Towards improved power management in cloud GPUs
P Patel, Z Gong, S Rizvi, E Choukse, P Misra, T Anderson, A Sriraman
IEEE Computer Architecture Letters 22 (2), 141-144, 2023
Cited by: 20, Year: 2023
PruneTrain: Gradual structured pruning from scratch for faster neural network training
S Lym, E Choukse, S Zangeneh, W Wen, M Erez, S Sanghavi
arXiv preprint arXiv:1901.09290, 2019
Cited by: 18, Year: 2019
Designing cloud servers for lower carbon
J Wang, DS Berger, F Kazhamiaka, C Irvene, C Zhang, E Choukse, ...
2024 ACM/IEEE 51st Annual International Symposium on Computer Architecture …, 2024
Cited by: 16, Year: 2024
Myths and misconceptions around reducing carbon embedded in cloud platforms
J Lyu, J Wang, K Frost, C Zhang, C Irvene, E Choukse, R Fonseca, ...
Proceedings of the 2nd Workshop on Sustainable Computer Systems, 1-7, 2023
Cited by: 16, Year: 2023
Making kernel bypass practical for the cloud with Junction
J Fried, GI Chaudhry, E Saurez, E Choukse, Í Goiri, S Elnikety, R Fonseca, ...
21st USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2024
Cited by: 14, Year: 2024
Overclocking in immersion-cooled datacenters
PA Misra, I Manousakis, E Choukse, M Jalili, Í Goiri, A Raniwala, ...
IEEE Micro 42 (4), 10-17, 2022
Cited by: 13, Year: 2022
POLCA: Power oversubscription in LLM cloud providers
P Patel, E Choukse, C Zhang, Í Goiri, B Warrier, N Mahalingam, ...
arXiv preprint arXiv:2308.12908, 2023
Cited by: 11, Year: 2023
CompressPoints: An Evaluation Methodology for Compressed Memory Systems
E Choukse, M Erez, AR Alameldeen
IEEE Computer Architecture Letters, 2018
Cited by: 10, Year: 2018
Bit-Plane Compression: Transforming Data for Better Compression in Many-Core Architectures. In 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA …
J Kim, M Sullivan, E Choukse, M Erez
IEEE, 2016
Cited by: 10, Year: 2016
Translation-Optimized Memory Compression for Capacity
CJVT Gagandeep Panwar, Muhammad Laghari, David Bears, Yuqing Liu, ...
55th IEEE/ACM International Symposium on Microarchitecture, 2022
Cited by: 8*, Year: 2022
Intelligent router for LLM workloads: Improving performance through workload-aware scheduling
K Jain, A Parayil, A Mallick, E Choukse, X Qin, J Zhang, Í Goiri, R Wang, ...
arXiv preprint arXiv:2408.13510, 2024
Cited by: 5, Year: 2024