Matei Zaharia
UC Berkeley and Databricks
Verified email at berkeley.edu
Title
Cited by
Year
A view of cloud computing
M Armbrust, A Fox, R Griffith, AD Joseph, R Katz, A Konwinski, G Lee, ...
Communications of the ACM 53 (4), 50-58, 2010
Cited by: 14632 | Year: 2010
Spark: Cluster computing with working sets
M Zaharia, M Chowdhury, MJ Franklin, S Shenker, I Stoica
2nd USENIX workshop on hot topics in cloud computing (HotCloud 10), 2010
Cited by: 12343* | Year: 2010
Above the Clouds: A Berkeley View of Cloud Computing
M Armbrust
Cited by: 8808 | Year: 2009
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
Cited by: 4102 | Year: 2021
Apache Spark: a unified engine for big data processing
M Zaharia, RS Xin, P Wendell, T Das, M Armbrust, A Dave, X Meng, ...
Communications of the ACM 59 (11), 56-65, 2016
Cited by: 3138 | Year: 2016
Mesos: A platform for fine-grained resource sharing in the data center
B Hindman, A Konwinski, M Zaharia, A Ghodsi, AD Joseph, R Katz, ...
8th USENIX Symposium on Networked Systems Design and Implementation (NSDI 11), 2011
Cited by: 2618 | Year: 2011
MLlib: Machine learning in Apache Spark
X Meng, J Bradley, B Yavuz, E Sparks, S Venkataraman, D Liu, ...
Journal of Machine Learning Research 17 (34), 1-7, 2016
Cited by: 2460 | Year: 2016
Improving MapReduce performance in heterogeneous environments.
M Zaharia, A Konwinski, AD Joseph, RH Katz, I Stoica
OSDI 8 (4), 7, 2008
Cited by: 2457 | Year: 2008
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling
M Zaharia, D Borthakur, J Sen Sarma, K Elmeleegy, S Shenker, I Stoica
Proceedings of the 5th European conference on Computer systems, 265-278, 2010
Cited by: 2023 | Year: 2010
Spark SQL: Relational data processing in Spark
M Armbrust, RS Xin, C Lian, Y Huai, D Liu, JK Bradley, X Meng, T Kaftan, ...
Proceedings of the 2015 ACM SIGMOD international conference on management of …, 2015
Cited by: 1915 | Year: 2015
Dominant resource fairness: Fair allocation of multiple resource types
A Ghodsi, M Zaharia, B Hindman, A Konwinski, S Shenker, I Stoica
8th USENIX symposium on networked systems design and implementation (NSDI 11), 2011
Cited by: 1684 | Year: 2011
Discretized streams: Fault-tolerant streaming computation at scale
M Zaharia, T Das, H Li, T Hunter, S Shenker, I Stoica
Proceedings of the twenty-fourth ACM symposium on operating systems …, 2013
Cited by: 1461 | Year: 2013
ColBERT: Efficient and effective passage search via contextualized late interaction over BERT
O Khattab, M Zaharia
Proceedings of the 43rd International ACM SIGIR conference on research and …, 2020
Cited by: 1243 | Year: 2020
PipeDream: Generalized pipeline parallelism for DNN training
D Narayanan, A Harlap, A Phanishayee, V Seshadri, NR Devanur, ...
Proceedings of the 27th ACM symposium on operating systems principles, 1-15, 2019
Cited by: 875 | Year: 2019
Managing data transfers in computer clusters with Orchestra
M Chowdhury, M Zaharia, J Ma, MI Jordan, I Stoica
SIGCOMM 41 (4), 2011
Cited by: 818 | Year: 2011
Sparrow: distributed, low latency scheduling
K Ousterhout, P Wendell, M Zaharia, I Stoica
Proceedings of the twenty-fourth ACM symposium on operating systems …, 2013
Cited by: 815 | Year: 2013
Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters
M Zaharia, T Das, H Li, S Shenker, I Stoica
Proceedings of the 4th USENIX conference on Hot Topics in Cloud Computing, 10-10, 2012
Cited by: 798 | Year: 2012
Learning Spark: lightning-fast big data analysis
H Karau, A Konwinski, P Wendell, M Zaharia
O'Reilly Media, Inc., 2015
Cited by: 725 | Year: 2015
Shark: SQL and rich analytics at scale
RS Xin, J Rosen, M Zaharia, MJ Franklin, S Shenker, I Stoica
Proceedings of the 2013 ACM SIGMOD International Conference on Management of …, 2013
Cited by: 661 | Year: 2013
Efficient large-scale language model training on GPU clusters using Megatron-LM
D Narayanan, M Shoeybi, J Casper, P LeGresley, M Patwary, ...
Proceedings of the International Conference for High Performance Computing …, 2021
Cited by: 614 | Year: 2021