Matei Zaharia

Alma mater: UC Berkeley (Ph.D.), University of Waterloo
Known for: Apache Spark
Scientific career
Fields: Computer science
Institutions: Stanford University
Thesis: An Architecture for Fast and General Data Processing on Large Clusters (2013)
Doctoral advisors: Ion Stoica, Scott Shenker

Matei Zaharia is a Romanian-Canadian computer scientist and the creator of Apache Spark.[1][2][3]

Zaharia was an undergraduate at the University of Waterloo.[4] While at the AMPLab at the University of California, Berkeley, in 2009, he created Apache Spark as a faster alternative to MapReduce.[5] He received the 2015 ACM Doctoral Dissertation Award for his PhD research on large-scale computing.

In 2013, Zaharia co-founded Databricks, where he serves as Chief Technology Officer.[2]

He joined the faculty of MIT in 2015, and then became an assistant professor of computer science at Stanford University in 2016.[6]

In 2019, Zaharia received the Presidential Early Career Award for Scientists and Engineers.[4]

In 2019, he was leading the MLflow project at Databricks while continuing to teach.[7][8][9]

Zaharia is also a past gold medalist at the International Collegiate Programming Contest, where his University of Waterloo team placed 4th in the world and 1st in North America in 2005.[10]


  1. ^ Fiscutean, Andrada (August 20, 2019). "Why the US has lost to Russia in these top coding trials for almost a decade". ZDNet.
  2. ^ a b "Meet the 'nerdiest rock star': Matei Zaharia co-creator of Apache Spark | Computing". 2015-10-29. Retrieved 2019-12-03.
  3. ^ Piatetsky, Gregory (May 2015). "Exclusive Interview: Matei Zaharia, creator of Apache Spark, on Spark, Hadoop, Flink, and Big Data in 2020".
  4. ^ a b Iyer, Kavya (July 26, 2019). "Twelve Stanford researchers receive Presidential Early Career Award for Scientists and Engineers". Stanford Daily.
  5. ^ Woodie, Alex (March 8, 2019). "A Decade Later, Apache Spark Still Going Strong". Datanami.
  6. ^ "Matei Zaharia receives ACM Doctoral Dissertation award". MIT EECS. April 28, 2015.
  7. ^ Brust, Andrew (June 6, 2019). "AI gets rigorous: Databricks announces MLflow 1.0". ZDNet.
  8. ^ Anadiotis, George. "Unifying cloud storage and data warehouses: Delta Lake project hosted by the Linux Foundation". ZDNet. Retrieved 2019-12-03.
  9. ^ Woodie, Alex (2019-12-02). "Will Databricks Build the First Enterprise AI Platform?". Datanami. Retrieved 2019-12-03.
  10. ^ Zaharia, Matei. "Programming Contest Resources". Retrieved 2020-04-22.