CCC Colloquium: Grzegorz Malewicz (March 1, 2010)
Models of Computation and Realization Challenges in the Cloud
Grzegorz Malewicz (Google, Inc.)
Monday, March 1, 2010
10am, Hornbake 2119
Many companies, including Google, operate datacenters consisting of networked commodity computers. Solving practical computational problems on such datacenters can be challenging because of data imbalances, and computer delays and failures. Many models of computation have complicated semantics, making programming difficult, and some theoretical models do not have any scalable and efficient realization that is suitable for industrial use. Google has introduced several models of computation that meet the challenges. In this talk I will describe these models, the challenges in implementing them, and the techniques that led to the first successful sort of 1PB of data in 6h 2m.
About the Speaker
Greg Malewicz received the BA degrees in computer science and in applied mathematics in 1996 and 1998, respectively, and the MS degree in computer science in 1998, all from the University of Warsaw. He received the PhD degree in computer science from the University of Connecticut in 2003 with his last year at Massachusetts Institute of Technology. He is a Staff Software Engineer at Google designing simple and expressive models of computation and realizing them as scalable systems so as to make data processing in the cloud simple. He co-founded the Pregel project for graph processing and earlier worked on MapReduce, which lead to the first successful 1PB sort (both projects are team efforts). He has had internships at the AT&T Shannon Laboratory (summer 2001) and Microsoft Corp. (summer 2000 and fall 2001). He was a visiting scientist at the University of Massachusetts, Amherst (summer 2004) and Argonne National Laboratory (summer 2005), and an assistant professor at the University of Alabama, where he taught theoretical computer science (2003 until 2005). His research focuses on high-performance parallel and distributed computing, experimental and theoretical algorithmics, combinatorial optimization, and scheduling. His research appears in top journals and conferences and includes a singly authored SIAM Journal on Computing paper that solves a decade-old problem in distributed computing.
Greg is an avid traveler. His bicycle and friends toured the Himalayas, the Andes and the Alps, including Khardung La, which he cycled as the first Pole. Check out Greg's photos from bicycle and other trips. Greg likes making friends! You can reach me at gmalewicz at gmail.
This talk is open to the public and will take place in the Hornbake Building, South Wing, at the University of Maryland, College Park. Directions to campus can be found here and campus maps can be found here.