Large-Scale Data Analysis at Cloud Scale

John Wilkes, Principal Software Engineer, Technical Infrastructure, Google

14. 11. 2016, 4pm



Google has been tackling large-scale data problems for more than 15 years. Experience with the systems we built to do so has led us to develop a new set of tools for large-scale analysis and queries, including streaming. I’ll provide an overview of some of the systems we’ve built and are now making available for others to build on.


John Wilkes has been at Google since 2008, where he is working on cluster management for Google’s compute infrastructure; he was one of the architects of Omega. He is interested in far too many aspects of distributed systems, but a recurring theme has been technologies that allow systems to manage themselves.

He received a PhD in computer science from the University of Cambridge, joined HP Labs in 1982, and was elected an HP Fellow and an ACM Fellow in 2002 for his work on storage system design. Along the way, he’s been program committee chair for SOSP, FAST, EuroSys and HotCloud, and has served on the steering committees for EuroSys, FAST, SoCC and HotCloud. He’s listed as an inventor on 40+ US patents, and has an adjunct faculty appointment at Carnegie Mellon University. In his spare time he continues, stubbornly, trying to learn how to blow glass.


Location:  MA 042