Presentation: Peloton - Uber's Webscale Unified Scheduler on Mesos & Kubernetes
This presentation is now available to view on InfoQ.com
Watch video with transcriptAbstract
With the increasing scale of Uber’s business, efficient use of cluster resources is important to reduce the cost per trip. As we have learned when operating Mesos clusters in production, it is a challenge to overcommit resources for latency-sensitive services due to their large spread of resource usage patterns. Uber also has significant demand on running large-scale batch jobs for marketplace intelligence, fraud detection, maps, self-driving vehicles etc.
In this talk, we will present Peloton, a Unified Resource Scheduler for collocating heterogeneous workloads in shared Mesos clusters. The goal of Peloton is to manage compute resources more efficiently while providing hierarchical max-min fairness guarantees for different teams. Peloton schedules large-scale batch jobs with millions of tasks and also supports distributed TensorFlow jobs with thousands of GPUs.
Similar Talks
Psychologically Safe Process Evolution in a Flat Structure
Director of Software Development @Hunter_Ind
Christopher Lucian
Not Sold Yet, GraphQL: A Humble Tale From Skeptic to Enthusiast
Software Engineer @Netflix
Garrett Heinlen
Let's talk locks!
Software Engineer @Samsara
Kavya Joshi
How Did Things Go Right? Learning More From Incidents
Site Reliability Engineering @Netflix
Ryan Kitchens
Graceful Degradation as a Feature
Director of Product @GremlinInc
Lorne Kligerman
Liberating Structures @CapitalOne
Agile Coach, Engineering @CapitalOne
Greg Myers
Making 'npm install' Safe
Software Engineer @agoric
Kate Sills
Driving Technology Transformation at @WeWork
Fellow Engineer, Developer Platform @WeWork
Hugo Haas
High Performance Remote and Distributed Teams
VP Engineering @WeWork