In-person + Virtual
October 24-28

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon North America 2022 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is displayed in Eastern Daylight Time (UTC -4). The schedule is subject to change.
Thursday, October 27 • 2:30pm - 3:05pm
Thriving With Kubernetes On-Call: Best Practices & Lessons Learned - Sunil Shah & Ramya Krishnan, Airbnb; Ashley Cutalo, Lyft; Madhu C.S., Robinhood; Fabio Kung, Netflix


Kubernetes clusters are critical infrastructure at large public companies, with large amounts of traffic, complex dependencies on third-party services, and constant change as developers release features and traffic scales up and down. In this panel discussion, engineers from Airbnb, Lyft, Netflix and Robinhood share their challenges, experiences and lessons learned in managing a sustainable on-call rotation that meets the needs of their internal users whilst maintaining high uptime for business-critical workloads. Topics covered will include:
+ Keeping on-call engineers happy
+ Balancing rapid response with alert fatigue
+ Strategies to proactively deal with production issues
+ Preparing engineers for on-call

Speakers
Sunil Shah

Engineering Manager, Airbnb
Sunil Shah is an Engineering Manager at Airbnb. His team builds and maintains the Kubernetes-based platform that powers Airbnb.com. Prior to Airbnb, Sunil managed compute for Yelp, helped commercialise Apache Mesos at Mesosphere, studied robotics at UC Berkeley, and built ingestion...
Ramya Krishnan

Staff Site Reliability Engineer, Airbnb
Ramya Krishnan is a Staff Site Reliability Engineer at Airbnb. She has 15 years of experience working on Unix-based systems, integrating services and improving reliability by undertaking large migrations. For the last four years, she has been helping Airbnb with Kubernetes adoption, scheduling...
Ashley Cutalo

Staff Software Engineer, Lyft
Ashley Cutalo (formerly Kasim) is the Tech Lead of the Application Runtime organization at Lyft and spends her time working on all things infrastructure-related. She recently led Lyft through the transition from legacy infrastructure to running 100% on Kubernetes and is now focused...
Madhu C.S.

Software Engineer, Robinhood Markets
Madhu is the Tech Lead for the Software Platform - Container Orchestration team at Robinhood, which is responsible for developing and operating the Kubernetes-based compute infrastructure for all of Robinhood. Prior to Robinhood, Madhu worked at Google for 8 years where he developed...
Fabio Kung

Staff Software Engineer, Netflix
Fabio Kung is a Staff Software Engineer on the Compute team at Netflix (previously at Heroku), with 13+ years of experience in backend distributed systems and compute infrastructure. Fabio has built and led systems in all infrastructure layers of Cloud Computing, from hardware to...


Thursday October 27, 2022 2:30pm - 3:05pm EDT
Virtual Platform Only
  Reliability + Operational Continuity