Presentation: When Streams Fail: Kafka Off the Shore
Abstract
How good is your streaming framework at failure? Does it die gracefully telling you exactly at which point it died? Does it tell you why it died? Does it pick-up where it left off afterwards? Can it easily skip the "erroneous" portions of the stream? Do you always know what was processed and what wasn't? Does it even have to die when process, host, data-center fail?
In this talk we focus on "What Ifs" scenarios and how to evaluate and architect a streaming platform that has high level of resilience. We'll look at Kafka and Spark Streaming as specific examples and share our experience of using these frameworks to process financial transactions answering the questions above along the way. We'll also show examples of tools that we built along our streaming journey which we found invaluable during failure scenarios.
Similar Talks
Scaling DB Access for Billions of Queries Per Day @PayPal
Software Engineer @PayPal
Petrica Voicu
Inside Job: How to Build Great Teams Within a Legacy Organization?
Engineering Director @Meetup
Francisco Trindade
A Dive Into Streams @LinkedIn With Brooklin
Data Infrastructure @LinkedIn
Celia Kung
Self-Selection for Resilience and Better Culture
Agile/DevOps Trainer & Founder of Agile Play Consulting, LLC
Dana Pylayeva
CockroachDB: Architecture of a Geo-Distributed SQL Database
CockroachDB maintainer, Co-founder & CTO @CockroachDB
Peter Mattis
Breaking Hierarchy - How Spotify Enables Engineer Decision Making
Senior Engineering Manager, Data and Machine Learning Infrastructure @Spotify
Kristian Lindwall
Context Matters: Improving the Performance and Wellbeing of Teams
Director of IT @Etsy
Shawn Carney
Maintaining the Go Crypto Libraries
Cryptogopher @Google