DoK Community #41 Designing Stateful Apps for the Cloud and Kubernetes // Evan Chan

DoK Community #41 Designing Stateful Apps for the Cloud and Kubernetes // Evan Chan

Abstract of the talk…

Almost all applications have some kind of state. Some data processing apps and databases have huge amounts of state. How do we navigate a cloud-based world of containers where stateless and functions-as-a-service is all the rage? As a long-time architect, designer, and developer of very stateful apps (databases and data processing apps), I’d like to take you on a journey through the modern cloud world and Kubernetes, offering helpful design patterns, considerations, tips, and where things are going. How is Kubernetes shaking up stateful app design? - What kind of state is there, and what are some important characteristics? - Kubernetes, containers, and the stateless paradigm (pushing state into DBs) - Where state lives and the persistence characteristics - Stateless vs serverless - why stateless is not really stateless, but server less really is - Improving on stateless paradigm using local state pattern - Logs and event streaming for reasoning about state and failure recovery - The case for local disks: ML, Databases, etc. - Kubernetes and the Persistent Volume/StatefulSets - Leveraging Kubernetes PVs as a basis for building distributed data systems - Mapping the solution space

Bio…

Evan has been a distributed systems / data / software engineer for twenty years. He led a team developing FiloDB, an open source (github.com/filodb/FiloDB) distributed time series database that can process a million records per second PER NODE and simultaneously answer a large number of concurrent queries per second. He has architected, developed, and productionized large scale data and telemetry systems at companies including Apple, and loves solving the most challenging technical problems at both large and small scales, from advanced custom data structures to distributed coordination. He is an expert in bleeding edge #jvm #java #scala and #rust performance. Current interests include Rust and columnar compression. He has led the design and implementation of multiple big data platforms based on Apache Storm, Spark, Kafka, Cassandra, and Scala/Akka. He has been an active contributor to the Apache Spark project, and a two-time Datastax Cassandra MVP.

Jaksot(243)

Operating FoundationDB on Kubernetes (DoK Day EU 2022) // Johannes M. Scheuermann

Operating FoundationDB on Kubernetes (DoK Day EU 2022) // Johannes M. Scheuermann

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) FoundationDB is an open-source distributed transactional Key-Value store that is used by ...

27 Touko 20228min

One Click to Run Apache Spark as a Service on Kubernetes (DoK Day EU 2022) // Bo Yang

One Click to Run Apache Spark as a Service on Kubernetes (DoK Day EU 2022) // Bo Yang

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) It is still challenging to run Apache Spark and other big data processing workload on Kub...

27 Touko 20229min

Microservices and Kubernetes for your Full Data Lifecycle (DoK Day EU 2022) // Steve Pousty

Microservices and Kubernetes for your Full Data Lifecycle (DoK Day EU 2022) // Steve Pousty

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Data doesn’t magically appear in our data centers. There are usually several phases and s...

27 Touko 202214min

Leveraging Running Stateful Workloads on Kubernetes for the Benefit of Developers (DoK Day EU 2022) // Arsh Sharma, Lapo Elisacci & Ramiro Berrelleza

Leveraging Running Stateful Workloads on Kubernetes for the Benefit of Developers (DoK Day EU 2022) // Arsh Sharma, Lapo Elisacci & Ramiro Berrelleza

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes comes with a lot of useful features like Volumes and StatefulSets, which make ...

27 Touko 202214min

Kanister & Kopia - An Open-Source Data Protection Match Made in Heaven (DoK Day EU 2022) // Pavan Navarathna

Kanister & Kopia - An Open-Source Data Protection Match Made in Heaven (DoK Day EU 2022) // Pavan Navarathna

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Cloud-native applications comprise various components, including data services, storage s...

27 Touko 202213min

Is your database in Kubernetes production ready (DoK Day EU 2022) // Mykola Marzhan

Is your database in Kubernetes production ready (DoK Day EU 2022) // Mykola Marzhan

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) It only looks simple to run databases in Kubernetes. In fact, it is too many things neede...

27 Touko 202215min

How to protect your data (DoK Day EU 2022) // Sarah Julia Kriesch

How to protect your data (DoK Day EU 2022) // Sarah Julia Kriesch

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) How can you keep your data secure and how can you transfer them on a secure way? You will...

27 Touko 20226min

 Growing up fast - Kubernetes and Real-Time Analytic Applications (DoK Day EU 2022) // Robert Hodges

Growing up fast - Kubernetes and Real-Time Analytic Applications (DoK Day EU 2022) // Robert Hodges

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Kubernetes is turning into a preferred platform for real-time analytic app that crunch bi...

27 Touko 202215min