DoK Community #41 Designing Stateful Apps for the Cloud and Kubernetes // Evan Chan

Abstract of the talk…

Almost all applications have some kind of state. Some data processing apps and databases have huge amounts of state. How do we navigate a cloud-based world of containers where stateless design and functions-as-a-service are all the rage? As a long-time architect, designer, and developer of very stateful apps (databases and data processing apps), I’d like to take you on a journey through the modern cloud world and Kubernetes, offering helpful design patterns, considerations, tips, and a look at where things are going. How is Kubernetes shaking up stateful app design?

- What kinds of state are there, and what are some important characteristics?
- Kubernetes, containers, and the stateless paradigm (pushing state into DBs)
- Where state lives and its persistence characteristics
- Stateless vs serverless: why stateless is not really stateless, but serverless really is
- Improving on the stateless paradigm using the local state pattern
- Logs and event streaming for reasoning about state and failure recovery
- The case for local disks: ML, databases, etc.
- Kubernetes, Persistent Volumes, and StatefulSets
- Leveraging Kubernetes PVs as a basis for building distributed data systems
- Mapping the solution space

Bio…

Evan has been a distributed systems / data / software engineer for twenty years. He led a team developing FiloDB, an open source (github.com/filodb/FiloDB) distributed time series database that can process a million records per second per node while simultaneously answering a large number of concurrent queries per second. He has architected, developed, and productionized large-scale data and telemetry systems at companies including Apple, and loves solving the most challenging technical problems at both large and small scales, from advanced custom data structures to distributed coordination. He is an expert in bleeding-edge JVM, Java, Scala, and Rust performance. Current interests include Rust and columnar compression. He has led the design and implementation of multiple big data platforms based on Apache Storm, Spark, Kafka, Cassandra, and Scala/Akka. He has been an active contributor to the Apache Spark project and is a two-time DataStax Cassandra MVP.

Episodes (243)

#11 DoK community: Doing Data Wrong // Jeremy Tanner & David McKay

For our 11th installment of the Data on K8s meetup, we talk with Sr. Tech Evangelists Jeremy Tanner and David McKay from Packet about doing data wrong on k8s. // Key takeaways: Data is hard with ...

30 Sep 2020, 53 min

#10 DoK community: Data on Kubernetes and container attached storage - an update // Evan Powell

For our 10th installment of the Data on K8s community meetup, we talk with Evan Powell, CEO of MayaData, about container attached storage, the Portworx acquisition, OpenEBS, whether open source can make it, and we...

24 Sep 2020, 58 min

#9 DoK community: Geospatial Sensor Networks and Partitioning Data // Alex Miłowski

For our 9th installment of the DoKC Data on K8s meetup, we will be talking with Alex Miłowski from Redis Labs. // Key takeaways: How are data collection and consumption workloads fundamentally differ...

17 Sep 2020, 54 min

#8 DoK community: Appropriate workloads for databases in K8s // Rick Vasquez

For our 8th installment of the Data on K8s meetup, we spoke with Rick Vasquez, Enablement Lead - Services Portfolio at Percona. // Key takeaways: Large unsharded data footprints are not great for kub...

9 Sep 2020, 1 h

#7 DoK community: Conway’s Law & Kubernetes: Centralization vs. small team autonomy // Joseph Sandoval & Mike Tougeron

Data on Kubernetes #7: Conway’s Law & Kubernetes - Centralization vs small team autonomy, with Mike Tougeron, Lead Site Reliability Engineer at Adobe, and Joseph Sandoval, SRE Manager, Platform Infrast...

3 Sep 2020, 56 min

#6 DoK community: Operators, operators, operators… operators // Amit Gupta

Data on Kubernetes Community #6: Operators, operators, operators… Kubernetes operators! With Amit Gupta, Group Product Manager at Confluent. Key takeaways: Kubernetes Operators represent a great oppo...

26 Aug 2020, 57 min

#5 DoK community: The full cycle of doing data on k8s: a case study // Dave Cook

Doing Data on Kubernetes: this week we dive into globally distributed business applications with Dave Cook, founder of Gridworkz. Key takeaways: Current data scalability challenges outlined. What’s avail...

21 Aug 2020, 55 min

#4 DoK community: The problem of stateful workloads - balance of keeping data HA vs. costs // Ren Lee

Balancing redundancy and HA with costs: did you really need all N replicas? AKA: We were running what, and it cost us how much?! With Ren Lee, SRE at Arista Networks. Key takeaways: “Lazy but Simple” vs. “...

13 Aug 2020, 59 min