#31 DoK Community: The Data Lifecycle - Where Do We Go From Here // Benjamin Rogojan. (Presenter: Bart Farrell)

Abstract of the talk…

Going from raw data to machine learning models successfully, in companies of all sizes, requires more than an understanding of programming. Teams need to manage the lifecycle of their data products: the software as well as the data. Data products like machine learning models aren’t created out of thin air. They are built on layers of best practices that ensure the models use accurate data, output reliable numbers, and have some method to interact with the outside world. So how do we get there? The purpose of this talk is to discuss the current state of the data lifecycle as it pertains to creating data products, which could be machine learning models, dashboards, or data APIs. We will outline the general architecture that helps take data from its raw form to some form of machine learning model. In addition, we will discuss some of the concepts that are being applied from DevOps, as well as those being created in MLOps, to help better facilitate your data lifecycle.

Bio…

Ben has spent his career focused on all forms of data. He has developed algorithms to detect fraud, reduce patient readmission, and redesign insurance provider policy to help reduce the overall cost of healthcare. He has also worked in various industries, including transportation, Big Tech, start-ups, insurance, SaaS, and more. In all of these industries he has helped companies develop their data strategy, often starting from scratch to build an end-to-end data solution. Ben privately consults on data science and engineering problems, both solo with Seattle Data Guy and with a company called Acheron Analytics. He has experience both working hands-on with technical problems and helping leadership teams develop strategies to maximize their data.

Key take-aways from the talk…

- Creating successful data products and models requires more than just programming skills
- Best practices from DevOps can help improve the maintenance and lifecycle of data science and ML models

Episodes (243)

DoK Specials - Learn by doing in the DoK Community // Bart Farrell

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK As a community we are committed to making learning how to run stateful workloads on Kubernetes as accessible and inclusi...

14 June 2022, 15 min

DoK Talks #135 - DoK isn't just Database on Kubernetes // Patrick McFadin

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK What about your streaming and analytic workloads? If you are all-in on Kubernetes you can't forget about these important parts...

10 June 2022, 46 min

DoK Talks #134 - Introducing CloudNativePG // Gabriele Bartolini & Leonardo Cecchi

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK CloudNativePG is an open source operator for the orchestration of Postgres workloads with a primary and an arbitrary numb...

9 June 2022, 1 h 5 min

DoK Talks #133 - My First 90 days with Clickhouse // Alkin Tezuysal

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK This talk will tell the story of an analytics use case database from a non-OLAP and ACID-compliant RDBMS (MySQL) perspective. ...

8 June 2022, 47 min

DoK Specials - DEI Panel - We can do better

https://go.dok.community/slack https://dok.community/ With: Melissa Logan - Director, Data on Kubernetes Lisa-Marie Namphy - Head of Developer Relations, Cockroach Labs Alexandra Rowell - Communi...

3 June 2022, 57 min

DoK Talks #132 - Time-series on SQL Server on Kubernetes on ARM64… without SQL Server! // Álvaro Hernández

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK Oh wow. What a weird title. Full of terms that don’t fit together. Or do they? This talk is for believers, those who belie...

2 June 2022, 1 h 5 min

Why we created one more Operator for MySQL (DoK Day EU 2022) // Sergey Pronin

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) There are plenty of Kubernetes Operators for MySQL, including our own at Percona. In this se...

28 May 2022, 9 min

Why run Postgres in Kubernetes (DoK Day EU 2022) // Gabriele Bartolini

https://go.dok.community/slack https://dok.community/ From the DoK Day EU 2022 (https://youtu.be/Xi-h4XNd5tE) Postgres should run inside your Kubernetes cluster. Yes, inside, not outside Kubernetes. ...

28 May 2022, 10 min