#31 DoK Community: The Data Lifecycle - Where Do We Go From Here // Benjamin Rogojan. (Presenter: Bart Farrell)

Abstract of the talk…

Going from raw data to machine learning models successfully, in companies of all sizes, requires more than just an understanding of programming. Teams need to manage the lifecycle of their data products, covering both the software and the data. Data products like machine learning models aren’t created out of thin air. They are built on layers of best practices that ensure the models use accurate data, output reliable numbers, and have some method of interacting with the outside world. So how do we get there? The purpose of this talk is to discuss the current state of the data lifecycle as it pertains to creating data products, such as machine learning models, dashboards, and data APIs. We will outline the general architecture that helps take data from raw to some form of machine learning model. In addition, we will discuss some of the concepts that are being applied from DevOps, as well as those being created in MLOps, to help better facilitate your data lifecycle.

Bio…

Ben has spent his career focused on all forms of data. He has developed algorithms to detect fraud, reduce patient readmission, and redesign insurance provider policy to help reduce the overall cost of healthcare. He has also worked in various industries including transportation, Big Tech, start-ups, insurance, SaaS and more. In all of these industries he has helped companies develop their data strategy, often starting from scratch to build an end-to-end data solution. Ben privately consults on data science and engineering problems, both solo with Seattle Data Guy and with a company called Acheron Analytics. He has experience both working hands-on with technical problems and helping leadership teams develop strategies to maximize the value of their data.

Key take-aways from the talk…

- Creating successful data products and models requires more than just programming skills
- Best practices from DevOps can help improve the maintenance and lifecycle of data science and ML models

Episodes (243)

DoK Talks #144 - Mastering MongoDB on Kubernetes, the power of operators // Arek Borucki

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK During my first talk for DoK community I want to walk you through the world of NoSQL database MongoDB and Kubernetes Ope...

26 July 2022, 1h

DoK Specials - Why are Operators paramount to running stateful workloads on Kubernetes?

In this panel, Sylvain Kalache, Head of Content at the DoK Community, drives a conversation featuring Nic Vermandé, Principal Developer Advocate at Ondat, Julian Fischer, CEO at anynines, and Serg...

20 July 2022, 53min

DoK Talks #141 - Dossier: multi-tenant distributed Jupyter Notebooks // Iacoppo Colonnelli & Dario Tranchitella

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK When providing data analysis as a service, one must tackle several problems. Data privacy and protection by design are crucia...

15 July 2022, 1h

DoK Talks #140 - Data protection of stateful environment // Timothy Dewin

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK More and more we see stateful workloads pop up in Kubernetes clusters. These workloads generate data that is unique and is eph...

28 June 2022, 42min

DoK Talks #139 - Private DBaaS on Kubernetes // Sergey Pronin

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Percona is committed to delivering solutions to run open source databases anywhere without lock-in. As part of this commitment, w...

28 June 2022, 53min

DoK Talks #138 - Build your own social media analytics with Apache Kafka // Jakub Scholz

https://go.dok.community/slack https://dok.community ABSTRACT OF THE TALK Apache Kafka is more than just a messaging broker. It has a rich ecosystem of different components. There are connectors for i...

24 June 2022, 56min

DoK Talks #137 - How to build your own “Doordash” app // Yaniv Ben Hemo

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK The entire app is built as microservices, runs on k8s pods, and uses a k8s-native message broker called memphis WORKSHOP...

23 June 2022, 57min

DoK Talks #136 - Building a mesh for databases from scratch and why // Maxwell Miao

https://go.dok.community/slack https://dok.community/ ABSTRACT OF THE TALK In this talk, Maxwell is going to share his thoughts about Service Mesh and database operations, called Database Mesh, a...

15 June 2022, 47min