Next-Gen Data Modeling, Integrity, and Governance with YODA

In this episode, Kris interviews Doron Porat, Director of Infrastructure at Yotpo, and Liran Yogev, Director of Engineering at ZipRecruiter (formerly at Yotpo), about their experiences and strategies in dealing with data modeling at scale. Yotpo has a vast and active data lake, comprising thousands of datasets that are processed by different engines, primarily Apache Spark™. They wanted to provide users with self-service tools for generating and utilizing data with maximum flexibility, but e...

Episodes (284)

Real-time Threat Detection Using Machine Learning and Apache Kafka

Can we use machine learning to detect security threats in real-time? As organizations increasingly rely on distributed systems, it is becoming more important to analyze the traffic that passes through...

29 Nov 2022, 29 min

Improving Apache Kafka Scalability and Elasticity with Tiered Storage

What happens when you need to store more than a few petabytes of data? Rittika Adhikari (Software Engineer, Confluent) discusses how her team implemented tiered storage, a method for improving the sca...

22 Nov 2022, 29 min

Decoupling with Event-Driven Architecture

In principle, data mesh architecture should liberate teams to build their systems and gather data in a distributed way, without having to explicitly coordinate. Data is the thing that can and should d...

15 Nov 2022, 38 min

If Streaming Is the Answer, Why Are We Still Doing Batch?

Is real-time data streaming the future, or will batch processing always be with us? Interest in streaming data architecture is booming, but just as many teams are still happily batching away. Batch pr...

9 Nov 2022, 43 min

Security for Real-Time Data Stream Processing with Confluent Cloud

Streaming real-time data at scale and processing it efficiently is critical to cybersecurity organizations like SecurityScorecard. Jared Smith, Senior Director of Threat Intelligence, and Brandon Brow...

3 Nov 2022, 48 min

Running Apache Kafka in Production

What are some recommendations to consider when running Apache Kafka® in production? Jun Rao, one of the original Kafka creators, as well as an ongoing committer and PMC member, shares the essential wi...

27 Oct 2022, 58 min

Build a Real Time AI Data Platform with Apache Kafka

Is it possible to build a real-time data platform without using stateful stream processing? Forecasty.ai is an artificial intelligence platform for forecasting commodity prices, imparting insights int...

20 Oct 2022, 37 min

Optimizing Apache JVMs for Apache Kafka

Java Virtual Machines (JVMs) impact Apache Kafka® performance in production. How can you optimize your event-streaming architectures so they process more Kafka messages using the same number of JVMs? ...

13 Oct 2022, 1 h 11 min