Apache Beam with Kenneth Knowles and Pablo Estrada

Apache Beam with Kenneth Knowles and Pablo Estrada

On the podcast this week, your hosts Stephanie Wong and Mark Mirchandani talk about the data processing tool Apache Beam with guests Pablo Estrada and Kenneth Knowles.

Kenn starts us off with an overview of how Apache Beam began and how Cloud Dataflow was involved. The unique batch and stream method and emphasis on correctness garnered support from developers early on and continues to attract users. Pablo helps us understand why Beam is a better option for certain projects looking to process large amounts of data. Our guests describe how Beam may be a better fit than microservices that could become obsolete as company needs change.

Next, we step back and take a look at why batch and stream is the gold standard of data processing because of its balance between low latency and ease of "being done" with data collection. Beam's focus on the correctness of data and correctness in processing that data is a core component. With good data, processing becomes easier, more reliable, and cheaper. Kenn gives examples of how things can go wrong with bad data processing. Beam strives for the perfect combination of low latency, correct data, and affordability. Users can choose where to run Beam pipelines, from other Apache software offerings to Dataflow, which means excellent flexibility. Our guests talk about the pros and cons of some of these options and we hear examples of how companies are using Beam along with supporting software to solve data processing challenges.

To get started with Beam, check out Beam College or attend Beam Summit 2022.

Kenneth Knowles

Kenn Knowles is chair of the Apache Beam Project Management Committee. Kenn has been working on Google Cloud Dataflow—Google's Beam backend—since 2014. Kenn holds a PhD in programming languages from the University of California, Santa Cruz.

Pablo Estrada

Pablo is a Software Engineer at Google, and a management committee member for Apache Beam. Pablo is big into working on an open source project, and has worked all across the Apache Beam stack.

Cool things of the week
  • Under the sea: Building the world's fiber optic internet video
  • Google Data Cloud Summit site
  • It's official—Google Distributed Cloud Edge is generally available blog
    • GCP Podcast Episode 228: Fastly with Tyler McMullen podcast
  • Save big by temporarily suspending unneeded Compute Engine VMs—now GA blog
Interview
  • Apache Beam site
  • Apache Beam Documentation site
  • Dataflow site
  • Apache Flink site
  • Apache Spark site
  • Apache Samza site
  • Apache Nemo site
  • Spanner site
  • BigQuery site
  • Beam College site
  • Beam College on Github site
  • Beam Developer Mailing List email
  • Beam User Mailing List email
  • Beam Summit site
What's something cool you're working on?

Mark is working on a new Apache Beam video series Getting Started Wtih Apache Beam

Hosts

Stephanie Wong and Mark Mirchandani

Jaksot(335)

Database Migration Service with Shachar Guz, Inna Weiner, and Gabe Weiss

Database Migration Service with Shachar Guz, Inna Weiner, and Gabe Weiss

Stephanie Wong talks with guests Shachar Guz, Inna Weiner, and Gabe Weiss about Google's Database Migration Service and how it helps companies move data to Google Cloud. What typically is a complicate...

16 Marras 202240min

ML/AI Data Science for Data Analytics with Jed Dougherty and Dan Darnell

ML/AI Data Science for Data Analytics with Jed Dougherty and Dan Darnell

On the show this week, Carter Morgan and Anu Srivastava talk about AI and ML data analytics with Dataiku VP of Platform Strategy, Jed Dougherty, and Head of Product Marketing, Dan Darnell. Dataiku is ...

9 Marras 202232min

Assured Workloads with Key Access Justifications with Bryce Buffaloe and Seth Denney

Assured Workloads with Key Access Justifications with Bryce Buffaloe and Seth Denney

Hosts Max Saltonstall and Daryl Ducharme are joined by Bryce Buffaloe and Seth Denney to chat about Assured Workloads and the sovereignty control Key Access Justifications so customers can see how the...

2 Marras 202242min

Digital Sovereignty with Archana Ramamoorthy and Julien Blanchez

Digital Sovereignty with Archana Ramamoorthy and Julien Blanchez

This week, Max Saltonstall and Chloe Condon welcome guests Archana Ramamoorthy and Julien Blanchez to talk about digital sovereignty and what goes into a technical strategy for dealing with this compl...

26 Loka 202236min

Top 5 Data & Analytics Launches from Next 2022 with Bruno Aziza and Maire Newton

Top 5 Data & Analytics Launches from Next 2022 with Bruno Aziza and Maire Newton

Debi Cabrera and Stephanie Wong have more great Next content this week as we focus on launches specifically related to data and analytics with guests Bruno Aziza and Maire Newton. We start the episode...

19 Loka 202230min

Next 2022 with Forrest Brazeal and Stephanie Wong

Next 2022 with Forrest Brazeal and Stephanie Wong

Forrest Brazeal joins Stephanie Wong today on the second day of Google Cloud Next '22. We're talking about all the exciting announcements, how the conference has changed in recent years, and what to e...

12 Loka 202243min

2022 State of DevOps Report with Nathen Harvey and Derek DeBellis

2022 State of DevOps Report with Nathen Harvey and Derek DeBellis

On the show this week, we're talking updated DevOps practices for 2022 with hosts Stephanie Wong and Chloe Condon and our guests Nathen Harvey and Derek DeBellis. Nathen and Derek start the show with ...

5 Loka 202244min

DEI and Belonging in the Cloud with Jason Smith

DEI and Belonging in the Cloud with Jason Smith

Jason Smith, founder of the Mixed Googlers group here at Google, joins Stephanie Wong to talk about DEI and the importance of belonging in tech. Jason helps us better understand what the concepts dive...

28 Syys 202233min

Suosittua kategoriassa Politiikka ja uutiset

uutiscast
aikalisa
rss-ootsa-kuullut-tasta
ootsa-kuullut-tasta-2
tervo-halme
politiikan-puskaradio
viisupodi
rss-podme-livebox
et-sa-noin-voi-sanoo-esittaa
rss-asiastudio
otetaan-yhdet
the-ulkopolitist
rikosmyytit
rss-pallo-keskelle-2
rss-mina-ukkola
rss-kovin-paikka
rss-hyvaa-huomenta-bryssel
rss-terveisia-seelannista
rss-sanna-ukkola-show-verkkouutiset
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset