Apache Beam with Kenneth Knowles and Pablo Estrada

Apache Beam with Kenneth Knowles and Pablo Estrada

On the podcast this week, your hosts Stephanie Wong and Mark Mirchandani talk about the data processing tool Apache Beam with guests Pablo Estrada and Kenneth Knowles.

Kenn starts us off with an overview of how Apache Beam began and how Cloud Dataflow was involved. The unique batch and stream method and emphasis on correctness garnered support from developers early on and continues to attract users. Pablo helps us understand why Beam is a better option for certain projects looking to process large amounts of data. Our guests describe how Beam may be a better fit than microservices that could become obsolete as company needs change.

Next, we step back and take a look at why batch and stream is the gold standard of data processing because of its balance between low latency and ease of "being done" with data collection. Beam's focus on the correctness of data and correctness in processing that data is a core component. With good data, processing becomes easier, more reliable, and cheaper. Kenn gives examples of how things can go wrong with bad data processing. Beam strives for the perfect combination of low latency, correct data, and affordability. Users can choose where to run Beam pipelines, from other Apache software offerings to Dataflow, which means excellent flexibility. Our guests talk about the pros and cons of some of these options and we hear examples of how companies are using Beam along with supporting software to solve data processing challenges.

To get started with Beam, check out Beam College or attend Beam Summit 2022.

Kenneth Knowles

Kenn Knowles is chair of the Apache Beam Project Management Committee. Kenn has been working on Google Cloud Dataflow—Google's Beam backend—since 2014. Kenn holds a PhD in programming languages from the University of California, Santa Cruz.

Pablo Estrada

Pablo is a Software Engineer at Google, and a management committee member for Apache Beam. Pablo is big into working on an open source project, and has worked all across the Apache Beam stack.

Cool things of the week
  • Under the sea: Building the world's fiber optic internet video
  • Google Data Cloud Summit site
  • It's official—Google Distributed Cloud Edge is generally available blog
    • GCP Podcast Episode 228: Fastly with Tyler McMullen podcast
  • Save big by temporarily suspending unneeded Compute Engine VMs—now GA blog
Interview
  • Apache Beam site
  • Apache Beam Documentation site
  • Dataflow site
  • Apache Flink site
  • Apache Spark site
  • Apache Samza site
  • Apache Nemo site
  • Spanner site
  • BigQuery site
  • Beam College site
  • Beam College on Github site
  • Beam Developer Mailing List email
  • Beam User Mailing List email
  • Beam Summit site
What's something cool you're working on?

Mark is working on a new Apache Beam video series Getting Started Wtih Apache Beam

Hosts

Stephanie Wong and Mark Mirchandani

Jaksot(335)

Vertex AI Experiments with Ivan Nardini and Karthik Ramachandran

Vertex AI Experiments with Ivan Nardini and Karthik Ramachandran

Vertex AI Experiments with Ivan Nardini and Karthik Ramachandran Hosts Anu Srivastava and Nikita Namjoshi are joined by guests Ivan Nardini and Karthik Ramachandran in a conversation about Vertex AI E...

21 Syys 202226min

Storage Spotlight with Sean Derrington and Nishant Kohli

Storage Spotlight with Sean Derrington and Nishant Kohli

Host Stephanie Wong chats with storage pros Sean Derrington and Nishant Kohli this week to learn more about cost optimization with storage projects and exciting new launches in the Google Cloud storag...

14 Syys 202230min

GKE Turns 7 with Tim Hockin

GKE Turns 7 with Tim Hockin

Tim Hockin joins Kaslin Fields and Anthony Bushong to celebrate GKE's seventh birthday! Tim starts with a brief background on GKE from its beginnings in 2015 and its relationship to Borg to the vision...

31 Elo 202238min

Launching Products at Google Cloud with Anita Kibunguchy-Grant and Gabe Weiss

Launching Products at Google Cloud with Anita Kibunguchy-Grant and Gabe Weiss

This week, Max Saltonstall and Stephanie Wong go behind the scenes at Google Cloud with Gabe Weiss and Anita Kibunguchy-Grant to learn how new products move from idea to market. To start, our guests w...

24 Elo 202244min

Google Cloud for Higher Education with Laurie White and Aaron Yeats

Google Cloud for Higher Education with Laurie White and Aaron Yeats

On the podcast this week, our guests Laurie White and Aaron Yeats talk with Stephanie Wong and Kelci Mensah about higher education and how Google Cloud is helping students realize their potential. As ...

17 Elo 202248min

Cloud Functions (2nd gen) with Jaisen Mathai and Sara Ford

Cloud Functions (2nd gen) with Jaisen Mathai and Sara Ford

Stephanie Wong and Brian Dorsey are joined today by fellow Googlers Jaisen Mathai and Sara Ford to hear all about Cloud Functions (2nd gen) and how it differs from the original. Jaisen gives us some b...

10 Elo 202241min

Vertex Explainable AI with Irina Sigler and Ivan Nardini

Vertex Explainable AI with Irina Sigler and Ivan Nardini

Max Saltonstall and new host Anu Srivastava are in the studio today talking about Vertex Explainable AI with guests Irina Sigler and Ivan Nardini. Vertex Explainable AI was born from a need for develo...

3 Elo 202226min

Arm Servers on GCP with Jon Masters and Emma Haruka Iwao

Arm Servers on GCP with Jon Masters and Emma Haruka Iwao

We're learning all about Arm servers on Google Cloud Platform this week. Hosts Brian Dorsey and Stephanie Wong welcome fellow Googlers Jon Masters and Emma Haruka Iwao to talk about the newest VMs on ...

27 Heinä 202235min

Suosittua kategoriassa Politiikka ja uutiset

uutiscast
aikalisa
rss-ootsa-kuullut-tasta
politiikan-puskaradio
ootsa-kuullut-tasta-2
tervo-halme
viisupodi
rss-podme-livebox
otetaan-yhdet
et-sa-noin-voi-sanoo-esittaa
rikosmyytit
the-ulkopolitist
rss-asiastudio
io-techin-tekniikkapodcast
aihe
rss-pallo-keskelle-2
radio-antro
rss-kovin-paikka
rss-sanna-ukkola-show-verkkouutiset
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset