Jacob Leverich on Efficiency, Elegance, and the Joy of Not Grepping log files at 2AM
Data Driven22 Apr 2025

Jacob Leverich on Efficiency, Elegance, and the Joy of Not Grepping log files at 2AM

This week, Frank sat down with Dr. Jacob Leverich—Stanford PhD, cofounder of Observe, and a veteran of the Google MapReduce team and Splunk. Jacob’s journey, from tinkering with video game code as a kid, to innovating at the cutting edge of distributed systems and energy efficiency, is as inspiring as it is informative.

Key Takeaways
  • Early Tech Roots: Hear how curiosity with QBasic and classic PCs (think IBM PCXT and Commodore) put Jacob on a path to high-impact data engineering.
  • MapReduce, Dremel, & the Rise of Big Data: Jacob pulls back the curtain on working with some of the most influential data processing tools at Google and how these systems shifted the entire data landscape (hello, BigQuery!).
  • Building Efficient Systems: It’s not just about scale—energy efficiency and performance optimization are the unsung heroes of today’s data infrastructure. Jacob explains why making things “just work” isn’t enough anymore.
  • The Realities of Ops & Observability: Remember the days of grepping logs at 2AM? There’s a better way. Jacob shares how platforms like Observe help teams consolidate, visualize, and act on operational data—turning chaos into actionable insight.
  • Bridging Data & Ops: The lines between data observability and traditional ops are blurring, and Jacob’s unique experience shows how best practices from data warehousing are finally making ops smoother (and less sleepless).
  • Power Concerns & the Future: As data grows, so does energy consumption in data centers. Find out why optimization isn’t just good for performance—it’s key to sustainability.

Timestamps

00:00 Interview with Jacob Levrich

05:59 Journey into Game Programming

06:43 "Pursuing Fast Video Game Code"

10:23 Data Processing and Power Efficiency

16:11 Snowflake's Transformative Database Approach

19:18 Journey to Data Management Industry

21:37 Data Products: Solving Core Challenges

27:07 Early Web Log Analysis Techniques

28:57 Consolidating Data for Efficiency

33:23 Specialized Tools and Context Switching

35:43 Unique Dual-Expertise in Tech

38:58 User-Centric Business Strategies

42:13 IP Data Analysis in Cloud

47:23 Electricity Transport Upsets Local Farms

48:25 Shift to Parallel Computing

52:10 Hardware Specialization & Software Optimization

57:32 "Stay Data Driven"

Avsnitt(300)

Synthetic Populations and the Future of Decision Intelligence

Synthetic Populations and the Future of Decision Intelligence

In this episode of Data Driven, Frank and Andy dive into the future of market intelligence with Dr. Jill Axline, co-founder and CEO of Mavera—a company building synthetic populations that simulate rea...

29 Jan 50min

Microsoft Fabric Unpacked: AI, Data Sovereignty, and a Bit of Clippy Nostalgia

Microsoft Fabric Unpacked: AI, Data Sovereignty, and a Bit of Clippy Nostalgia

In today’s show, BAILeY, your semi-sentient hostess with the mostest metadata, teams up with Frank La Vigne to welcome the ever-insightful Andrew Brust for a deep dive into the evolving Microsoft data...

12 Jan 54min

Celebrating 400 Episodes – How AI Turbocharges Coding, Podcasting, and Creativity

Celebrating 400 Episodes – How AI Turbocharges Coding, Podcasting, and Creativity

Welcome to a milestone episode of Data Driven! In episode 400, hosts BAILeY, Frank La Vigne, and Andy Leonard gather to reflect on nearly a decade at the forefront of podcasting about data, AI, and th...

8 Jan 1h

The Real Risks of LLMs - Guardrails, Judgment, and the Human Element in Cybersecurity

The Real Risks of LLMs - Guardrails, Judgment, and the Human Element in Cybersecurity

In this episode of Data Driven, hosts Frank La Vigne, Candace Gillhoolley, and BAILeY sit down with Mike Armistead, CEO of Pulse Security AI—a cybersecurity veteran who's been fortifying digital defen...

26 Nov 202558min

Going From Spreadsheets to Smart Agents - Modernizing Supply Chain Intelligence

Going From Spreadsheets to Smart Agents - Modernizing Supply Chain Intelligence

In this episode, Frank La Vigne sits down with Itay Haber, CEO of Data Noetic, to unpack how AI is revolutionizing supply chain management. Forget spreadsheets and dashboards—Data Noetic is building a...

19 Nov 202558min

Inside Nvidia GTC DC: AI, Quantum Computing, Robotics, and the Future of Supercomputers

Inside Nvidia GTC DC: AI, Quantum Computing, Robotics, and the Future of Supercomputers

Welcome to another exciting episode of Data Driven! On this week’s show, hosts Frank La Vigne and Candace Gillhoolley take you inside the NVIDIA GTC conference in Washington, DC—an event that’s rapidl...

30 Okt 202554min

The Fast-Moving Train of AI - Sovereignty, Acceleration, & Lessons from History

The Fast-Moving Train of AI - Sovereignty, Acceleration, & Lessons from History

On this episode of Data Driven, hosts Frank La Vigne and Leonard celebrate a major milestone: the 30th anniversary of Franksworld.com, one of the OGs of tech blogging that’s survived multiple browser ...

13 Okt 20251h 15min

Compute, Carbon, and Cashflow Silicon Data’s Big Bet on GPU Markets

Compute, Carbon, and Cashflow Silicon Data’s Big Bet on GPU Markets

Welcome to another episode of Data Driven, where we dive deep into how data and AI are shaping—sometimes shaking—the modern world. In this episode, hosts Frank La Vigne, Andy Leonard, and Carmen Li si...

1 Okt 202550min

Populärt inom Vetenskap

p3-dystopia
pojkmottagningen
svd-nyhetsartiklar
dumma-manniskor
allt-du-velat-veta
kapitalet-en-podd-om-ekonomi
det-morka-psyket
sexet
halsorevolutionen
rss-vetenskapsradion-2
medicinvetarna
rss-vetenskapsradion
rss-ufobortom-rimligt-tvivel-2
4health-med-anna-sparre
dumforklarat
rss-spraket
bildningspodden
paranormalt-med-caroline-giertz
hacka-livet
vetenskapsradion