R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

In this episode of Gradient Dissent, host Lukas Biewald sits down with Mike Knoop, Co-founder and CEO of Ndea, a cutting-edge AI research lab. Mike shares his journey from building Zapier into a major automation platform to diving into the frontiers of AI research. They discuss DeepSeek’s R1, OpenAI’s O-series models, and the ARC Prize, a competition aimed at advancing AI’s reasoning capabilities. Mike explains how program synthesis and deep learning must merge to create true AGI, and why he believes AI reliability is the biggest hurdle for automation adoption.

This conversation covers AGI timelines, research breakthroughs, and the future of intelligent systems, making it essential listening for AI enthusiasts, researchers, and entrepreneurs.

Mentioned Show Notes:

https://ndea.com

https://arcprize.org/blog/r1-zero-r1-results-analysis

https://arcprize.org/blog/oai-o3-pub-breakthrough


🎙 Get our podcasts on these platforms:

Apple Podcasts: http://wandb.me/apple-podcasts

Spotify: http://wandb.me/spotify

Google: http://wandb.me/gd_google

YouTube: http://wandb.me/youtube


Connect with Mike Knoop"

@mikeknoop


Follow Weights & Biases:

https://twitter.com/weights_biases

https://www.linkedin.com/company/wandb


Join the Weights & Biases Discord Server:

https://discord.gg/CkZKRNnaf3


Episoder(134)

The $64M Bet on an AI That Has to Be Right | Carina Hong, CEO of Axiom

The $64M Bet on an AI That Has to Be Right | Carina Hong, CEO of Axiom

Formal verification already consumes years of human effort.In this episode, Lukas Biewald talks with Carina Hong, Founder & CEO of Axiom, about why verification is becoming the real bottleneck in high...

5 Feb 50min

What a $42B Software Co. Really Spends on AI Tools

What a $42B Software Co. Really Spends on AI Tools

“I don't worry about being replaced by AI. I worry about being replaced by someone who's really good at using AI.”Atlassian has 10,000+ engineers currently split-testing the world’s top AI coding tool...

20 Jan 1h 7min

Inside the $41B AI Cloud Challenging Big Tech | CoreWeave SVP

Inside the $41B AI Cloud Challenging Big Tech | CoreWeave SVP

The future of AI training is shaped by one constraint: keeping GPUs fed.In this episode, Lukas Biewald talks with CoreWeave SVP Corey Sanders about why general-purpose clouds start to break down under...

6 Jan 53min

Why Physical AI Needed a Completely New Data Stack

Why Physical AI Needed a Completely New Data Stack

The future of AI is physical. In this episode, Lukas Biewald talks to Nikolaus West, CEO of Rerun, about why the breakthrough required to get AI out of the lab and into the messy real world is blocked...

16 Des 20251h

The Engineering Behind the World’s Most Advanced Video AI

The Engineering Behind the World’s Most Advanced Video AI

Is video AI a viable path toward AGI? Runway ML founder Cristóbal Valenzuela joins Lukas Biewald just after Gen 4.5 reached the #1 position on the Video Arena Leaderboard, according to community votin...

1 Des 202514min

The CEO Behind the Fastest-Growing AI Inference Company | Tuhin Srivastava

The CEO Behind the Fastest-Growing AI Inference Company | Tuhin Srivastava

In this episode of Gradient Dissent, Lukas Biewald talks with Tuhin Srivastava, CEO and founder of Baseten, one of the fastest-growing companies in the AI inference ecosystem. Tuhin shares the real st...

18 Nov 202559min

The Startup Powering The Data Behind AGI

The Startup Powering The Data Behind AGI

In this episode of Gradient Dissent, Lukas Biewald talks with the CEO & founder of Surge AI, the billion-dollar company quietly powering the next generation of frontier LLMs. They discuss Surge's orig...

16 Sep 202556min

Arvind Jain on Building Glean and the Future of Enterprise AI

Arvind Jain on Building Glean and the Future of Enterprise AI

In this episode of Gradient Dissent, Lukas Biewald sits down with Arvind Jain, CEO and founder of Glean. They discuss Glean's evolution from solving enterprise search to building agentic AI tools that...

5 Aug 202543min

Populært innen Business og økonomi

stopp-verden
lydartikler-fra-aftenposten
dine-penger-pengeradet
rss-penger-polser-og-politikk
e24-podden
rss-borsmorgen-okonominyhetene
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
pengepodden-2
pengesnakk
utbytte
tid-er-penger-en-podcast-med-peter-warren
morgenkaffen-med-finansavisen
rss-sunn-okonomi
stormkast-med-valebrokk-stordalen
lederpodden
rss-markedspuls-2
liberal-halvtime
rss-politisk-preik
lederskap-nhhs-podkast-om-ledelse