R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

R1, OpenAI’s o3, and the ARC-AGI Benchmark: Insights from Mike Knoop

In this episode of Gradient Dissent, host Lukas Biewald sits down with Mike Knoop, Co-founder and CEO of Ndea, a cutting-edge AI research lab. Mike shares his journey from building Zapier into a major automation platform to diving into the frontiers of AI research. They discuss DeepSeek’s R1, OpenAI’s O-series models, and the ARC Prize, a competition aimed at advancing AI’s reasoning capabilities. Mike explains how program synthesis and deep learning must merge to create true AGI, and why he believes AI reliability is the biggest hurdle for automation adoption.

This conversation covers AGI timelines, research breakthroughs, and the future of intelligent systems, making it essential listening for AI enthusiasts, researchers, and entrepreneurs.

Mentioned Show Notes:

https://ndea.com

https://arcprize.org/blog/r1-zero-r1-results-analysis

https://arcprize.org/blog/oai-o3-pub-breakthrough


🎙 Get our podcasts on these platforms:

Apple Podcasts: http://wandb.me/apple-podcasts

Spotify: http://wandb.me/spotify

Google: http://wandb.me/gd_google

YouTube: http://wandb.me/youtube


Connect with Mike Knoop"

@mikeknoop


Follow Weights & Biases:

https://twitter.com/weights_biases

https://www.linkedin.com/company/wandb


Join the Weights & Biases Discord Server:

https://discord.gg/CkZKRNnaf3


Episoder(134)

Stephan Fabel — Efficient Supercomputing with NVIDIA's Base Command Platform

Stephan Fabel — Efficient Supercomputing with NVIDIA's Base Command Platform

Stephan Fabel is Senior Director of Infrastructure Systems & Software at NVIDIA, where he works on Base Command, a software platform to coordinate access to NVIDIA's DGX SuperPOD infrastructure.Lukas ...

6 Jan 202252min

Chris Padwick — Smart Machines for More Sustainable Farming

Chris Padwick — Smart Machines for More Sustainable Farming

Chris Padwick is Director of Computer Vision Machine Learning at Blue River Technology, a subsidiary of John Deere. Their core product, See & Spray, is a weeding robot that identifies crops and weeds ...

23 Des 20211h

Kathryn Hume — Financial Models, ML, and 17th-Century Philosophy

Kathryn Hume — Financial Models, ML, and 17th-Century Philosophy

Kathryn Hume is Vice President Digital Investments Technology at the Royal Bank of Canada (RBC). At the time of recording, she was Interim Head of Borealis AI, RBC's research institute for machine lea...

16 Des 202152min

Sean & Greg — Biology and ML for Drug Discovery

Sean & Greg — Biology and ML for Drug Discovery

Sean McClain is the founder and CEO, and Gregory Hannum is the VP of AI Research at Absci, a biotech company that's using deep learning to expedite drug discovery and development.Lukas, Sean, and Greg...

2 Des 202155min

Chris, Shawn, and Lukas — The Weights & Biases Journey

Chris, Shawn, and Lukas — The Weights & Biases Journey

You might know him as the host of Gradient Dissent, but Lukas is also the CEO of Weights & Biases, a developer-first ML tools platform!In this special episode, the three W&B co-founders — Chris (CVP),...

5 Nov 202149min

Pete Warden — Practical Applications of TinyML

Pete Warden — Practical Applications of TinyML

Pete is the Technical Lead of the TensorFlow Micro team, which works on deep learning for mobile and embedded devices.Lukas and Pete talk about hacking a Raspberry Pi to run AlexNet, the power and siz...

21 Okt 202153min

Pieter Abbeel — Robotics, Startups, and Robotics Startups

Pieter Abbeel — Robotics, Startups, and Robotics Startups

Pieter is the Chief Scientist and Co-founder at Covariant, where his team is building universal AI for robotic manipulation. Pieter also hosts The Robot Brains Podcast, in which he explores how far hu...

7 Okt 202157min

Chris Albon — ML Models and Infrastructure at Wikimedia

Chris Albon — ML Models and Infrastructure at Wikimedia

In this episode we're joined by Chris Albon, Director of Machine Learning at the Wikimedia Foundation.Lukas and Chris talk about Wikimedia's approach to content moderation, what it's like to work in a...

23 Sep 202156min

Populært innen Business og økonomi

lydartikler-fra-aftenposten
stopp-verden
dine-penger-pengeradet
e24-podden
rss-penger-polser-og-politikk
rss-borsmorgen-okonominyhetene
pengepodden-2
livet-pa-veien-med-jan-erik-larssen
finansredaksjonen
pengesnakk
tid-er-penger-en-podcast-med-peter-warren
utbytte
stormkast-med-valebrokk-stordalen
morgenkaffen-med-finansavisen
okonomiamatorene
liberal-halvtime
rss-politisk-preik
rss-markedspuls-2
lederpodden
rss-pa-konto