Ep. 59: Alex Shan, Judgment Labs CEO

Ep. 59: Alex Shan, Judgment Labs CEO

Alex Shan is the CEO of Judgment Labs (judgmentlabs.ai), where he's working on building agent behavior monitoring infrastructure. Before Judgment, he worked at Juniper Networks and Stanford AI Lab.


Delta Institute (deltainstitutes.org) supports exceptional researchers and engineers, from academia to industry and beyond. They host technical events to bring great people together, a podcast that gives industry/academic leaders a platform to share their experiences, a small fellows program that builds a tight-knit community of exceptional people, and a grant program that provides compute/mentorship for research projects.Timestamps:00:00 Mission and Evals Focus00:30 Founder Background02:55 Childhood Co-Founders04:49 Stanford to Industry Pivot07:32 Juniper Agents Experience08:55 Founding Judgment Labs11:14 Why Existing Tools Fail13:23 Deep Agent Observability Model15:56 JudgeEval Open Core Strategy18:56 Evals Advice and Pitfalls23:24 Production Grounded Evals24:12 Rubric Discovery Signals25:06 Benchmarks That Evolve26:24 Legal Redlines Case Study27:22 From Edits To Rubrics30:40 Monitoring First Strategy32:09 Self Improving Agent Loop34:12 Competitive Differentiation36:13 Deep Context Evals42:43 Future Data Intelligence45:19 Closing Thoughts

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(60)

Ep. 60: Ronak Malde, Trajectory CEO and Former DeepMind Researcher

Ep. 60: Ronak Malde, Trajectory CEO and Former DeepMind Researcher

Ronak Malde is the CEO of Trajectory (trajectory.ai), where he's working on bringing continual learning to enterprises. Before Trajectory, he worked on research at DeepMind and trained the SWE-1 model...

28 Mai 29min

Ep. 58: Andrew Dai, Elorian CEO and Former DeepMind Research Director

Ep. 58: Andrew Dai, Elorian CEO and Former DeepMind Research Director

Andrew Dai is the co-founder and CEO of Elorian, a new visual reasoning research and product lab. Before Elorian, Andrew was a research director at DeepMind, where he was the Gemini data area co-lead,...

28 Mai 32min

Ep. 57: Kevin Wang, NeurIPS Best Paper Author and OpenAI Researcher

Ep. 57: Kevin Wang, NeurIPS Best Paper Author and OpenAI Researcher

Kevin Wang is the first author of the NeurIPS 2025 Best Paper, titled "1000 Layer Networks for Self-Supervised RL: Scaling Depth Can Enable New Goal-Reaching Capabilities". He's currently a researcher...

28 Mai 46min

Ep. 56: Grace Li, Design Arena Co-Creator and Arcada Labs Co-Founder

Ep. 56: Grace Li, Design Arena Co-Creator and Arcada Labs Co-Founder

Grace Li is the co-founder of Arcada Labs (arcada.dev), creators of Design Arena, Prediction Arena, and Social Arena. Arcada's vision is to build portals that bridge AI to the real world by building r...

28 Mai 38min

Ep. 55: Karthik Narasimhan, GPT Co-Author and Princeton CS Professor

Ep. 55: Karthik Narasimhan, GPT Co-Author and Princeton CS Professor

Karthik Narasimhan is an associate professor at Princeton's CS Department and the co-director of Princeton NLP. He's led numerous projects at the intersection of language and agents, including ReACT, ...

1 Jan 36min

Ep. 54: Michael Wornow: Kinetic Systems CEO and Stanford CS PhD

Ep. 54: Michael Wornow: Kinetic Systems CEO and Stanford CS PhD

Michael is the CEO of Kinetic Systems and recently finished his CS PhD at Stanford, where he was advised by Nigam Shah and Chris Ré. Before coming to Stanford, Michael studied CS and Statistics at Har...

31 Des 202541min

Ep. 53: Brian Zhan, Partner at Striker VP and Investor in Reflection, Skild, Periodic, Ricursive

Ep. 53: Brian Zhan, Partner at Striker VP and Investor in Reflection, Skild, Periodic, Ricursive

Brian Zhan is a partner at Striker Venture Partners. He's invested in several leading research startups, including Periodic Labs, Reflection AI, Skild AI, Dyna Robotics, Voyage AI, and more. Before co...

23 Des 202527min

Populært innen Teknologi

lydartikler-fra-aftenposten
romkapsel
teknisk-sett
tomprat-med-gunnar-tjomlid
energi-og-klima
elektropodden
nasjonal-sikkerhetsmyndighet-nsm
hans-petter-og-co
shifter
pedagogisk-intelligens
teknologi-og-mennesker
rss-anleggspraten
fornybaren
rss-plateprat
rss-ai-forklart
i-loopen
plattformpodden
rss-var-alt-bedre-for
rss-devops
rss-heis