The server-side rendering equivalent for LLM inference workloads

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or email him at tuhin@baseten.co.

Shoutout to user Hitesh for winning a Populist badge for their answer to "Cannot drop database because it is currently in use".

Episodes (921)

Open source for awkward robots

Ryan is joined by Jan Liphardt, CEO and co-founder of OpenMind, to chat about the rapidly evolving world of humanoid robotics and what it means for humans, why OpenMind is building an open source ope...

13 March 30min

Even the chip makers are making LLMs

Ryan welcomes Kari Briski, NVIDIA’s VP of Generative AI Software for Enterprise, to the show to explore how a chip manufacturer got into the model development game. They discuss NVIDIA’s co-design fee...

10 March 26min

Building brains for bulldozers

Ryan chats with Kevin Peterson, CTO of Bedrock Robotics, about the evolution of self-driving technology and why robotics is now advancing; how real data is still relevant but simulation becomes essent...

6 March 24min

AI-assisted coding needs more than vibes; it needs containers and sandboxes

SPONSORED BY DOCKER. In this sponsored episode, Ryan chats with Mark Cavage, President and COO of Docker, about hardened containers and agent sandboxes. They discuss what it means ...

4 March 27min

No need for Ctrl+C when you have MCP

Ryan sits down with Member of the Technical Staff at Anthropic and Model Context Protocol co-creator David Soria Parra to talk about the evolution of MCP from local-only to remote connectivity, how security...

2 March 31min

To live in an AI world, knowing is half the battle

Ryan welcomes Marcus Fontoura, technical fellow at Microsoft and author of Human Agency in the Digital World, to discuss the intersection of technology, society, and human dignity in a digital-first w...

27 Feb 28min

Dogfood so nutritious it’s building the future of SDLCs

Ryan welcomes Thibault Sottiaux, OpenAI’s engineering lead on Codex, to discuss how the Codex team dogfoods Codex to build Codex, what distinguishes an agentic coding tool from a chat-based code assis...

24 Feb 32min

Even GenAI uses Wikipedia as a source

Ryan is joined by Philippe Saade, the AI project lead at Wikimedia Deutschland, to dive into the Wikidata Embedding Project and how their team vectorized 30 million of Wikidata’s 119 million entries f...

20 Feb 26min
