The server-side rendering equivalent for LLM inference workloads

Ryan is joined by Tuhin Srivastava, CEO and co-founder of Baseten, to explore the evolving landscape of AI infrastructure and inference workloads, how the shift from traditional machine learning models to large-scale neural networks has made GPU usage challenging, and the potential future of hardware-specific optimizations in AI.

Episode notes:

Baseten is an AI infrastructure platform giving you the tooling, expertise, and hardware needed to bring AI products to market fast.

Connect with Tuhin on LinkedIn or reach him at his email tuhin@baseten.co.

Shoutout to user Hitesh for winning a Populist badge for their answer to Cannot drop database because it is currently in use.

This episode is sourced from an open RSS feed and is not published by Podme. It may therefore contain advertisements.

Episodes (947)

The find out stage of AI is just supply chain and password protection

In this two-for-one special recorded at HumanX, Ryan is joined by Dataiku’s Florian Douetteau to chat about the governance, orchestration, and data requirements for serious agentic systems and 1Passwo...

29 May, 30 min

Do you have what it takes to run AI in production?

From the floor of HumanX, Ryan Donovan is joined by Peter Salanki, CTO and co-founder of CoreWeave, to chat about what it really takes to run AI in production; the growing importance of observability,...

26 May, 27 min

Breaking your AI storage bottlenecks

Recorded at HumanX, Ryan sits down with Garima Kapoor and Anand Babu Periasamy, co-founders and co-CEOs of MinIO, to chat about eliminating the storage bottlenecks that leave GPUs underutilized, their...

22 May, 29 min

Pack your agentic stack in Slack

SPONSORED BY SLACK BY SALESFORCE. Ryan welcomes Jaime DeLanghe, chief product officer at Slack, to chat about how they’re preparing to integrate everybody’s agents in their chat application. They chat a...

20 May, 29 min

Your fridge could be a threat to national security

On the floor of HumanX, Ryan is joined by Adam Meyers, Senior VP of Counter Adversary Operations at CrowdStrike, for a deep dive on their latest Global Threat Report that tracks over 281 adversaries ...

19 May, 29 min

Observability and human intuition in an AI world

In this two-for-one episode recorded at HumanX, Ryan is first joined by Christine Yen, CEO of Honeycomb, to discuss how AI compresses the software development lifecycle, making observability about cap...

15 May, 29 min

How Braze’s CTO is rethinking engineering for the agentic area

Jon Hyman, co-founder and CTO of Braze, joins Stack Overflow CPTO Jody Bailey on Leaders of Code to share how he's led the company's engineering organization over nearly 15 years of growth — and how t...

13 May, 43 min

Connecting the dots for accurate AI

At HumanX, Ryan is joined by Philip Rathle, CTO at Neo4j to discuss what knowledge context means for AI agents, how limitations like stale training data make the model-only approach to agents a bad fi...

12 May, 31 min
