Inferact: Building the Infrastructure That Runs Modern AI

Inferact: Building the Infrastructure That Runs Modern AI

Inferact is a new AI infrastructure company founded by the creators and core maintainers of vLLM. Its mission is to build a universal, open-source inference layer that makes large AI models faster, cheaper, and more reliable to run across any hardware, model architecture, or deployment environment. Together, they broke down how modern AI models are actually run in production, why “inference” has quietly become one of the hardest problems in AI infrastructure, and how the open-source project vLLM emerged to solve it. The conversation also looked at why the vLLM team started Inferact and their vision for a universal inference layer that can run any model, on any chip, efficiently.

Follow Matt Bornstein on X: https://twitter.com/BornsteinMatt

Follow Simon Mo on X: https://twitter.com/simon_mo_

Follow Woosuk Kwon on X: https://twitter.com/woosuk_k

Follow vLLM on X: https://twitter.com/vllm_project

Stay Updated:

Find a16z on YouTube: YouTube

Find a16z on X

Find a16z on LinkedIn

Listen to the a16z Show on Spotify

Listen to the a16z Show on Apple Podcasts

Follow our host: https://twitter.com/eriktorenberg

Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.


Hosted by Simplecast, an AdsWizz company. See pcm.adswizz.com for information about our collection and use of personal data for advertising.

Avsnitt(1000)

Emil Michael: Iran, Anthropic and the Future of AI at the Pentagon

Emil Michael: Iran, Anthropic and the Future of AI at the Pentagon

This conversation with Emil Michael, undersecretary of defense for research and engineering and acting director of the Defense Innovation Unit, was recorded at the a16z American Dynamism Summit in Was...

13 Mars 27min

Palantir CEO Alex Karp on the Zero-Sum AI Race

Palantir CEO Alex Karp on the Zero-Sum AI Race

This conversation with Alex Karp, cofounder and CEO of Palantir, was recorded at the a16z American Dynamism Summit in Washington, D.C. Karp discusses the role of technology in modern warfare, Silicon ...

12 Mars 32min

What It Takes to Clear a Million Crimes a Year with Flock Safety's CEO

What It Takes to Clear a Million Crimes a Year with Flock Safety's CEO

In this episode, previously aired on Cheeky Pint, Garrett Langley describes how a stolen gun in his Atlanta neighborhood led him to build Flock Safety, now deployed in more than 6,000 cities and invol...

11 Mars 1h 46min

The Top 100 Gen AI Consumer Apps

The Top 100 Gen AI Consumer Apps

Anish Acharya speaks with Olivia Moore about the latest edition of the a16z Top 100 AI Apps report. They cover why ChatGPT is still 30 times bigger than Claude on web, how the three major platforms ar...

10 Mars 40min

Andrew Huberman: Peptides, Sleep Tech, and the End of Obesity

Andrew Huberman: Peptides, Sleep Tech, and the End of Obesity

Daisy Wolf speaks with Dr. Andrew Huberman, professor of neurobiology and ophthalmology at Stanford University and host of the Huberman Lab podcast. They discuss how the pandemic sparked a consumer he...

9 Mars 51min

Atlassian CEO on the SaaS Apocalypse, AI Agents & What Comes Next

Atlassian CEO on the SaaS Apocalypse, AI Agents & What Comes Next

Alex Rampell and Erik Torenberg speak with Mike Cannon-Brookes, cofounder and CEO of Atlassian, about how to make sense of the SaaS selloff, why not all software companies face the same AI-driven risk...

6 Mars 55min

Ben Thompson: Anthropic, the Pentagon, and the Limits of Private Power

Ben Thompson: Anthropic, the Pentagon, and the Limits of Private Power

In this conversation, previously aired on TBPN, John Coogan and Jordi Hays speak with Ben Thompson, founder of Stratechery, about his essay "Anthropic and Alignment" and the broader collision between ...

5 Mars 36min

Deploying AI in Healthcare

Deploying AI in Healthcare

a16z general partner Julie Yoo talks with Nikhil Buduma, CEO and cofounder of Ambience Healthcare, to discuss how AI is transforming clinical workflows. They cover the early days of deep learning, why...

4 Mars 49min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-jossan-nina
rss-borsens-finest
rss-svart-marknad
svd-tech-brief
avanzapodden
uppgang-och-fall
fill-or-kill
rss-inga-dumma-fragor-om-pengar
borsmorgon
rss-dagen-med-di
lastbilspodden
bathina-en-podcast
rss-kort-lang-analyspodden-fran-di
affarsvarlden
market-makers
rss-den-nya-ekonomin
borslunch-2