#222 Andrew Feldman: How Cerebras Systems Is Disrupting AI Inference
Eye On A.I., 28 Nov 2024

This episode is sponsored by Shopify.

Shopify is a commerce platform that allows anyone to set up an online store and sell their products. Whether you're selling online, on social media, or in person, Shopify has every base covered. With Shopify you can sell physical and digital products, as well as services, memberships, ticketed events, rentals, and even classes and lessons.

Sign up for a $1 per month trial period at http://shopify.com/eyeonai



In this episode of the Eye on AI podcast, Andrew D. Feldman, Co-Founder and CEO of Cerebras Systems, unveils how Cerebras is disrupting AI inference and high-performance computing.

Andrew joins Craig Smith to discuss the groundbreaking wafer-scale engine, Cerebras' record-breaking inference speeds, and the future of AI in enterprise workflows. From designing the fastest inference platform to simplifying AI deployment with an API-driven cloud service, Cerebras is setting new standards in AI hardware innovation.

We explore the shift from GPUs to custom architectures, the rise of large language models like Llama and GPT, and how AI is driving enterprise transformation. Andrew also dives into the debate over open-source vs. proprietary models, AI's role in climate mitigation, and Cerebras' partnerships with global supercomputing centers and industry leaders.

Discover how Cerebras is shaping the future of AI inference and why speed and scalability are redefining what's possible in computing.

Don't miss this deep dive into AI's next frontier with Andrew Feldman.

Like, subscribe, and hit the notification bell for more episodes!



Stay Updated:

Craig Smith Twitter: https://twitter.com/craigss

Eye on A.I. Twitter: https://twitter.com/EyeOn_AI



(00:00) Intro to Andrew Feldman & Cerebras Systems

(00:43) The rise of AI inference

(03:16) Cerebras' API-powered cloud

(04:48) Competing with NVIDIA's CUDA

(06:52) The rise of Llama and LLMs

(07:40) OpenAI's hardware strategy

(10:06) Shifting focus from training to inference

(13:28) Open-source vs proprietary AI

(15:00) AI's role in enterprise workflows

(17:42) Edge computing vs cloud AI

(19:08) Edge AI for consumer apps

(20:51) Machine-to-machine AI inference

(24:20) Managing uncertainty with models

(27:24) Impact of U.S.–China export rules

(30:29) U.S. innovation policy challenges

(33:31) Developing wafer-scale engines

(34:45) Cerebras' fast inference service

(37:40) Global partnerships in AI

(38:14) AI in climate & energy solutions

(39:58) Training and inference cycles

(41:33) AI training market competition

This episode is sourced from an open RSS feed and is not published by Podme. It may therefore contain advertisements.