Will inference move to the edge?

Will inference move to the edge?

Today virtually all AI compute takes place in centralized data centers, driving the demand for massive power infrastructure. But as workloads shift from training to inference, and AI applications become more latency-sensitive (autonomous vehicles, anyone?), there‘s another pathway: migrating a portion of inference from centralized computing to the edge. Instead of a gigawatt-scale data center in a remote location, we might see a fleet of smaller data centers clustered around an urban core. Some inference might even shift to our devices. So how likely is a shift like this, and what would need to happen for it to substantially reshape AI power? In this episode, Shayle talks to Dr. Ben Lee, a professor of electrical engineering and computer science at the University of Pennsylvania, as well as a visiting researcher at Google. Shayle and Ben cover topics like: The three main categories of compute: hyperscale, edge, and on-device Why training is unlikely to move from hyperscale The low latency demands of new applications like autonomous vehicles How generative AI is training us to tolerate longer latencies Why distributed inference doesn‘t face the same technical challenges as distributed training Why consumer devices may limit model capability Resources: ACM SIGMETRICS Performance Evaluation Review: A Case Study of Environmental Footprints for Generative AI Inference: Cloud versus Edge Internet of Things and Cyber-Physical Systems: Edge AI: A survey Credits: Hosted by Shayle Kann. Produced and edited by Daniel Woldorff. Original music and engineering by Sean Marquand. Stephen Lacey is our executive editor. Catalyst is brought to you by EnergyHub. EnergyHub helps utilities build next-generation virtual power plants that unlock reliable flexibility at every level of the grid. See how EnergyHub helps unlock the power of flexibility at scale, and deliver more value through cross-DER dispatch with their leading Edge DERMS platform, by visiting energyhub.com. Catalyst is brought to you by Bloom Energy. AI data centers can’t wait years for grid power—and with Bloom Energy’s fuel cells, they don’t have to. Bloom Energy delivers affordable, always-on, ultra-reliable onsite power, built for chipmakers, hyperscalers, and data center leaders looking to power their operations at AI speed. Learn more by visiting⁠ ⁠⁠BloomEnergy.com⁠. Catalyst is supported by Third Way. Third Way’s new PACE study surveyed over 200 clean energy professionals to pinpoint the non-cost barriers delaying clean energy deployment today and offers practical solutions to help get projects over the finish line. Read Third Way's full report, and learn more about their PACE initiative, at www.thirdway.org/pace.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(258)

Surprising trends in global electricity generation

Surprising trends in global electricity generation

While global electricity demand is unquestionably rising, we may nonetheless be underestimating the scale of necessary future generation. In this episode, Shayle speaks to Nic Fulghum, senior energy ...

4 Juni 44min

Building inference data centers on the high seas

Building inference data centers on the high seas

Amidst the increasing urgency of powering data centers, a new solution has entered the mix: send them out to sea. In this episode, Shayle speaks to Garth Sheldon-Coulson, co-founder and CEO of Pantha...

28 Maj 48min

A blueprint for scalable fusion power

A blueprint for scalable fusion power

For years, the prospect of commercial nuclear fusion felt a long way off. But recent breakthroughs—like Lawrence Livermore National Laboratory’s historic 2022 net energy gain—have marked a new chapter...

21 Maj 40min

Inside the global fertilizer crunch

Inside the global fertilizer crunch

While much of the world has been focused on the war in Iran’s impact on the energy sector, another arguably more impactful market has been largely overlooked: fertilizer. The global fertilizer market...

14 Maj 36min

Cracking the code on autonomous trucking

Cracking the code on autonomous trucking

Even though autonomous passenger vehicles have entered the mainstream in cities across the country, autonomous trucks still lag behind. But Humble Robotics thinks it has cracked the code with a new de...

7 Maj 43min

How AI is modernizing EPCs

How AI is modernizing EPCs

As the utility-scale solar market collides with an era defined by massive load growth, EPC (engineering, procurement, and construction) firms are rethinking their strategy to meet the moment. In this...

30 Apr 34min

Inside Google’s massive AI capex (live)

Inside Google’s massive AI capex (live)

As the race to build out artificial intelligence accelerates, the infrastructure required to support it is undergoing a remarkable transformation. In February, Google announced a plan to spend $175 bi...

23 Apr 35min

How Base Power plans to use its fresh $1B [re-published]

How Base Power plans to use its fresh $1B [re-published]

[This episode is a re-run from October 2025. Look out for a new episode of Catalyst on Thursday, April 23.] Yesterday, Base Power announced a ⁠$1 billion series C⁠, giving the residential battery c...

16 Apr 39min

Populärt inom Business & ekonomi

framgangspodden
varvet
badfluence
rss-borsens-finest
uppgang-och-fall
avanzapodden
rss-dagen-med-di
lastbilspodden
fill-or-kill
rss-inga-dumma-fragor-om-pengar
bathina-en-podcast
borsmorgon
24fragor
rss-kort-lang-analyspodden-fran-di
tabberaset
kapitalet-en-podd-om-ekonomi
market-makers
rss-den-nya-ekonomin
bilar-med-sladd
svd-tech-brief