EA Forum Podcast (Curated & popular)2 Feb

[Linkpost] “Inference Scaling and the Log-x Chart” by Toby_Ord

This is a link post.

Improving model performance by scaling up inference compute is the next big thing in frontier AI. But the charts being used to trumpet this new paradigm can be misleading. While they initially appear to show steady scaling and impressive performance for models like o1 and o3, they really show poor scaling (characteristic of brute force) and little evidence of improvement between o1 and o3. I explore how to interpret these new charts and what evidence for strong scaling and progress would look like.

From scaling training to scaling inference

The dominant trend in frontier AI over the last few years has been the rapid scale-up of training — using more and more compute to produce smarter and smarter models. Since GPT-4, this kind of scaling has run into challenges, so we haven’t yet seen models much larger than GPT-4. But we have seen a recent shift towards scaling up the compute used during deployment (aka 'test-time compute’ or ‘inference compute’), with more inference compute producing smarter models.

You could think of this as a change in strategy from improving the quality of your employees’ work via giving them more years of training in which acquire [...]

---

First published:
February 2nd, 2026

Source:
https://forum.effectivealtruism.org/posts/zNymXezwySidkeRun/inference-scaling-and-the-log-x-chart

Linkpost URL:
https://www.tobyord.com/writing/inference-scaling-and-the-log-x-chart

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(250)

“Let’s taboo the V-word” by lincolnq

“How long have you been v*g*n?” This is one of the most common icebreakers at animal protection events. It's a baseline assumption, and it mostly holds true: if you’re out advocating for animals not t...

14 Juli 12min

“Giving What We Can’s first YouTube Video is out now!” by JustinPortela

Hello! I'm Justin Portela. I got hired by GWWC to make YouTube videos after AI in Context did such a kickass job. My channel is using that same cinematic, high-production value beauty to talk about e...

9 Juli 1min

“I’m never satisfied” by Ajeya

Note: This post was crossposted from Planned Obsolescence by the Forum team, with the author's permission. The author may not see or respond to comments on this post. But we get the job done I was twe...

8 Juli 6min

“Maybe do the thing you wish CEA would do” by alejoacelas 🔸

I used AI to fix transcription errors, rerrarange the ideas, and suggest tweaks to the title and some sentences. Three of the most exciting projects to come out of EA in recent years are, in a vague s...

8 Juli 4min

“Mabye do the thing you wish CEA would do” by AlejoAcelas🔸

8 Juli 4min

“Possible mistake EAs are making and shout out to Pause AI UK” by Michelle_Hutchinson

I think right now EAs might be making a significant mistake by paying insufficient attention to the political realm. As EAs we tend to figure out what's most impactful for us to work on and focus hard...

29 Juni 6min

“Coming Around To Political Donations” by Jeff Kaufman 🔸

Five years ago I read a post on the EA Forum arguing that "election campaign contributions might be a way in which you can have a substantial impact as a small donor". It struck me as weird but plausi...

12 Juni 4min

“animal welfare has an evidence problem” by matthes

Why I stopped donating to animal welfare charities but feel more motivated than ever to redirect money and talent to the cause. I have wanted to write this post for a while. It is an uncomfortable thi...

6 Juni 26min