AI and the great developer speed-up, with Joel Becker of METR

AI and the great developer speed-up, with Joel Becker of METR

This week on Complex Systems, Patrick McKenzie (patio11) is joined by Joel Becker from METR. They discuss groundbreaking research on AI coding assistants.


Joel et al’s randomized controlled trial of 16 expert developers working on major open source projects revealed a counterintuitive finding: despite predictions of 24-40% speed improvements, developers actually took 19% longer to complete tasks when using AI tools, even though they retrospectively believed they were 20% faster. The conversation explores why even sophisticated professionals struggle to accurately assess their own productivity with AI tools, the industrial organization of software development, and the implications for AI's recursive self-improvement in research and development. It also touches on other perspectives from software developers using these tools professionally, and where we can expect them to improve rapidly.

Full transcript available here: www.complexsystemspodcast.com/the-great-developer-speed-up-with-joel-becker/


Sponsor:
This episode is brought to you by Mercury, the fintech trusted by 200K+ companies — from first milestones to running complex systems. Mercury offers banking that truly understands startups and scales with them. Start today at Mercury.com

Mercury is a financial technology company, not a bank. Banking services provided by Choice Financial Group, Column N.A., and Evolve Bank & Trust; Members FDIC.

Recommended in this episode:

Timestamps:

(00:00) Intro

(00:34) Understanding AI evaluation methods

(02:04) METR's unique approach to AI evaluation

(03:10) The evolution of AI capabilities

(06:44) AI as coding assistants

(09:15) Research on AI's impact on developer productivity

(13:55) Sponsor: Mercury

(15:07) Challenges in measuring developer productivity

(20:38) Insights from the research paper

(31:26) The formalities of software development

(32:07) Automated tools and human discussions

(32:47) AI and style transfer in software

(34:35) The role of comments in AI coding

(36:51) The future of AI in software engineering

(40:25) Economic implications of AI in software

(46:53) Challenges and risks of AI in software

(59:03) Security concerns with AI-generated code

(01:04:59) Wrap


Det här avsnittet är hämtat från ett öppet RSS-flöde och publiceras inte av Podme. Det kan innehålla reklam.

Avsnitt(94)

Forty ways to pay for coffee in Japan

Forty ways to pay for coffee in Japan

Patrick McKenzie (patio11) reads his 2021 essay "Payments in Japan," tracing how Japanese consumers navigate a landscape with dozens of competing payment methods at once: credit cards, electronic mone...

25 Juni 35min

The factory behind your home loan

The factory behind your home loan

Patrick McKenzie reads from his 2022 Bits About Money essay on mortgages, making the case that a mortgage is best understood as a manufactured product, not a simple loan between a bank and a customer....

18 Juni 26min

How brokerage transfers actually work

How brokerage transfers actually work

Patrick McKenzie reads from his 2024 Bits About Money essay on ACATS, the Automated Customer Account Transfer Service that governs how Americans move investment accounts between brokerages, then updat...

4 Juni 43min

Wrong numbers and why they survive, with Aaron Brown

Wrong numbers and why they survive, with Aaron Brown

Patrick McKenzie (patio11) is joined by Aaron Brown, author of Wrong Number, to examine why institutions that produce bad statistics face so few consequences for doing so. They trace the pattern from ...

14 Maj 55min

Defendant, Censor, Politico, Spy

Defendant, Censor, Politico, Spy

The improbable but true story of how non-profits operating a private intelligence agency to combat terrorism decided to interfere with campaign infrastructure in a U.S. election.This piece includes or...

8 Maj 1h 5min

How the SPLC became financial infrastructure

How the SPLC became financial infrastructure

Patrick McKenzie reads from his latest Bits About Money essay, walking through why bank fraud charges are a prosecutor's favorite tool, how the Bank Secrecy Act's surveillance regime is designed to fo...

1 Maj 51min

The honey badger of payments

The honey badger of payments

Patrick McKenzie (patio11) reads his classic Bits about Money essay on how checks shaped the entire American payments infrastructure, from the origins of ACH to why a standard US bank account is, tech...

23 Apr 29min

Cash received is not revenue earned

Cash received is not revenue earned

Patrick McKenzie (patio11) reads his classic Bits about Money essay explaining why revenue recognition in software is more complicated than most engineers, founders, and financial reporters think. The...

16 Apr 33min

Populärt inom Business & ekonomi

badfluence
framgangspodden
varvet
rss-borsens-finest
avanzapodden
uppgang-och-fall
svd-tech-brief
rss-svart-marknad
bathina-en-podcast
lastbilspodden
rss-dagen-med-di
fill-or-kill
24fragor
rss-inga-dumma-fragor-om-pengar
borsmorgon
dynastin
rss-den-nya-ekonomin
rikatillsammans-om-privatekonomi-rikedom-i-livet
rss-kort-lang-analyspodden-fran-di
borslunch-2