Andriy Burkov - The TRUTH About Large Language Models and Agentic AI (with Andriy Burkov, Author "The Hundred-Page Language Models Book")

Andriy Burkov - The TRUTH About Large Language Models and Agentic AI (with Andriy Burkov, Author "The Hundred-Page Language Models Book")

Andriy Burkov is a renowned machine learning expert and leader. He's also the author of (so far) three books on machine learning, including the recently-released "The Hundred-Page Language Models Book", which takes curious people from the very basics of language models all the way up to building their own LLM. Andriy is also a formidable online presence and is never afraid to call BS on over-the-top claims about AI capabilities via his punchy social media posts.

Episode highlights: 1. Large Language Models are neither magic nor conscious

LLMs boil down to relatively simple mathematics at an unfathomably large scale. Humans are terrible at visualising big numbers and cannot comprehend the size of the dataset or the number of GPUs that have been used to create the models. You can train the same LLM on a handful of records and get garbage results, or throw millions of dollars at it and get good results, but the fundamentals are identical, and there's no consciousness hiding in between the equations. We see good-looking output, and we think it's talking to us. It isn't.

2. As soon as we saw it was possible to do mathematics on words, LLMs were inevitable

There were language models before LLMs, but the invention of the transformer architecture truly accelerated everything. That said, the fundamentals trace further back to "simpler" algorithms, such as word2vec, which proved that it is possible to encode language information in a numeric format, which meant that the vast majority of linguistic information could be represented by embeddings, which enabled people to run equations on language. After that, it was just a matter of time before they got scaled out.

3. LLMs look intelligent because people generally ask about things they already know about

The best way to be disappointed by an LLM's results is to ask detailed questions about something you know deeply. It's quite likely that it'll give good results to start with, because most people's knowledge is so unoriginal that, somewhere in the LLM's training data, there are documents that talk about the thing you asked about. But, it will degrade over time and confidently keep writing even when it doesn't know the answer. These are not easily solvable problems and are, in fact, fundamental parts of the design of an LLM.

4. Agentic AI relies on unreliable actors with no true sense of agency

The concept of agents is not new, and people have been talking about them for years. The key aspect of AI agents is that they need self-motivation and goals of their own, rather than being told to have goals and then simulating the desire to achieve them. That's not to say that some agents are not useful in their own right, but the goal of fully autonomous, agentic systems is a long way off, and may not even be solvable.

5. LLMs represent the most incredible technical advance since the personal computer, but people should quit it with their most egregious claims

LLMs are an incredible tool and can open up whole new worlds for people who are able to get the best out of them. There are limits to their utility, and some of their shortcomings are likely unsolvable, but we should not minimise their impact. However, there are unethical people out there making completely unsubstantiated claims based on zero evidence and a fundamental misunderstanding of how these models work. These people are scaring people and encouraging terrible decision-making from the gullible. We need to see through the hype.

Buy "The Hundred-Page Language Model Book"

"Large language models (LLMs) have fundamentally transformed how machines process and generate information. They are reshaping white-collar jobs at a pace comparable only to the revolutionary impact of personal computers. Understanding the mathematical foundations and inner workings of language models has become crucial for maintaining relevance and competitiveness in an increasingly automated workforce. This book guides you through the evolution of language models, starting from machine learning fundamentals. Rather than presenting transformers right away, which can feel overwhelming, we build understanding of language models step by step—from simple count-based methods through recurrent neural networks to modern architectures. Each concept is grounded in clear mathematical foundations and illustrated with working Python code."

Check it out on the book's website: https://thelmbook.com/.

You can also check out Machine Learning Engineering: https://www.mlebook.com and The Hundred-Page Machine Learning Book: https://www.themlbook.com/.

Follow Andriy

You can catch up with Andriy here:

Avsnitt(270)

Myles Sutholt's Hot Take - Leaders Need to Get Better at Using Data for PM Performance Reviews (with Myles Sutholt, Head of Product @ Field Intelligence Inc)

Myles Sutholt's Hot Take - Leaders Need to Get Better at Using Data for PM Performance Reviews (with Myles Sutholt, Head of Product @ Field Intelligence Inc)

Myles Sutholt is a Germany-based product leader working for an Africa-based startup where he's helping to digitise the health supply chain across the continent, with a "laser focus" on creating user v...

11 Feb 202520min

Martin Eriksson - Most PMs Aren't Good At Strategy - Enter The Decision Stack! (with Martin Eriksson, Co-founder of Mind the Product & Creator of The Decision Stack)

Martin Eriksson - Most PMs Aren't Good At Strategy - Enter The Decision Stack! (with Martin Eriksson, Co-founder of Mind the Product & Creator of The Decision Stack)

Martin Eriksson is the co-founder of Mind the Product, and co-author of the "Product Leadership" book. Martin has worked with a multitude of companies and has been heavily involved in the VC side of p...

2 Feb 20251h 7min

Martijn Versteeg's Hot Take - PMs Need to Spend Less Time Learning and More Time Doing (with Martijn Versteeg, Founder @ Group Effort & Organiser @ Product Mastery Conference)

Martijn Versteeg's Hot Take - PMs Need to Spend Less Time Learning and More Time Doing (with Martijn Versteeg, Founder @ Group Effort & Organiser @ Product Mastery Conference)

Martijn Versteeg is the founder of Group Effort, an organisation that fosters connections & facilitates the growth of scale-up leaders through peer groups, offsites and workshops. His hot take? That p...

26 Jan 202522min

Martijn Moret's Hot Take - Most PMs Neglect Data Due To a Lack of Time and Skills (with Martijn Moret, CEO @ DataSquirrel.ai)

Martijn Moret's Hot Take - Most PMs Neglect Data Due To a Lack of Time and Skills (with Martijn Moret, CEO @ DataSquirrel.ai)

Martijn Moret is the founder of DataSquirrel.ai, a company focused on leveraging AI to humanise and simplify data analysis for product managers and non-tech managers. His hot take? Most product manage...

10 Jan 202522min

OKIP LIVE: Jason and Maja's Christmas Fireside Chat (with Maja Voje, Founder @ Growth Lab and author "Go-To-Market Strategist")

OKIP LIVE: Jason and Maja's Christmas Fireside Chat (with Maja Voje, Founder @ Growth Lab and author "Go-To-Market Strategist")

🎄 Deck the Halls with Go-To-Market! 🎄 I spoke with Maja Voje for a convivial Christmas chat about all things product and growth. We discussed: 2024 Retrospectives and 2025 Predictions Product M...

23 Dec 202458min

Adam Dille's Hot Take - The Product Trio is Outdated - Enter the Product Square! (with Adam Dille, SVP Product Engineering at Quantum Metric)

Adam Dille's Hot Take - The Product Trio is Outdated - Enter the Product Square! (with Adam Dille, SVP Product Engineering at Quantum Metric)

Adam Dille is the SVP of Product Engineering at Quantum Metric, a company specialising in experience analytics for some of the world's biggest brands. Despite his engineering roots, Adam's relentless ...

17 Dec 202421min

Grace Yusuff's Hot Take - Introversion is a PM Superpower (with Grace Yusuff, Product Manager & Early Careers Mentor)

Grace Yusuff's Hot Take - Introversion is a PM Superpower (with Grace Yusuff, Product Manager & Early Careers Mentor)

Grace Yusuff is a London-based "reluctant product manager" and introvert who thought she could never do the job. She has since fallen in love with the role and now works as a product manager and early...

9 Dec 202423min

Assaph Mehr's Hot Take - AI Is Just A Tool - What Matters Is How We Use It (with Assaph Mehr, Product Leader and Fantasy Author)

Assaph Mehr's Hot Take - AI Is Just A Tool - What Matters Is How We Use It (with Assaph Mehr, Product Leader and Fantasy Author)

Assaph Mehr is an Australia-based product & people leader as well as a published fantasy author, who also uses his writing chops to produce a newsletter, "Rise of the Product Leader". His hot take? Th...

1 Dec 202420min

Populärt inom Business & ekonomi

framgangspodden
varvet
rss-jossan-nina
rss-svart-marknad
svd-tech-brief
badfluence
rss-borsens-finest
uppgang-och-fall
avanzapodden
bathina-en-podcast
fill-or-kill
tabberaset
24fragor
rss-kort-lang-analyspodden-fran-di
rss-dagen-med-di
lastbilspodden
kapitalet-en-podd-om-ekonomi
borsmorgon
rss-inga-dumma-fragor-om-pengar
rss-veckans-trade