Hallucination: a bitter pill to swallow
AI Today13 Touko 2025

Hallucination: a bitter pill to swallow

AI hallucinates 100% of the time. That's by design - without hallucinating the next word, this transformer architecture wouldn't exist.

Thankfully, LLMs built for general purpose applications are right 80% of the time. But that still leaves one in five outputs being questionable; not especially reassuring if you're an air traffic controller, or cardiologist.

How can we ever truly trust the machine?

On this episode of AI Today, we embark on a groundbreaking quest to ground these 'digital dreamers' in reality.

Discover how cutting-edge research is moving beyond just detecting the problem, to actively reducing the occurrence of incorrect hallucinations.

We delve into innovative techniques that employ internal fact-checking mechanisms, intelligently split complex queries to avoid confusing collisions, and meticulously track word-by-word groundedness against source material.

You'll learn how this confidence-boosting research is paving the way for the AI credibility revolution, a future where technology is not just remarkably powerful, but significantly more dependable.

Join us to understand the innovative solutions building AI you can rely on, where AI becomes trusted accelerator of success...

Jaksot(90)

DEEP DIVE: Remote work

DEEP DIVE: Remote work

Spotify chiefs say 'work isn't where you are. It's what you do.' While Dell and Amazon say RTO - or GTFO. The COVID-19 pandemic forced companies and employees to work remotely. In the wake of this dis...

15 Loka 202414min

Dario Amodei, Anthropic CEO: Machines of Loving Grace

Dario Amodei, Anthropic CEO: Machines of Loving Grace

On today's AI Today we review an important discussion piece by Dario on the future of this world-changing technology. What's the big idea? The one key message for business leaders from Dario Amodei's ...

15 Loka 202415min

DEEP DIVE: Connecting the bots, with OpenAI's Swarm

DEEP DIVE: Connecting the bots, with OpenAI's Swarm

On the latest episode of AI Today we're connecting the bots with OpenAI's Swarm, a breakthrough framework creating teams of AI agents to complete complex business tasks. Discover how this innovative t...

14 Loka 20249min

Unlocking the Power of Agentic Reasoning: Revolutionising Business with AI

Unlocking the Power of Agentic Reasoning: Revolutionising Business with AI

Imagine AI that can set its own goals and achieve them. That's the power of agentic reasoning, and it's already being used to develop AI lawyers, software engineers, and medical scribes. Discover how ...

14 Loka 202422min

State of AI Report 2024 - the deep-dive!

State of AI Report 2024 - the deep-dive!

This is a big, sprawling listen. So buckle in and get ready for a half-hour journey to the very heart of AI. We're focusing on the State of AI Report 2024 - hundreds of pages diving deep into where we...

11 Loka 202430min

AI Breakthroughs: From Nobel Prizes to Legal Revolution

AI Breakthroughs: From Nobel Prizes to Legal Revolution

AI is reshaping our world at an incredible pace. Tune in to AI Today for a concise overview of the latest breakthroughs, including AI's role in the Nobel Prize in Chemistry and its transformative impa...

10 Loka 202416min

Cheaper, better, faster: data analysis with Anthropic's new Message Batches API

Cheaper, better, faster: data analysis with Anthropic's new Message Batches API

Businesses working with my favourite hosted LLM solution can now save 50% when sending over huge volumes of data for AI-nalysis. This and so much more on today's thrilling instalment of AI Today...

9 Loka 20249min

Google DeepMind's Concordia: AI simulating human behaviour

Google DeepMind's Concordia: AI simulating human behaviour

We regularly cover research papers that hallucinate a different future. Today we're covering code that's already in the world. Sure - there's a research paper ( https://arxiv.org/pdf/2312.03664 ). But...

4 Loka 202410min