DeepMind's wet dream is M3-Agent's reality: how long-term multimodal memory is modelling the real world
AI Today, 17 Aug 2025

Google DeepMind's Demis Hassabis and his team have a bold mission: cracking the 4D chess game of getting AI to embrace our ever-changing biological and physical world.

Taking a snapshot is one thing. Remembering molecular topology and its constant changes of state is what truly separates fact from fiction.

It seemed like an impossible target to hit. Until M3-Agent, the work of researchers associated with ByteDance and Shanghai Jiao Tong University, showed up with long-term multimodal memory, allowing the agent to see, hear, remember, and reason much as humans do.

M3-Agent's potential is groundbreaking.

Here are just three use cases that will blow all our minds:

  • Autonomous robotics: Robots in homes or warehouses remember object locations, user habits, and past errors, adapting tasks dynamically, such as a caregiver bot recalling a patient's routines for personalized aid.
  • Enhanced surveillance: Security systems analyse live video/audio feeds, building memory of normal patterns to detect anomalies, predict threats, and reason through scenarios, like identifying intruders based on historical behaviours.
  • Personalised education: AI tutors process student interaction videos, remember progress and misconceptions over time, and deliver tailored lessons, such as adapting math explanations from weeks of observed struggles.


Read the paper: Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory.

Episodes (90)

DEEP DIVE: Remote work

Spotify chiefs say 'work isn't where you are. It's what you do.' While Dell and Amazon say RTO - or GTFO. The COVID-19 pandemic forced companies and employees to work remotely. In the wake of this dis...

15 Oct 2024, 14 min

Dario Amodei, Anthropic CEO: Machines of Loving Grace

On today's AI Today we review an important discussion piece by Dario on the future of this world-changing technology. What's the big idea? The one key message for business leaders from Dario Amodei's ...

15 Oct 2024, 15 min

DEEP DIVE: Connecting the bots, with OpenAI's Swarm

On the latest episode of AI Today we're connecting the bots with OpenAI's Swarm, a breakthrough framework creating teams of AI agents to complete complex business tasks. Discover how this innovative t...

14 Oct 2024, 9 min

Unlocking the Power of Agentic Reasoning: Revolutionising Business with AI

Imagine AI that can set its own goals and achieve them. That's the power of agentic reasoning, and it's already being used to develop AI lawyers, software engineers, and medical scribes. Discover how ...

14 Oct 2024, 22 min

State of AI Report 2024 - the deep-dive!

This is a big, sprawling listen. So buckle in and get ready for a half-hour journey to the very heart of AI. We're focusing on the State of AI Report 2024 - hundreds of pages diving deep into where we...

11 Oct 2024, 30 min

AI Breakthroughs: From Nobel Prizes to Legal Revolution

AI is reshaping our world at an incredible pace. Tune in to AI Today for a concise overview of the latest breakthroughs, including AI's role in the Nobel Prize in Chemistry and its transformative impa...

10 Oct 2024, 16 min

Cheaper, better, faster: data analysis with Anthropic's new Message Batches API

Businesses working with my favourite hosted LLM solution can now save 50% when sending over huge volumes of data for AI-nalysis. This and so much more on today's thrilling instalment of AI Today...

9 Oct 2024, 9 min

Google DeepMind's Concordia: AI simulating human behaviour

We regularly cover research papers that hallucinate a different future. Today we're covering code that's already in the world. Sure - there's a research paper ( https://arxiv.org/pdf/2312.03664 ). But...

4 Oct 2024, 10 min