DeepMind's wet dream is M3-Agent's reality: how long-term multimodal memory is modelling the real world
AI Today · 17 Aug 2025

Google DeepMind's Demis Hassabis and his team have a bold mission: cracking the 4D chess game of getting AI to grasp our ever-changing biological and physical world.

Taking a snapshot is one thing. Remembering molecular topology and its constant changes of state is what truly separates fact from fiction.

It seemed like an impossible target to hit. Until M3-Agent, the work of researchers associated with ByteDance and Shanghai Jiao Tong University, showed up with long-term multimodal memory - allowing the agent to see, hear, remember, and reason much as humans do.

M3-Agent's potential is groundbreaking.

Here are just three use cases that will blow all our minds:

  • Autonomous robotics: Robots in homes or warehouses remember object locations, user habits, and past errors, and adapt their tasks dynamically, such as a caregiver bot recalling a patient's routines to give personalised aid.
  • Enhanced surveillance: Security systems analyse live video and audio feeds, building a memory of normal patterns to detect anomalies, predict threats, and reason through scenarios, such as identifying intruders based on historical behaviours.
  • Personalised education: AI tutors process videos of student interactions, remember progress and misconceptions over time, and deliver tailored lessons, such as adapting maths explanations after weeks of observed struggles.


Read the paper: Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory.

Episodes (90)

Eye of Horus with xAI's API

Now anyone can know what everyone is doing in real time! xAI's API gives you access to the 560Gb of data generated every day by the millions of users on X, formerly Twitter. There are some fantastic o...

22 Oct 2024 · 9 min

Automation and collaboration with AI communication protocol Agora

Business is a spider's web of complex processes. And up until now, it's been super hard to get AI working on multi-task processes, without some PhD-level hacking which, unless you're using open source...

21 Oct 2024 · 11 min

Secrets to next-level AI results

Efficiency. Productivity. Growth. That's why we're here. And on today's show, we're hitting all three with the force of a bullet train. 3 AI experts share how they create the best results, in three to...

20 Oct 2024 · 16 min

Understanding our physical world, with Archetype AI's Newton Large Behaviour Model

AI's made its name in the digital space. But thanks to Archetype AI, it's broken away from its silicon prison to learn about our physical world. Archetype's Newton, a Large Behaviour Model, is current...

18 Oct 2024 · 22 min

Get to the heart of Microsoft's AI for Health

Imagine a world where a doctor, armed with an AI-powered assistant, can diagnose diseases like pancreatic cancer earlier, potentially saving thousands of lives annually. This isn't science fiction; it...

17 Oct 2024 · 7 min

Thinking of the future - with Meta's Thought Preference Optimisation

Imagine your marketing team brainstorming hundreds of genuinely inspired ideas in seconds. That's the potential of Thought Preference Optimisation (TPO), a new AI technology from Meta. TPO is like giv...

16 Oct 2024 · 6 min

How to win a $180,000 AI job

Former Deloitte consultant Varun Kulkarni spent 8 months honing his application strategy to score a huge payday with a senior AI product manager gig at Cisco. On today's show we explore how he did it ...

16 Oct 2024 · 11 min

Playbooks to moonshots: how AI is upsetting our apple carts...

AI is a paradigm shift for thinking and understanding. Our old frameworks and models are being challenged into obscurity by a new way of looking at the world. And even the things we used to think of a...

16 Oct 2024 · 8 min