Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

Today, we're joined by Albert Gu, assistant professor at Carnegie Mellon University, to discuss his research on post-transformer architectures for multi-modal foundation models, with a focus on state-space models in general and Albert’s recent Mamba and Mamba-2 papers in particular. We dig into the efficiency of the attention mechanism and its limitations in handling high-resolution perceptual modalities, and the strengths and weaknesses of transformer architectures relative to alternatives for various tasks. We dig into the role of tokenization and patching in transformer pipelines, emphasizing how abstraction and semantic relationships between tokens underpin the model's effectiveness, and explore how this relates to the debate between handcrafted pipelines versus end-to-end architectures in machine learning. Additionally, we touch on the evolving landscape of hybrid models which incorporate elements of attention and state, the significance of state update mechanisms in model adaptability and learning efficiency, and the contribution and adoption of state-space models like Mamba and Mamba-2 in academia and industry. Lastly, Albert shares his vision for advancing foundation models across diverse modalities and applications. The complete show notes for this episode can be found at https://twimlai.com/go/693.

Avsnitt(779)

Generating Ground-Level Images From Overhead Imagery Using GANs with Yi Zhu - TWiML Talk #172

Generating Ground-Level Images From Overhead Imagery Using GANs with Yi Zhu - TWiML Talk #172

Today we’re joined by Yi Zhu, a PhD candidate at UC Merced focused on geospatial image analysis. In our conversation, Yi and I take a look at his recent paper “What Is It Like Down There? Generating D...

13 Aug 201838min

Vision Systems for Planetary Landers and Drones with Larry Matthies - TWiML Talk #171

Vision Systems for Planetary Landers and Drones with Larry Matthies - TWiML Talk #171

Today we’re joined by Larry Matthies, Sr. Research Scientist and head of computer vision in the mobility and robotics division at JPL. In our conversation, we discuss two talks he gave at CVPR a few w...

9 Aug 201843min

Learning Semantically Meaningful and Actionable Representations with Ashutosh Saxena - TWiML Talk #170

Learning Semantically Meaningful and Actionable Representations with Ashutosh Saxena - TWiML Talk #170

In this episode i'm joined by Ashutosh Saxena, a veteran of Andrew Ng’s Stanford Machine Learning Group, and co-founder and CEO of Caspar.ai. Ashutosh and I discuss his RoboBrain project, a computatio...

6 Aug 201845min

AI Innovation for Clinical Decision Support with Joe Connor - TWiML Talk #169

AI Innovation for Clinical Decision Support with Joe Connor - TWiML Talk #169

In this episode I speak with Joe Connor, Founder of Experto Crede. In our conversation, we explore his experiences bringing AI powered healthcare projects to market in collaboration with the UK Natio...

2 Aug 201842min

Dynamic Visual Localization and Segmentation with Laura Leal-Taixé -TWiML Talk #168

Dynamic Visual Localization and Segmentation with Laura Leal-Taixé -TWiML Talk #168

In this episode I'm joined by Laura Leal-Taixé, Professor at the Technical University of Munich where she leads the Dynamic Vision and Learning Group. In our conversation, we discuss several of her r...

30 Juli 201844min

Conversational AI for the Intelligent Workplace with Gillian McCann - TWiML Talk #167

Conversational AI for the Intelligent Workplace with Gillian McCann - TWiML Talk #167

In this episode I'm joined by Gillian McCann, Head of Cloud Engineering and AI at Workgrid Software. In our conversation, which focuses on Workgrid’s use of cloud-based AI services, Gillian details so...

26 Juli 201836min

Computer Vision and Intelligent Agents for Wildlife Conservation with Jason Holmberg - TWiML Talk #166

Computer Vision and Intelligent Agents for Wildlife Conservation with Jason Holmberg - TWiML Talk #166

In this episode, I'm joined by Jason Holmberg, Executive Director and Director of Engineering at WildMe. Jason and I discuss Wildme's pair of open source computer vision based conservation projects, W...

22 Juli 201848min

Pragmatic Deep Learning for Medical Imagery with Prashant Warier - TWiML Talk #165

Pragmatic Deep Learning for Medical Imagery with Prashant Warier - TWiML Talk #165

In this episode I'm joined by Prashant Warier, CEO and Co-Founder of Qure.ai. We discuss the company’s work building products for interpreting head CT scans and chest x-rays. We look at knowledge gain...

19 Juli 201836min

Populärt inom Politik & nyheter

motiv
p3-krim
spar
svenska-fall
flashback-forever
rss-krimstad
rss-viva-fotboll
rss-sanning-konsekvens
aftonbladet-daily
aftonbladet-krim
rss-vad-fan-hande
rss-krimreportrarna
olyckan-inifran
rss-frandfors-horna
fordomspodden
dagens-eko
rss-flodet
svd-ledarredaktionen
politiken
rss-aftonbladet-krim