Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to Apple’s MLX ecosystem, having published over 1,000 models and libraries that make open, multimodal AI accessible and performant on Apple devices. We explore his workflow for adapting new models in MLX, the trade-offs between the GPU and Neural Engine, and how optimization methods like pruning and quantization enhance performance. We also cover his work on "Fusion," a weight-space method for combining model behaviors without retraining, and his popular packages—MLX-Audio, MLX-Embeddings, and MLX-VLM—which streamline the use of MLX across different modalities. Finally, Prince introduces Marvis, a real-time speech-to-speech voice agent, and shares his vision for the future of AI, emphasizing the move towards "media models" that can handle multiple modalities, and more. The complete show notes for this episode can be found at https://twimlai.com/go/744.

Jaksot(782)

Ensuring Privacy for Any LLM with Patricia Thaine - #716

Ensuring Privacy for Any LLM with Patricia Thaine - #716

Today, we're joined by Patricia Thaine, co-founder and CEO of Private AI to discuss techniques for ensuring privacy, data minimization, and compliance when using 3rd-party large language models (LLMs)...

28 Tammi 202551min

AI Engineering Pitfalls with Chip Huyen - #715

AI Engineering Pitfalls with Chip Huyen - #715

Today, we're joined by Chip Huyen, independent researcher and writer to discuss her new book, “AI Engineering.” We dig into the definition of AI engineering, its key differences from traditional machi...

21 Tammi 202557min

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - #714

Today, we're joined by Abhijit Bose, head of enterprise AI and ML platforms at Capital One to discuss the evolution of the company’s approach and insights on Generative AI and platform best practices....

13 Tammi 202558min

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

Why Agents Are Stupid & What We Can Do About It with Dan Jeffries - #713

Today, we're joined by Dan Jeffries, founder and CEO of Kentauros AI to discuss the challenges currently faced by those developing advanced AI agents. We dig into how Dan defines agents and distinguis...

16 Joulu 20241h 8min

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

Automated Reasoning to Prevent LLM Hallucination with Byron Cook - #712

Today, we're joined by Byron Cook, VP and distinguished scientist in the Automated Reasoning Group at AWS to dig into the underlying technology behind the newly announced Automated Reasoning Checks fe...

9 Joulu 202456min

AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711

AI at the Edge: Qualcomm AI Research at NeurIPS 2024 with Arash Behboodi - #711

Today, we're joined by Arash Behboodi, director of engineering at Qualcomm AI Research to discuss the papers and workshops Qualcomm will be presenting at this year’s NeurIPS conference. We dig into th...

3 Joulu 202454min

AI for Network Management with Shirley Wu - #710

AI for Network Management with Shirley Wu - #710

Today, we're joined by Shirley Wu, senior director of software engineering at Juniper Networks to discuss how machine learning and artificial intelligence are transforming network management. We explo...

19 Marras 202453min

Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709

Why Your RAG System Is Broken, and How to Fix It with Jason Liu - #709

Today, we're joined by Jason Liu, freelance AI consultant, advisor, and creator of the Instructor library to discuss all things retrieval-augmented generation (RAG). We dig into the tactical and strat...

11 Marras 202458min

Suosittua kategoriassa Politiikka ja uutiset

uutiscast
aikalisa
ootsa-kuullut-tasta-2
politiikan-puskaradio
rss-ootsa-kuullut-tasta
tervo-halme
rss-vaalirankkurit-podcast
rss-podme-livebox
et-sa-noin-voi-sanoo-esittaa
rss-asiastudio
otetaan-yhdet
rss-hyvaa-huomenta-bryssel
rss-merja-mahkan-rahat
the-ulkopolitist
aihe
rikosmyytit
rss-aijat-hopottaa-podcast
rss-kaikki-uusiksi
rss-raha-talous-ja-politiikka
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset