The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)14 Apr

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model Computational Graphs" and "On the Biology of a Large Language Model." Emmanuel explains how his team developed mechanistic interpretability methods to understand the internal workings of Claude by replacing dense neural network components with sparse, interpretable alternatives. The conversation explores several fascinating discoveries about large language models, including how they plan ahead when writing poetry (selecting the rhyming word "rabbit" before crafting the sentence leading to it), perform mathematical calculations using unique algorithms, and process concepts across multiple languages using shared neural representations. Emmanuel details how the team can intervene in model behavior by manipulating specific neural pathways, revealing how concepts are distributed throughout the network's MLPs and attention mechanisms. The discussion highlights both capabilities and limitations of LLMs, showing how hallucinations occur through separate recognition and recall circuits, and demonstrates why chain-of-thought explanations aren't always faithful representations of the model's actual reasoning. This research ultimately supports Anthropic's safety strategy by providing a deeper understanding of how these AI systems actually work. The complete show notes for this episode can be found at https://twimlai.com/go/727.

Upptäck Premium

Prova 14 dagar kostnadsfritt

Skaffa Premium

Avsnitt(763)

From Particle Physics to Audio AI with Scott Stephenson - TWiML Talk #19

This week my guest is Scott Stephenson. Scott is co-Founder & CEO of Deepgram, which has developed an AI-based platform for indexing and searching audio and video. Scott and I cover a ton of interesting topics including applying machine learning techniques to particle physics, his time in a lab 2 miles below the surface of the earth, applying neural networks to audio, and the Deep Learning Framework Kur that his company open-sourced. The show notes can be found at twimlai.com/talk/19.

14 Apr 201756min

(5/5) AlphaVertex - Creating a Worldwide Financial Knowledge Graph - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with AlphaVertex, a FinTech startup creating a worldwide financial knowledge graph to help investors predict stock prices. The notes for this series can be found at twimlai.com/nexuslab. Thanks to Future Labs at NYU Tandon and ffVenture Capital for sponsoring the series!

7 Apr 201726min

(4/5) Behold.ai - Increasing Efficiency of Healthcare Insurance Billing with NLP - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Behold.ai, which uses computer vision and natural language processing techniques to bring efficiencies to the world of healthcare insurance billing. The notes for this series can be found at twimlai.com/nexuslab. Thanks to Future Labs at NYU Tandon and ffVenture Capital for sponsoring the series!

7 Apr 201716min

(3/5) Cambrian Intelligence - Using AI to Simplify the Programming of Robots - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Cambrian Intelligence, a company using AI to simplify the programming of industrial robots for the automotive industry. The notes for this series can be found at twimlai.com/nexuslab. Thanks to Future Labs at NYU Tandon and ffVenture Capital for sponsoring the series!

7 Apr 201723min

(2/5) Klustera - Location-Based Intelligence for Smarter Marketing - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with Klustera, a company applying location-based intelligence and machine learning to help brands execute smarter marketing campaigns. The notes for this series can be found at twimlai.com/nexuslab. Thanks to Future Labs at NYU Tandon and ffVenture Capital for sponsoring the series!

7 Apr 201722min

(1/5) HelloVera - AI-Powered Customer Support - TWiML Talk #18

This week I'm on location at NYU/ffVC AI NexusLab startup accelerator, speaking with founders from the 5 companies in the program's inaugural batch. This interview is with HelloVera, a company applying artificial intelligence to the challenge of automating customer support experiences. The notes for this series can be found at https://twimlai.com/nexuslab. Thanks to Future Labs at NYU Tandon and ffVenture Capital for sponsoring the series!

7 Apr 201725min

Interactive Machine Learning Systems with Alekh Agarwal - TWiML Talk #17

This week my guest is Alekh Agarwal. Alekh is a researcher with Microsoft Research whose research is focused on Interactive Machine Learning. In our discussion, Alekh and I discuss various aspects of this exciting area of research such as active learning, reinforcement learning, contextual bandits and more.

31 Mars 201730min

Machine Learning in Cybersecurity with Evan Wright - TWiML Talk #16

This week my guest is Evan Wright, principal data scientist at cybersecurity startup Anomali. In my interview with Evan, he and I discussed about a number of topics surrounding the use of machine learning in cybersecurity. If Evan’s name sounds familiar, it’s because Evan was the winner of the O’Reilly Strata+Hadoop World ticket giveaway earlier this month. We met up at the conference last week and took advantage of the opportunity to record this show. Our conversation covers, among other topics, the three big problems in cybersecurity that ML can help out with, the challenges of acquiring ground truth in cybersecurity and some ways to accomplish it, and the use of decision trees, generative adversarial networks, and other algorithms in the field. The show notes can be found at twimlai.com/talk/16.

24 Mars 20171h 4min

Premium

99 kr/mån

Tillgång till alla Premium-poddar
Lyssna utan reklam
Avsluta när du vill

Prova 14 dagar gratis

Premium

129 kr/mån

Tillgång till alla Premium-poddar
Lyssna utan reklam
Avsluta när du vill
Ett extra konto

Prova 14 dagar gratis

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

Upptäck Premium

Avsnitt(763)

From Particle Physics to Audio AI with Scott Stephenson - TWiML Talk #19

(5/5) AlphaVertex - Creating a Worldwide Financial Knowledge Graph - TWiML Talk #18

(4/5) Behold.ai - Increasing Efficiency of Healthcare Insurance Billing with NLP - TWiML Talk #18

(3/5) Cambrian Intelligence - Using AI to Simplify the Programming of Robots - TWiML Talk #18

(2/5) Klustera - Location-Based Intelligence for Smarter Marketing - TWiML Talk #18

(1/5) HelloVera - AI-Powered Customer Support - TWiML Talk #18

Interactive Machine Learning Systems with Alekh Agarwal - TWiML Talk #17

Machine Learning in Cybersecurity with Evan Wright - TWiML Talk #16

Allt en och samma app

Noga utvalt innehåll

Fortsätt när du vill

Premium

Premium

Populärt inom Politik & nyheter

Berättelserna och rösterna du älskar att lyssna på