Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663

Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663

Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick off our coverage of NeurIPS 2023. In our conversation with Markus, we cover his accepted papers at the conference, along with other work presented by Qualcomm AI Research scientists. Markus’ first paper, Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing, focuses on tackling activation quantization issues introduced by the attention mechanism and how to solve them. We also discuss Pruning vs Quantization: Which is Better?, which focuses on comparing the effectiveness of these two methods in achieving model weight compression. Additional papers discussed focus on topics like using scalarization in multitask and multidomain learning to improve training and inference, using diffusion models for a sequence of state models and actions, applying geometric algebra with equivariance to transformers, and applying a deductive verification of chain of thought reasoning performed by LLMs. The complete show notes for this episode can be found at twimlai.com/go/663.

Jaksot(781)

Learning to Learn, and other Opportunities in Machine Learning with Graham Taylor - TWiML Talk #62

Learning to Learn, and other Opportunities in Machine Learning with Graham Taylor - TWiML Talk #62

The podcast you’re about to hear is the third of a series of shows recorded at the Georgian Partners Portfolio Conference last week in Toronto. My guest this time is Graham Taylor, professor of engine...

3 Marras 201737min

Building Conversational Application for Financial Services with Kenneth Conroy - TWiML Talk #61

Building Conversational Application for Financial Services with Kenneth Conroy - TWiML Talk #61

The podcast you’re about to hear is the second of a series of shows recorded at the Georgian Partners Portfolio Conference last week in Toronto. My guest for this interview is Kenneth Conroy, VP of da...

1 Marras 201737min

Fighting Fraud with Machine Learning at Shopify with Solmaz Shahalizadeh - TWiML Talk #60

Fighting Fraud with Machine Learning at Shopify with Solmaz Shahalizadeh - TWiML Talk #60

The podcast you’re about to hear is the first of a series of shows recorded at the Georgian Partners Portfolio Conference last week in Toronto. My guest for this show is Solmaz Shahalizadeh, Director ...

30 Loka 201735min

Modeling Human Drivers for Autonomous Vehicles with Katie Driggs-Campbell - TWiML Talk #59

Modeling Human Drivers for Autonomous Vehicles with Katie Driggs-Campbell - TWiML Talk #59

We are back with our third show this week, episode 3 of our Autonomous Vehicles Series. My guest this time is Katie Driggs-Campbell, PostDoc in the Intelligent Systems Lab at Stanford University’s Dep...

27 Loka 201733min

Perception Models for Self-Driving Cars with Jianxiong Xiao - TWiML Talk #58

Perception Models for Self-Driving Cars with Jianxiong Xiao - TWiML Talk #58

We are back with our second show this week, episode 2 of our Autonomous Vehicles Series. This time around we are joined by Jianxiong Xiao of AutoX, a company building computer vision centric solutions...

25 Loka 201741min

Training Data for Autonomous Vehicles - Daryn Nakhuda - TWiML Talk #57

Training Data for Autonomous Vehicles - Daryn Nakhuda - TWiML Talk #57

The episode you are about to hear is the first of a new series of shows on Autonomous Vehicles. We all know that self-driving cars is one of the hottest topics in ML & AI, so we had to dig a little de...

23 Loka 201747min

Human Factors in Machine Intelligence with James Guszcza - TWiML Talk #56

Human Factors in Machine Intelligence with James Guszcza - TWiML Talk #56

As you all know, a few weeks ago, I spent some time in SF at the Artificial Intelligence Conference. I sat down with James Guszcza, US Chief Data Scientist at Deloitte Consulting to talk about human f...

16 Loka 201742min

AI-Powered Conversational Interfaces with Paul Tepper - TWiML Talk #52

AI-Powered Conversational Interfaces with Paul Tepper - TWiML Talk #52

The show you’re about to hear is part of a series of shows recorded in San Francisco at the Artificial Intelligence Conference. My guest for this show is Paul Tepper, worldwide head of cognitive innov...

6 Loka 201736min

Suosittua kategoriassa Politiikka ja uutiset

uutiscast
aikalisa
rss-ootsa-kuullut-tasta
politiikan-puskaradio
ootsa-kuullut-tasta-2
tervo-halme
viisupodi
rss-podme-livebox
rss-asiastudio
rikosmyytit
the-ulkopolitist
et-sa-noin-voi-sanoo-esittaa
otetaan-yhdet
radio-antro
rss-sanna-ukkola-show-verkkouutiset
io-techin-tekniikkapodcast
aihe
rss-tasta-on-kyse-ivan-puopolo-verkkouutiset
rss-kyselytunti
rss-tekkipodi