Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663

Quantizing Transformers by Helping Attention Heads Do Nothing with Markus Nagel - #663

Today we’re joined by Markus Nagel, research scientist at Qualcomm AI Research, who helps us kick off our coverage of NeurIPS 2023. In our conversation with Markus, we cover his accepted papers at the conference, along with other work presented by Qualcomm AI Research scientists. Markus’ first paper, Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing, focuses on tackling activation quantization issues introduced by the attention mechanism and how to solve them. We also discuss Pruning vs Quantization: Which is Better?, which focuses on comparing the effectiveness of these two methods in achieving model weight compression. Additional papers discussed focus on topics like using scalarization in multitask and multidomain learning to improve training and inference, using diffusion models for a sequence of state models and actions, applying geometric algebra with equivariance to transformers, and applying a deductive verification of chain of thought reasoning performed by LLMs. The complete show notes for this episode can be found at twimlai.com/go/663.

Episoder(781)

Agile Machine Learning with Jennifer Prendki - TWiML Talk #46

Agile Machine Learning with Jennifer Prendki - TWiML Talk #46

My guest this week is Jennifer Prendki. That name might sound familiar, as she was one of the great speakers from my Future of Data Summit back in May. At the time, Jennifer was senior data science ma...

5 Sep 201748min

LSTMs, Plus a Deep Learning History Lesson with Jürgen Schmidhuber - TWiML Talk #44

LSTMs, Plus a Deep Learning History Lesson with Jürgen Schmidhuber - TWiML Talk #44

This week we have a very special interview to share with you! Those of you who’ve been receiving my newsletter for a while might remember that while in Switzerland last month, I had the pleasure of in...

28 Aug 20171h 3min

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Machine Teaching for Better Machine Learning with Mark Hammond - TWiML Talk #43

Today’s show, which concludes the first season of the Industrial AI Series, features my interview with Bonsai co-founder and CEO Mark Hammond. I sat down with Mark at Bonsai HQ a few weeks ago and we ...

21 Aug 20171h 5min

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Marrying Physics-Based and Data-Driven ML Models with Josh Bloom - TWiML Talk #42

Recently I had a chance to catch up with a friend and friend of the show, Josh Bloom, vice president of data & analytics at GE Digital. If you’ve been listening for a while, you already know that Josh...

14 Aug 201752min

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

Data Pipelines at Zymergen with Airflow with Erin Shellman - TWiML Talk #41

The show you’re listening to features my interview with Erin Shellman. Erin is a statistician and data science manager with Zymergen, a company using robots and machine learning to engineer better mic...

5 Aug 201735min

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

Cognitive Biases in Data Science with Drew Conway - TWiML Talk #39

This show features my interview with Drew Conway, whose Wrangle keynote could have been called “Confessions of a CIA Data Scientist.” The focus of our interview, and of Drew’s presentation, is an inte...

5 Aug 201734min

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

Web Scale Engineering for Machine Learning with Sharath Rao - TWiML Talk #40

The show you’re about to listen to features my interview with Sharath Rao, Tech Lead Manager & Machine Learning Engineer at Instacart I reached out to Sharath about being on the show and was blown awa...

4 Aug 201731min

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

Deep Learning for Warehouse Operations with Calvin Seward - TWiML Talk #38

This week, I’m happy to bring you my interview with Calvin Seward, a research scientist with Berlin, Germany based Zalando. While our American listeners might not know the name Zalando, they’re one of...

31 Jul 201746min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
stopp-verden
forklart
i-retten
lydartikler-fra-aftenposten
popradet
rss-gukild-johaug
det-store-bildet
nokon-ma-ga
dine-penger-pengeradet
rss-ness
aftenbla-bla
hanna-de-heldige
fotballpodden-2
rss-dannet-uten-piano
grasoner-den-nye-kalde-krigen
frokostshowet-pa-p5
e24-podden