Rethinking Model Size: Train Large, Then Compress with Joseph Gonzalez - #378

Rethinking Model Size: Train Large, Then Compress with Joseph Gonzalez - #378

Today we’re joined by Joseph Gonzalez, Assistant Professor in the EECS department at UC Berkeley. In our conversation, we explore Joseph’s paper “Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers,” which looks at compute-efficient training strategies for models. We discuss the two main problems being solved; 1) How can we rapidly iterate on variations in architecture? And 2) If we make models bigger, is it really improving any efficiency?

Populärt inom Politik & nyheter

aftonbladet-krim
svenska-fall
rss-krimstad
p3-krim
fordomspodden
rss-viva-fotboll
svd-dokumentara-berattelser-2
flashback-forever
olyckan-inifran
aftonbladet-daily
rss-sanning-konsekvens
rss-vad-fan-hande
rss-expressen-dok
dagens-eko
rss-frandfors-horna
motiv
krimmagasinet
rss-krimreportrarna
blenda-2
spotlight