Can Language Models Be Too Big? 🦜 with Emily Bender and Margaret Mitchell - #467

Today we’re joined by Emily M. Bender, Professor at the University of Washington, and AI researcher Margaret Mitchell. Emily and Meg, along with Timnit Gebru and Angelina McMillan-Major, are co-authors of the paper On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜. As most of you undoubtedly know by now, there has been much controversy surrounding, and fallout from, this paper. In this conversation, our main priority was to focus on the message of the paper itself. We spend some time discussing the historical context for the paper, then turn to its goals, discussing the many reasons why ever-growing datasets and models are not necessarily the direction we should be going. We explore the costs of these training datasets, both financial and environmental, as well as the bias implications of these models and, of course, the perpetual debate about responsibility when building and deploying ML systems. Finally, we discuss the thin line between AI hype and useful AI systems, the importance of doing pre-mortems to surface any issues you might encounter before building models, and much, much more. The complete show notes for this episode can be found at twimlai.com/go/467.

Episodes (779)

Simplifying On-Device AI for Developers with Siddhika Nevrekar - #697

Today, we're joined by Siddhika Nevrekar, AI Hub head at Qualcomm Technologies, to discuss on-device AI and how to make it easier for developers to take advantage of device capabilities. We unpack the...

12 Aug 2024, 46 min

Genie: Generative Interactive Environments with Ashley Edwards - #696

Today, we're joined by Ashley Edwards, a member of technical staff at Runway, to discuss Genie: Generative Interactive Environments, a system for creating ‘playable’ video environments for training de...

5 Aug 2024, 46 min

Bridging the Sim2real Gap in Robotics with Marius Memmel - #695

Today, we're joined by Marius Memmel, a PhD student at the University of Washington, to discuss his research on sim-to-real transfer approaches for developing autonomous robotic agents in unstructured...

30 Jul 2024, 57 min

Building Real-World LLM Products with Fine-Tuning and More with Hamel Husain - #694

Today, we're joined by Hamel Husain, founder of Parlance Labs, to discuss the ins and outs of building real-world products using large language models (LLMs). We kick things off discussing novel appli...

23 Jul 2024, 1 h 20 min

Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693

Today, we're joined by Albert Gu, assistant professor at Carnegie Mellon University, to discuss his research on post-transformer architectures for multi-modal foundation models, with a focus on state-...

17 Jul 2024, 57 min

Decoding Animal Behavior to Train Robots with EgoPet with Amir Bar - #692

Today, we're joined by Amir Bar, a PhD candidate at Tel Aviv University and UC Berkeley to discuss his research on visual-based learning, including his recent paper, “EgoPet: Egomotion and Interaction...

9 Jul 2024, 43 min

How Microsoft Scales Testing and Safety for Generative AI with Sarah Bird - #691

Today, we're joined by Sarah Bird, chief product officer of responsible AI at Microsoft. We discuss the testing and evaluation techniques Microsoft applies to ensure safe deployment and use of generat...

1 Jul 2024, 57 min

Long Context Language Models and their Biological Applications with Eric Nguyen - #690

Today, we're joined by Eric Nguyen, PhD student at Stanford University. In our conversation, we explore his research on long context foundation models and their application to biology particularly Hye...

25 Jun 2024, 45 min
