Powering AI with the World's Largest Computer Chip with Joel Hestness - #684

Today we're joined by Joel Hestness, principal research scientist and lead of the core machine learning team at Cerebras. We discuss Cerebras’ custom silicon for machine learning, Wafer Scale Engine 3, and how the latest version of the company’s single-chip platform for ML has evolved to support large language models. Joel shares how WSE3 differs from other AI hardware solutions, such as GPUs, TPUs, and AWS’ Inferentia, and talks through the homogeneous design of the WSE chip and its memory architecture. We discuss software support for the platform, including support by open source ML frameworks like PyTorch, and support for different types of transformer-based models. Finally, Joel shares some of the research his team is pursuing to take advantage of the hardware's unique characteristics, including weight-sparse training, optimizers that leverage higher-order statistics, and more. The complete show notes for this episode can be found at twimlai.com/go/684.

Episodes (779)

House Hunters: Machine Learning at Redfin with Akshat Kaul - #530

Today we’re joined by Akshat Kaul, the head of data science and machine learning at Redfin. We’re all familiar with Redfin, but did you know that redfin.com is the largest real estate brokerage site i...

26 Oct 2021, 44 min

Attacking Malware with Adversarial Machine Learning, w/ Edward Raff - #529

Today we’re joined by Edward Raff, chief scientist and head of the machine learning research group at Booz Allen Hamilton. Edward’s work sits at the intersection of machine learning and cybersecurity,...

21 Oct 2021, 46 min

Learning to Ponder: Memory in Deep Neural Networks with Andrea Banino - #528

Today we’re joined by Andrea Banino, a research scientist at DeepMind. In our conversation with Andrea, we explore his interest in artificial general intelligence by way of episodic memory, the relati...

18 Oct 2021, 37 min

Advancing Deep Reinforcement Learning with NetHack, w/ Tim Rocktäschel - #527

Take our survey at twimlai.com/survey21! Today we’re joined by Tim Rocktäschel, a research scientist at Facebook AI Research and an associate professor at University College London (UCL). Tim’s wor...

14 Oct 2021, 42 min

Building Technical Communities at Stack Overflow with Prashanth Chandrasekar - #526

In this special episode of the show, we’re excited to bring you our conversation with Prashanth Chandrasekar, CEO of Stack Overflow. This interview was recorded as a part of the annual Prosus AI Marke...

11 Oct 2021, 40 min

Deep Learning is Eating 5G. Here’s How, w/ Joseph Soriaga - #525

Today we’re joined by Joseph Soriaga, a senior director of technology at Qualcomm. In our conversation with Joseph, we focus on a pair of papers that he and his team will be presenting at Globecom l...

7 Oct 2021, 39 min

Modeling Human Cognition with RNNs and Curriculum Learning, w/ Kanaka Rajan - #524

Today we’re joined by Kanaka Rajan, an assistant professor at the Icahn School of Medicine at Mt Sinai. Kanaka, who is a recent recipient of the NSF Career Award, bridges the gap between the worlds of...

4 Oct 2021, 47 min

Do You Dare Run Your ML Experiments in Production? with Ville Tuulos - #523

Today we’re joined by a friend of the show and return guest Ville Tuulos, CEO and co-founder of Outerbounds. In our previous conversations with Ville, we explored his experience building and deploying...

30 Sep 2021, 40 min
