Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Today we’re joined by Sameer Singh, an assistant professor in the department of computer science at UC Irvine. Sameer’s work centers on large-scale and interpretable machine learning applied to information extraction and natural language processing. We caught up with Sameer right after he was awarded the best paper award at ACL 2020 for his work on Beyond Accuracy: Behavioral Testing of NLP Models with CheckList. In our conversation, we explore CheckLists, the task-agnostic methodology for testing NLP models introduced in the paper. We also discuss how well we understand the cause of pitfalls or failure modes in deep learning models, Sameer’s thoughts on embodied AI, and his work on the now famous LIME paper, which he co-authored alongside Carlos Guestrin. The complete show notes for this episode can be found at twimlai.com/go/406.

Avsnitt(782)

Francisco Webber - Statistics vs Semantics for Natural Language Processing - TWiML Talk #10

Francisco Webber - Statistics vs Semantics for Natural Language Processing - TWiML Talk #10

My guest this time is Francisco Webber, founder and General Manager of artificial intelligence startup Cortical.io. Francisco presented at the O’Reilly AI conference on an approach to natural language...

3 Dec 201649min

Pascale Fung - Emotional AI: Teaching Computers Empathy - TWiML Talk #9

Pascale Fung - Emotional AI: Teaching Computers Empathy - TWiML Talk #9

My guest this time is Pascale Fung, professor of electrical & computer engineering at Hong Kong University of Science and Technology. Pascale delivered a presentation at the recent O'Reilly AI confere...

8 Nov 201634min

Diogo Almeida - Deep Learning: Modular in Theory, Inflexible in Practice - TWiML Talk #8

Diogo Almeida - Deep Learning: Modular in Theory, Inflexible in Practice - TWiML Talk #8

My guest this time is Diogo Almeida, senior data scientist at healthcare startup Enlitic. Diogo and I met at the O'Reilly AI conference, where he delivered a great presentation on in-the-trenches deep...

23 Okt 201646min

Carlos Guestrin - Explaining the Predictions of Machine Learning Models - TWiML Talk #7

Carlos Guestrin - Explaining the Predictions of Machine Learning Models - TWiML Talk #7

My guest this time is Carlos Guestrin, the Amazon professor of Machine Learning at the University of Washington. Carlos and I recorded this podcast at a conference, shortly after Apple's acquisition o...

9 Okt 201631min

Angie Hugeback - Generating Training Data for Your ML Models - TWiML Talk #6

Angie Hugeback - Generating Training Data for Your ML Models - TWiML Talk #6

My guest this time is Angie Hugeback, who is principal data scientist at Spare5. Spare5 helps customers generate the high-quality labeled training datasets that are so crucial to developing accurate m...

29 Sep 20161h 1min

Joshua Bloom - Machine Learning for the Stars & Productizing AI - TWiML Talk #5

Joshua Bloom - Machine Learning for the Stars & Productizing AI - TWiML Talk #5

My guest this time is Joshua Bloom. Josh is professor of astronomy at the University of California, Berkeley and co-founder and Chief Technology Officer of machine learning startup Wise.io. In this wi...

22 Sep 20161h 28min

Charles Isbell - Interactive AI, Plus Improving ML Education - TWiML Talk #4

Charles Isbell - Interactive AI, Plus Improving ML Education - TWiML Talk #4

My guest this time is Charles Isbell, Jr., Professor and Senior Associate Dean in the College of Computing at Georgia Institute of Technology. Charles and I go back a bit… in fact he’s the first AI re...

10 Sep 20161h 4min

Xavier Amatriain - Engineering Practical Machine Learning Systems - TWiML Talk #3

Xavier Amatriain - Engineering Practical Machine Learning Systems - TWiML Talk #3

My guest this time is Xavier Amatriain. Xavier is a former researcher who went on to lead the machine learning recommendations team at Netflix, and is now the vice president of engineering at Quora, t...

28 Aug 201656min

Populärt inom Politik & nyheter

svenska-fall
p3-krim
rss-krimstad
fordomspodden
aftonbladet-krim
spar
flashback-forever
rss-sanning-konsekvens
rss-vad-fan-hande
aftonbladet-daily
rss-krimreportrarna
motiv
politiken
rss-aftonbladet-krim
rss-frandfors-horna
rss-klubbland-en-podd-mest-om-frolunda
krimmagasinet
rss-flodet
dagens-eko
olyckan-inifran