Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Today we’re joined by Sameer Singh, an assistant professor in the department of computer science at UC Irvine. Sameer’s work centers on large-scale and interpretable machine learning applied to information extraction and natural language processing. We caught up with Sameer right after he was awarded the best paper award at ACL 2020 for his work on Beyond Accuracy: Behavioral Testing of NLP Models with CheckList. In our conversation, we explore CheckLists, the task-agnostic methodology for testing NLP models introduced in the paper. We also discuss how well we understand the cause of pitfalls or failure modes in deep learning models, Sameer’s thoughts on embodied AI, and his work on the now famous LIME paper, which he co-authored alongside Carlos Guestrin. The complete show notes for this episode can be found at twimlai.com/go/406.

Episoder(782)

Learning "Common Sense" and Physical Concepts with Roland Memisevic - TWiML Talk #111

Learning "Common Sense" and Physical Concepts with Roland Memisevic - TWiML Talk #111

In today’s episode, I’m joined by Roland Memisevic, co-founder, CEO, and chief scientist at Twenty Billion Neurons. Roland joined me at the RE•WORK Deep Learning Summit in Montreal to discuss the work...

15 Feb 201832min

Trust in Human-Robot/AI Interactions with Ayanna Howard - TWiML Talk #110

Trust in Human-Robot/AI Interactions with Ayanna Howard - TWiML Talk #110

In this episode, the third in our Black in AI series, I speak with Ayanna Howard, Chair of the Interactive School of Computing at Georgia Tech. Ayanna joined me for a lively discussion about her work ...

13 Feb 201846min

Data Science for Poaching Prevention and Disease Treatment with Nyalleng Moorosi - TWiML Talk #109

Data Science for Poaching Prevention and Disease Treatment with Nyalleng Moorosi - TWiML Talk #109

For today’s show, I'm joined by Nyalleng Moorosi, Senior Data Science Researcher at The Council for Scientific & Industrial Research or CSIR, in Pretoria, South Africa. In our discussion, we discuss t...

8 Feb 201852min

Security and Safety in AI: Adversarial Examples, Bias and Trust w/ Moustapha Cissé - TWiML Talk #108

Security and Safety in AI: Adversarial Examples, Bias and Trust w/ Moustapha Cissé - TWiML Talk #108

In this episode I’m joined by Moustapha Cissé, Research Scientist at Facebook AI Research Lab (or FAIR) Paris. Moustapha’s broad research interests include the security and safety of AI systems, and w...

6 Feb 201850min

Peering into the Home w/ Aerial.ai's Wifi Motion Analytics - TWiML Talk #107

Peering into the Home w/ Aerial.ai's Wifi Motion Analytics - TWiML Talk #107

In this episode I’m joined by Michel Allegue and Negar Ghourchian of Aerial.ai. Aerial is doing some really interesting things in the home automation space, by using wifi signal statistics to identify...

2 Feb 201840min

Physiology-Based Models for Fitness and Training w/ Firstbeat with Ilkka Korhonen - TWiML Talk #106

Physiology-Based Models for Fitness and Training w/ Firstbeat with Ilkka Korhonen - TWiML Talk #106

In this episode i'm joined by Ilkka Korhonen, Vice President of Technology at Firstbeat, a company whose algorithms are embedded in fitness watches from companies like Garmin and Suunto and which use ...

2 Feb 201835min

Machine Learning for Signal Processing Applications w/ Stuart Feffer & Brady Tsai - TWiML Talk #105

Machine Learning for Signal Processing Applications w/ Stuart Feffer & Brady Tsai - TWiML Talk #105

In this episode, I'm joined by Stuart Feffer, co-founder and CEO of Reality AI, which provides tools and services for engineers working with sensors and signals, and Brady Tsai, Business Development M...

1 Feb 201836min

Personalizing the Ferrari Challenge Experience w/ Intel AI - TWiML Talk #104

Personalizing the Ferrari Challenge Experience w/ Intel AI - TWiML Talk #104

In this episode, I'm joined by Andy Keller and Emile Chin-Dickey to discuss Intel's partnership with the Ferrari Challenge North American Series. Andy is a Deep Learning Data Scientist at Intel and Em...

31 Jan 201837min

Populært innen Politikk og nyheter

giver-og-gjengen-vg
aftenpodden
aftenpodden-usa
lydartikler-fra-aftenposten
forklart
stopp-verden
popradet
fotballpodden-2
dine-penger-pengeradet
rss-gukild-johaug
det-store-bildet
hanna-de-heldige
rss-ness
aftenbla-bla
nokon-ma-ga
rss-penger-polser-og-politikk
e24-podden
rss-utenrikskomiteen-med-bogen-og-grasvik
frokostshowet-pa-p5
chit-chat-med-helle