Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Beyond Accuracy: Behavioral Testing of NLP Models with Sameer Singh - #406

Today we’re joined by Sameer Singh, an assistant professor in the department of computer science at UC Irvine. Sameer’s work centers on large-scale and interpretable machine learning applied to information extraction and natural language processing. We caught up with Sameer right after he was awarded the best paper award at ACL 2020 for his work on Beyond Accuracy: Behavioral Testing of NLP Models with CheckList. In our conversation, we explore CheckLists, the task-agnostic methodology for testing NLP models introduced in the paper. We also discuss how well we understand the cause of pitfalls or failure modes in deep learning models, Sameer’s thoughts on embodied AI, and his work on the now famous LIME paper, which he co-authored alongside Carlos Guestrin. The complete show notes for this episode can be found at twimlai.com/go/406.

Avsnitt(781)

Inverse Reinforcement Learning Without RL with Gokul Swamy - #643

Inverse Reinforcement Learning Without RL with Gokul Swamy - #643

Today we’re joined by Gokul Swamy, a Ph.D. Student at the Robotics Institute at Carnegie Mellon University. In the final conversation of our ICML 2023 series, we sat down with Gokul to discuss his acc...

21 Aug 202333min

Explainable AI for Biology and Medicine with Su-In Lee - #642

Explainable AI for Biology and Medicine with Su-In Lee - #642

Today we’re joined by Su-In Lee, a professor at the Paul G. Allen School of Computer Science And Engineering at the University Of Washington. In our conversation, Su-In details her talk from the ICML ...

14 Aug 202338min

Transformers On Large-Scale Graphs with Bayan Bruss - #641

Transformers On Large-Scale Graphs with Bayan Bruss - #641

Today we’re joined by Bayan Bruss, Vice President of Applied ML Research at Capital One. In our conversation with Bayan, we covered a pair of papers his team presented at this year’s ICML conference. ...

7 Aug 202338min

The Enterprise LLM Landscape with Atul Deo - #640

The Enterprise LLM Landscape with Atul Deo - #640

Today we’re joined by Atul Deo, General Manager of Amazon Bedrock. In our conversation with Atul, we discuss the process of training large language models in the enterprise, including the pain points ...

31 Juli 202337min

BloombergGPT - an LLM for Finance with David Rosenberg - #639

BloombergGPT - an LLM for Finance with David Rosenberg - #639

Today we’re joined by David Rosenberg, head of the machine learning strategy team in the Office of the CTO at Bloomberg. In our conversation with David, we discuss the creation of BloombergGPT, a cust...

24 Juli 202336min

Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638

Are LLMs Good at Causal Reasoning? with Robert Osazuwa Ness - #638

Today we’re joined by Robert Osazuwa Ness, a senior researcher at Microsoft Research, Professor at Northeastern University, and Founder of Altdeep.ai. In our conversation with Robert, we explore wheth...

17 Juli 202348min

Privacy vs Fairness in Computer Vision with Alice Xiang - #637

Privacy vs Fairness in Computer Vision with Alice Xiang - #637

Today we’re joined by Alice Xiang, Lead Research Scientist at Sony AI, and Global Head of AI Ethics at Sony Group Corporation. In our conversation with Alice, we discuss the ongoing debate between pri...

10 Juli 202337min

Unifying Vision and Language Models with Mohit Bansal - #636

Unifying Vision and Language Models with Mohit Bansal - #636

Today we're joined by Mohit Bansal, Parker Professor, and Director of the MURGe-Lab at UNC, Chapel Hill. In our conversation with Mohit, we explore the concept of unification in AI models, highlightin...

3 Juli 202348min

Populärt inom Politik & nyheter

svenska-fall
p3-krim
aftonbladet-krim
spar
fordomspodden
rss-krimstad
flashback-forever
rss-sanning-konsekvens
motiv
aftonbladet-daily
krimmagasinet
rss-krimreportrarna
rss-frandfors-horna
rss-vad-fan-hande
politiken
grans
rss-flodet
sydsvenskan-dok
dagens-eko
rss-aftonbladet-krim