Annotator Bias
Data Skeptic, 23 November 2019

Modern deep learning approaches to natural language processing are voracious in their demand for large training corpora. Folk wisdom used to hold that around 100k documents were required for effective training. The availability of broadly trained, general-purpose models like BERT has made it possible to use transfer learning to achieve novel results on much smaller corpora.

Thanks to these advancements, an NLP researcher can get value out of fewer examples: transfer learning provides a head start, so training can focus on the nuances of language specifically relevant to the task at hand. Thus, small specialized corpora are both useful and practical to create.

In this episode, Kyle speaks with Mor Geva, lead author on the recent paper "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets," which explores some unintended consequences of the typical procedure followed for generating corpora.

Source code for the paper is available here: https://github.com/mega002/annotator_bias

Episodes (590)

Change Point Detection Algorithms

Gerrit van den Burg, Postdoctoral Researcher at The Alan Turing Institute, joins us today to discuss his work "An Evaluation of Change Point Detection Algorithms."

8 November 2021, 30 min

Time Series for Good

Bahman Rostami-Tabar, Senior Lecturer in Management Science at Cardiff University, joins us today to talk about his work "Forecasting and its Beneficiaries."

1 November 2021, 37 min

Long Term Time Series Forecasting

Alex Mallen, Computer Science student at the University of Washington, and Henning Lange, a Postdoctoral Scholar in Applied Math at the University of Washington, join us today to share their work "Deep Probabilistic Koopman: Long-term Time-Series Forecasting Under Periodic Uncertainties."

25 October 2021, 37 min

Fast and Frugal Time Series Forecasting

Fotios Petropoulos, Professor of Management Science at the University of Bath in the U.K., joins us today to talk about his work "Fast and Frugal Time Series Forecasting."

17 October 2021, 37 min

Causal Inference in Educational Systems

Manie Tadayon, a PhD graduate from the ECE department at the University of California, Los Angeles, joins us today to talk about his work "Comparative Analysis of the Hidden Markov Model and LSTM: A Simulative Approach."

11 October 2021, 41 min

Boosted Embeddings for Time Series

Sankeerth Rao Karingula, ML Researcher at Palo Alto Networks, joins us today to talk about his work "Boosted Embeddings for Time Series Forecasting."

Works Mentioned
"Boosted Embeddings for Time Series Forecasting" by Sankeerth Rao Karingula, Nandini Ramanan, Rasool Tahmasbi, Mehrnaz Amjadi, Deokwoo Jung, Ricky Si, Charanraj Thimmisetty, Luisa Polania Cabrera, Marjorie Sayer, Claudionor Nunes Coelho Jr
https://www.linkedin.com/in/sankeerthrao/
https://twitter.com/sankeerthrao3
https://lod2021.icas.cc/

4 October 2021, 28 min

Change Point Detection in Continuous Integration Systems

David Daly, Performance Engineer at MongoDB, joins us today to discuss "The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System."

Works Mentioned
"The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System" by David Daly, William Brown, Henrik Ingo, Jim O'Leary, David Bradford

27 September 2021, 33 min

Applying k-Nearest Neighbors to Time Series

Samya Tajmouati, a PhD student in Data Science at the University of Science of Kenitra, Morocco, joins us today to discuss her work "Applying K-Nearest Neighbors to Time Series Forecasting: Two New Approaches."

20 September 2021, 24 min
