Predictive Models on Random Data
Data Skeptic22 Juli 2016

Predictive Models on Random Data

This week is an insightful discussion with Claudia Perlich about some situations in machine learning where models can be built, perhaps by well-intentioned practitioners, to appear to be highly predictive despite being trained on random data. Our discussion covers some novel observations about ROC and AUC, as well as an informative discussion of leakage.

Much of our discussion is inspired by two excellent papers Claudia authored: Leakage in Data Mining: Formulation, Detection, and Avoidance and On Cross Validation and Stacking: Building Seemingly Predictive Models on Random Data. Both are highly recommended reading!

Populärt inom Vetenskap

p3-dystopia
paranormalt-med-caroline-giertz
dumma-manniskor
allt-du-velat-veta
rss-vetenskapligt-talat
svd-nyhetsartiklar
kapitalet-en-podd-om-ekonomi
rss-vetenskapspodden
dumforklarat
rss-vetenskapsradion-2
sexet
rss-vetenskapsradion
medicinvetarna
rss-ufobortom-rimligt-tvivel
rss-i-hjarnan-pa-louise-epstein
det-morka-psyket
bildningspodden
halsorevolutionen
rss-spraket
vetenskapsradion