Predictive Models on Random Data
Data Skeptic22 Jul 2016

Predictive Models on Random Data

This week is an insightful discussion with Claudia Perlich about some situations in machine learning where models can be built, perhaps by well-intentioned practitioners, to appear to be highly predictive despite being trained on random data. Our discussion covers some novel observations about ROC and AUC, as well as an informative discussion of leakage.

Much of our discussion is inspired by two excellent papers Claudia authored: Leakage in Data Mining: Formulation, Detection, and Avoidance and On Cross Validation and Stacking: Building Seemingly Predictive Models on Random Data. Both are highly recommended reading!

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Populært innen Vitenskap

fastlegen
jss
tingenes-tilstand
forskningno
rekommandert
rss-paradigmepodden
villmarksliv
sinnsyn
rss-zahid-ali-hjelper-deg
vett-og-vitenskap-med-gaute-einevoll
kvinnehelsepodden
rss-inn-til-kjernen-med-sunniva-rose
nordnorsk-historie
nevropodden
tidlose-historier
liberal-halvtime
fjellsportpodden
rss-overskuddsliv
grunnstoffene
rss-rekommandert