[MINI] Leakage
Data Skeptic1 Jul 2016

[MINI] Leakage

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. For those without access to time travel technology, we need to avoid including information about the future in our training data when building machine learning models. Similarly, if any other feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction, is a feature that can introduce leakage to your model.

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Automated Fact Checking

Automated Fact Checking

Fake news can be responded to with fact-checking. However, it's easier to create fake news than the fact check it. Full Fact is the UK's independent fact-checking organization. In this episode, Kyle i...

16 Nov 201831min

[MINI] Single Source of Truth

[MINI] Single Source of Truth

In mathematics, truth is universal.  In data, truth lies in the where clause of the query. As large organizations have grown to rely on their data more significantly for decision making, a common prob...

9 Nov 201829min

Detecting Fast Radio Bursts with Deep Learning

Detecting Fast Radio Bursts with Deep Learning

Fast radio bursts are an astrophysical phenomenon first observed in 2007. While many observations have been made, science has yet to explain the mechanism for these events. This has led some to ask: c...

2 Nov 201844min

Being Bayesian

Being Bayesian

This episode explores the root concept of what it is to be Bayesian: describing knowledge of a system probabilistically, having an appropriate prior probability, know how to weigh new evidence, and fo...

26 Okt 201824min

Modeling Fake News

Modeling Fake News

This is our interview with Dorje Brody about his recent paper with David Meier, How to model fake news. This paper uses the tools of communication theory and a sub-topic called filtering theory to des...

19 Okt 201833min

The Louvain Method for Community Detection

The Louvain Method for Community Detection

Without getting into definitions, we have an intuitive sense of what a "community" is. The Louvain Method for Community Detection is one of the best known mathematical techniques designed to detect co...

12 Okt 201826min

Cultural Cognition of Scientific Consensus

Cultural Cognition of Scientific Consensus

In this episode, our guest is Dan Kahan about his research into how people consume and interpret science news. In an era of fake news, motivated reasoning, and alternative facts, important questions n...

5 Okt 201831min

False Discovery Rates

False Discovery Rates

A false discovery rate (FDR) is a methodology that can be useful when struggling with the problem of multiple comparisons. In any experiment, if the experimenter checks more than one dependent variabl...

28 Sep 201825min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
liberal-halvtime
sinnsyn
villmarksliv
rss-zahid-ali-hjelper-deg
forskningno
rekommandert
rss-overskuddsliv
jss
rss-paradigmepodden
tidlose-historier
vett-og-vitenskap-med-gaute-einevoll
fjellsportpodden
dekodet-2
rss-nysgjerrige-norge
rss-inn-til-kjernen-med-sunniva-rose
nevropodden
kvinnehelsepodden
diagnose