[MINI] Leakage
Data Skeptic, 1 Jul 2016

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. Those of us without access to time travel technology need to avoid including information about the future in our training data when building machine learning models. Similarly, any other feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction is a feature that can introduce leakage into your model.
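One way to picture the rule is to attach to every feature the time at which its value became known, then keep only what was known at prediction time. The sketch below is a minimal illustration under that assumption. The function name `drop_leaky_features`, the churn scenario, and all feature names are hypothetical and are not from the episode.

```python
from datetime import datetime


def drop_leaky_features(features, prediction_time):
    """Keep only features whose values were known at or before prediction_time.

    `features` maps a feature name to a (value, observed_at) pair.
    """
    return {
        name: value
        for name, (value, observed_at) in features.items()
        if observed_at <= prediction_time
    }


# Hypothetical scenario: on 1 June we want to predict whether a customer
# will cancel during June.
prediction_time = datetime(2016, 6, 1)
row = {
    "logins_in_may": (12, datetime(2016, 5, 31)),
    "support_tickets_in_may": (3, datetime(2016, 5, 31)),
    # Recorded only after the outcome is known, so it leaks the answer.
    "cancellation_reason": ("too expensive", datetime(2016, 6, 20)),
}

safe_row = drop_leaky_features(row, prediction_time)
print(sorted(safe_row))  # ['logins_in_may', 'support_tickets_in_may']
```

Real datasets rarely carry an explicit timestamp per feature, so in practice this check is usually a manual review of where each column comes from and when it gets filled in.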

This episode is taken from an open RSS feed and is not published by Podme. It may therefore contain ads.

Episodes (601)

Eugene Goostman

In this episode, Kyle shares his perspective on the chatbot Eugene Goostman, which (some claim) "passed" the Turing Test. As a second topic, Kyle also introduces the Winograd Schema Challenge.

13 Apr 2018, 17 min

The Theory of Formal Languages

In this episode, Kyle and Linhda discuss the theory of formal languages. Any language can (theoretically) be a formal language. The requirement is that the language can be rigorously described as a se...

6 Apr 2018, 23 min

The Loebner Prize

The Loebner Prize is a competition in the spirit of the Turing Test. Participants are welcome to submit conversational agent software to be judged by a panel of humans. This episode includes intervi...

30 Mar 2018, 33 min

Chatbots

In this episode, Kyle chats with Vince from iv.ai and Heather Shapiro who works on the Microsoft Bot Framework. We solicit their advice on building a good chatbot both creatively and technically. Our ...

23 Mar 2018, 27 min

The Master Algorithm

In this week's episode, Kyle Polich interviews Pedro Domingos about his book, The Master Algorithm: How the quest for the ultimate learning machine will remake our world. In the book, Domingos describ...

16 Mar 2018, 46 min

The No Free Lunch Theorems

What's the best machine learning algorithm to use? I hear that XGBoost wins most of the Kaggle competitions that aren't won with deep learning. Should I just use XGBoost all the time? That might work ...

9 Mar 2018, 27 min

ML at Sloan Kettering Cancer Center

For a long time, physicians have recognized that the tools they have aren't powerful enough to treat complex diseases, like cancer. In addition to data science and models, clinicians also needed actua...

2 Mar 2018, 38 min

Optimal Decision Making with POMDPs

In a previous episode, we discussed Markov Decision Processes or MDPs, a framework for decision making and planning. This episode explores the generalization Partially Observable MDPs (POMDPs) which a...

23 Feb 2018, 18 min
