[MINI] Leakage
Data Skeptic1 Jul 2016

[MINI] Leakage

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. For those without access to time travel technology, we need to avoid including information about the future in our training data when building machine learning models. Similarly, if any other feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction, is a feature that can introduce leakage to your model.

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

Mapping Dialects with Twitter Data

Mapping Dialects with Twitter Data

When users on Twitter post with geographic tags, it creates the opportunity for a variety of interesting questions to be posed having to do with language, dialects, and location.  In this episode, Kyl...

26 Apr 201925min

Sentiment Analysis

Sentiment Analysis

This is an interview with Ellen Loeshelle, Director of Product Management at Clarabridge.  We primarily discuss sentiment analysis.

20 Apr 201927min

Attention Primer

Attention Primer

A gentle introduction to the very high-level idea of "attention" in machine learning, as it will play a major role in some upcoming episodes over the next few weeks.

13 Apr 201914min

Cross-lingual Short-text Matching

Cross-lingual Short-text Matching

Modern messaging technology has facilitated a trend towards highly compact, short messages send by users who can presume a great amount of context held between the communicating parties.  The rules of...

5 Apr 201924min

ELMo

ELMo

ELMo (Embeddings from Language Models) introduced the idea of deep contextualized word representations. It extends previous ideas like word2vec and GloVe. The ELMo model is a neural network able to ma...

29 Mar 201923min

BLEU

BLEU

Bilingual evaluation understudy (or BLEU) is a metric for evaluating the quality of machine translation using human translation as examples of acceptable quality results. This metric has become a wide...

23 Mar 201942min

Simultaneous Translation at Baidu

Simultaneous Translation at Baidu

While at NeurIPS 2018, Kyle chatted with Liang Huang about his work with Baidu research on simultaneous translation, which was demoed at the conference.

15 Mar 201924min

Human vs Machine Transcription

Human vs Machine Transcription

Machine transcription (the process of translating audio recordings of language to text) has come a long way in recent years. But how do the errors made during machine transcription compare to the erro...

8 Mar 201932min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
rss-nysgjerrige-norge
forskningno
rekommandert
sinnsyn
rss-zahid-ali-hjelper-deg
villmarksliv
rss-paradigmepodden
jss
liberal-halvtime
tomprat-med-gunnar-tjomlid
fjellsportpodden
tidlose-historier
kvinnehelsepodden
nevropodden
rss-overskuddsliv
nordnorsk-historie
dekodet-2
aldring-og-helse-podden