[MINI] Leakage
Data Skeptic, 1 July 2016

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. Those of us without access to time travel technology need to avoid including information about the future in our training data when building machine learning models. Similarly, any feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction can introduce leakage into your model.
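As a rough illustration of the two precautions described above, the sketch below splits data by time and drops features that would not exist at prediction time. The records, the field names (`day`, `visits`, `refund_issued`, `churned`), and the cutoff date are all hypothetical and chosen only for the example; the episode does not prescribe any particular implementation.

```python
from datetime import date

# Hypothetical records: one row per customer observation (illustrative data only).
rows = [
    {"day": date(2016, 6, 1), "visits": 3, "refund_issued": 0, "churned": 0},
    {"day": date(2016, 6, 15), "visits": 1, "refund_issued": 1, "churned": 1},
    {"day": date(2016, 7, 2), "visits": 5, "refund_issued": 0, "churned": 0},
    {"day": date(2016, 7, 20), "visits": 0, "refund_issued": 1, "churned": 1},
]

LABEL = "churned"
# Assumption for this example: a refund is only recorded after a customer
# churns, so the field would not be known when the prediction is made.
LEAKY_FEATURES = {"refund_issued"}
EXCLUDED = LEAKY_FEATURES | {LABEL, "day"}


def temporal_split(records, cutoff):
    """Train on rows strictly before the cutoff, evaluate on the rest."""
    train = [r for r in records if r["day"] < cutoff]
    test = [r for r in records if r["day"] >= cutoff]
    return train, test


def usable_features(record):
    """Keep only the fields that would be available at prediction time."""
    return {k: v for k, v in record.items() if k not in EXCLUDED}


train, test = temporal_split(rows, cutoff=date(2016, 7, 1))
X_train = [usable_features(r) for r in train]
y_train = [r[LABEL] for r in train]
```

Splitting on a date rather than shuffling keeps every training row older than every evaluation row, so the model is never fitted on information from its own future. Which features count as leaky depends entirely on when each value becomes known in the real system, so that set has to come from domain knowledge, not from the data itself.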

Episodes (601)

Earthquake Detection with Crowd-sourced Data

Have you ever wanted to hear what an earthquake sounds like? Today on the show we have Omkar Ranadive, Computer Science Master's student at Northwestern University, who collaborates with Suzan van der ...

25 Dec 2020, 29 min

Byzantine Fault Tolerant Consensus

Byzantine fault tolerance (BFT) is a desirable property in a distributed computing environment. BFT means the system can survive the loss of nodes and nodes becoming unreliable. There are many differe...

22 Dec 2020, 35 min

Alpha Fold

Kyle shared some initial reactions to the announcement about Alpha Fold 2's celebrated performance in the CASP14 prediction.  By many accounts, this exciting result means protein folding is now a solv...

11 Dec 2020, 23 min

Arrow's Impossibility Theorem

Above all, everyone wants voting to be fair. What does fair mean and how can we measure it? Kenneth Arrow posited a simple set of conditions that one would certainly desire in a voting system. For exa...

4 Dec 2020, 26 min

Face Mask Sentiment Analysis

As the COVID-19 pandemic continues, the public (or at least those with Twitter accounts) are sharing their personal opinions about mask-wearing via Twitter. What does this data tell us about public op...

27 Nov 2020, 41 min

Counting Briberies in Elections

Niclas Boehmer, second-year PhD student at Berlin Institute of Technology, comes on today to discuss the computational complexity of bribery in elections through the paper "On the Robustness of Winner...

20 Nov 2020, 37 min

Sybil Attacks on Federated Learning

Clement Fung, a Societal Computing PhD student at Carnegie Mellon University, discusses his research in security of machine learning systems and a defense against targeted sybil-based poisoning called...

13 Nov 2020, 31 min

Differential Privacy at the US Census

Simson Garfinkel, Senior Computer Scientist for Confidentiality and Data Access at the US Census Bureau, discusses his work modernizing the Census Bureau disclosure avoidance system from private to pu...

6 Nov 2020, 29 min
