[MINI] Leakage
Data Skeptic1 Jul 2016

[MINI] Leakage

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. For those without access to time travel technology, we need to avoid including information about the future in our training data when building machine learning models. Similarly, if any other feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction, is a feature that can introduce leakage to your model.

Denne episoden er hentet fra en åpen RSS-feed og er ikke publisert av Podme. Den kan derfor inneholde annonser.

Episoder(601)

k-means Image Segmentation

k-means Image Segmentation

Linh Da joins us to explore how image segmentation can be done using k-means clustering.  Image segmentation involves dividing an image into a distinct set of segments.  One such approach is to do thi...

22 Feb 202223min

Tracking Elephant Clusters

Tracking Elephant Clusters

In today's episode, Gregory Glatzer explained his machine learning project that involved the prediction of elephant movement and settlement, in a bid to limit the activities of poachers. He used two m...

18 Feb 202226min

k-means clustering

k-means clustering

Welcome to our new season, Data Skeptic: k-means clustering.  Each week will feature an interview or discussion related to this classic algorithm, it's use cases, and analysis. This episode is an over...

14 Feb 202224min

Snowflake Essentials

Snowflake Essentials

Frank Bell, Snowflake Data Superhero, and SnowPro, joins us today to talk about his book "Snowflake Essentials: Getting Started with Big Data in the Cloud."  Snowflake Essentials: Getting Started wit...

7 Feb 202246min

Explainable Climate Science

Explainable Climate Science

Zack Labe, a Post-Doctoral Researcher at Colorado State University, joins us today to discuss his work "Detecting Climate Signals using Explainable AI with Single Forcing Large Ensembles." Works Menti...

31 Jan 202234min

Energy Forecasting Pipelines

Energy Forecasting Pipelines

Erin Boyle, the Head of Data Science at Myst AI, joins us today to talk about her work with Myst AI, a time series forecasting platform and service with the objective for positively impacting sustaina...

24 Jan 202243min

Matrix Profiles in Stumpy

Matrix Profiles in Stumpy

Sean Law, Principle Data Scientist, R&D at a Fortune 500 Company, comes on to talk about his creation of the STUMPY Python Library. Sponsored by Hello Fresh and mParticle: Go to Hellofresh.com/dataske...

17 Jan 202239min

The Great Australian Prediction Project

The Great Australian Prediction Project

Data scientists and psychics have at least one major thing in common. Both professions attempt to predict the future. In the case of a data scientist, this is done using algorithms, data, and often co...

14 Jan 202225min

Populært innen Vitenskap

fastlegen
tingenes-tilstand
jss
rss-zahid-ali-hjelper-deg
liberal-halvtime
sinnsyn
rekommandert
forskningno
villmarksliv
rss-paradigmepodden
vett-og-vitenskap-med-gaute-einevoll
rss-overskuddsliv
nordnorsk-historie
tidlose-historier
rss-inn-til-kjernen-med-sunniva-rose
dekodet-2
kvinnehelsepodden
grunnstoffene
fjellsportpodden
rss-nysgjerrige-norge