[MINI] Leakage
Data Skeptic, 1 July 2016

If you'd like to make a good prediction, your best bet is to invent a time machine, visit the future, observe the value, and return to the past. For those of us without access to time travel technology, the next best thing is to avoid including information about the future in our training data when building machine learning models. Similarly, any other feature whose value would not actually be available in practice at the time you'd want to use the model to make a prediction is a feature that can introduce leakage into your model.
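
As a rough illustration of the idea above, here is a minimal Python sketch of two common guards against leakage: splitting data by time so that training rows strictly precede test rows, and keeping only features that would be known at prediction time. The records, the column names (`visits`, `refunds_next_week`, `sales`), and the helper functions are all hypothetical and are not taken from the episode.

```python
from datetime import date

# Hypothetical daily records; all column names are made up for illustration.
# "refunds_next_week" is only known after the fact, so using it would leak
# information about the future into the model.
ROWS = [
    {"day": date(2016, 6, 27), "visits": 120, "refunds_next_week": 3, "sales": 40},
    {"day": date(2016, 6, 28), "visits": 135, "refunds_next_week": 5, "sales": 46},
    {"day": date(2016, 6, 29), "visits": 90, "refunds_next_week": 1, "sales": 31},
    {"day": date(2016, 6, 30), "visits": 150, "refunds_next_week": 6, "sales": 52},
    {"day": date(2016, 7, 1), "visits": 110, "refunds_next_week": 2, "sales": 38},
]

# Assumption: only these features are observable when a prediction is made.
AVAILABLE_AT_PREDICTION = {"visits"}
TARGET = "sales"


def temporal_split(rows, cutoff):
    """Put rows before the cutoff in train and all later rows in test."""
    train = [r for r in rows if r["day"] < cutoff]
    test = [r for r in rows if r["day"] >= cutoff]
    return train, test


def to_xy(rows, features, target):
    """Build feature rows and targets using only the allowed features."""
    names = sorted(features)
    X = [[r[n] for n in names] for r in rows]
    y = [r[target] for r in rows]
    return X, y


train, test = temporal_split(ROWS, cutoff=date(2016, 6, 30))
X_train, y_train = to_xy(train, AVAILABLE_AT_PREDICTION, TARGET)
X_test, y_test = to_xy(test, AVAILABLE_AT_PREDICTION, TARGET)
```

A random shuffle before splitting would mix later days into the training set, and including `refunds_next_week` would hand the model a value it could never see in practice; both tend to make offline scores look better than real-world performance.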

Episodes (601)

k-means Image Segmentation

Linh Da joins us to explore how image segmentation can be done using k-means clustering.  Image segmentation involves dividing an image into a distinct set of segments.  One such approach is to do thi...

22 February 2022, 23 min

Tracking Elephant Clusters

In today's episode, Gregory Glatzer explained his machine learning project that involved the prediction of elephant movement and settlement, in a bid to limit the activities of poachers. He used two m...

18 February 2022, 26 min

k-means clustering

Welcome to our new season, Data Skeptic: k-means clustering. Each week will feature an interview or discussion related to this classic algorithm, its use cases, and analysis. This episode is an over...

14 February 2022, 24 min

Snowflake Essentials

Frank Bell, Snowflake Data Superhero, and SnowPro, joins us today to talk about his book "Snowflake Essentials: Getting Started with Big Data in the Cloud."  Snowflake Essentials: Getting Started wit...

7 February 2022, 46 min

Explainable Climate Science

Zack Labe, a Post-Doctoral Researcher at Colorado State University, joins us today to discuss his work "Detecting Climate Signals using Explainable AI with Single Forcing Large Ensembles." Works Menti...

31 January 2022, 34 min

Energy Forecasting Pipelines

Erin Boyle, the Head of Data Science at Myst AI, joins us today to talk about her work with Myst AI, a time series forecasting platform and service with the objective of positively impacting sustaina...

24 January 2022, 43 min

Matrix Profiles in Stumpy

Sean Law, Principal Data Scientist, R&D at a Fortune 500 Company, comes on to talk about his creation of the STUMPY Python Library. Sponsored by Hello Fresh and mParticle: Go to Hellofresh.com/dataske...

17 January 2022, 39 min

The Great Australian Prediction Project

Data scientists and psychics have at least one major thing in common. Both professions attempt to predict the future. In the case of a data scientist, this is done using algorithms, data, and often co...

14 January 2022, 25 min
