
Data Provenance and Reproducibility with Pachyderm
Versioning isn't just for source code. Being able to track changes to data is critical for answering questions about data provenance, quality, and reproducibility. Daniel Whitenack joins me this week ...
3 Feb 2017, 40 min
![[MINI] Logistic Regression on Audio Data](https://cdn.podme.com/podcast-images/2593D3D31B8AC2FEF0A851C55B7D161F_small.jpg)
[MINI] Logistic Regression on Audio Data
Logistic Regression is a popular classification algorithm. In this episode, we discuss how it can be used to determine if an audio clip represents one of two given speakers. It assumes an output varia...
27 Jan 2017, 20 min

Studying Competition and Gender Through Chess
Prior work has shown that people's response to competition is in part predicted by their gender. Understanding why and when this occurs is important in areas such as labor market outcomes. A well stru...
20 Jan 2017, 34 min
![[MINI] Dropout](https://cdn.podme.com/podcast-images/A0F8119BFC2AAAB5BB173824F8C1A651_small.jpg)
[MINI] Dropout
Deep learning models can be prone to overfitting a given problem. This is especially frustrating given how much time and computational resources are often required to converge. One technique for fighting overfit...
13 Jan 2017, 15 min

The Police Data and the Data Driven Justice Initiatives
In this episode I speak with Clarence Wardell and Kelly Jin about their service with the White House's Police Data Initiative and Data Driven Justice Initiative, respectively. The Police D...
6 Jan 2017, 49 min

The Library Problem
We close out 2016 with a discussion of a basic interview question that might be asked when applying for a data science job. Specifically, how a library might build a model to predict if a book will ...
30 Dec 2016, 35 min

2016 Holiday Special
Today's episode is a reading of Isaac Asimov's Franchise. As mentioned on the show, this is just a work of fiction to be enjoyed and not in any way an obfuscated political statement. Enjoy, and h...
23 Dec 2016, 39 min
![[MINI] Entropy](https://cdn.podme.com/podcast-images/B1A01069E3E48736390A3EA4BB213338_small.jpg)
[MINI] Entropy
Classically, entropy is a measure of disorder in a system. From a statistical perspective, it is more useful to say it's a measure of the unpredictability of the system. In this episode we discuss how...
16 Dec 2016, 16 min