Introduction
Data Skeptic23 Maj 2014

Introduction

The Data Skeptic Podcast features conversations with topics related to data science, statistics, machine learning, artificial intelligence and the like, all from the perspective of applying critical thinking and the scientific method to evaluate the veracity of claims and efficacy of approaches.

This first episode is a short discussion about what this podcast is all about.

Avsnitt(588)

[MINI] Stationarity and Differencing

[MINI] Stationarity and Differencing

Mystery shoppers and fruit cultivation help us discuss stationarity - a property of some time serieses that are invariant to time in several ways. Differencing is one approach that can often convert a non-stationary process into a stationary one. If you have a stationary process, you get the benefits of many known statistical properties that can enable you to do a significant amount of inferencing and prediction.

20 Maj 201613min

Feather

Feather

I'm joined by Wes McKinney (@wesmckinn) and Hadley Wickham (@hadleywickham) on this episode to discuss their joint project Feather. Feather is a file format for storing data frames along with some metadata, to help with interoperability between languages. At the time of recording, libraries are available for R and Python, making it easy for data scientists working in these languages to quickly and effectively share datasets and collaborate.

13 Maj 201623min

[MINI] Bargaining

[MINI] Bargaining

Bargaining is the process of two (or more) parties attempting to agree on the price for a transaction. Game theoretic approaches attempt to find two strategies from which neither party is motivated to deviate. These strategies are said to be in equilibrium with one another. The equilibriums available in bargaining depend on the the transaction mechanism and the information of the parties. Discounting (how long parties are willing to wait) has a significant effect in this process. This episode discusses some of the choices Kyle and Linh Da made in deciding what offer to make on a house.

6 Maj 201615min

deepjazz

deepjazz

Deepjazz is a project from Ji-Sung Kim, a computer science student at Princeton University. It is built using Theano, Keras, music21, and Evan Chow's project jazzml. Deepjazz is a computational music project that creates original jazz compositions using recurrent neural networks trained on Pat Metheny's "And Then I Knew". You can hear some of deepjazz's original compositions on soundcloud.

29 Apr 201629min

[MINI] Auto-correlative functions and correlograms

[MINI] Auto-correlative functions and correlograms

When working with time series data, there are a number of important diagnostics one should consider to help understand more about the data. The auto-correlative function, plotted as a correlogram, helps explain how a given observations relates to recent preceding observations. A very random process (like lottery numbers) would show very low values, while temperature (our topic in this episode) does correlate highly with recent days. See the show notes with details about Chapel Hill, NC weather data by visiting: https://dataskeptic.com/blog/episodes/2016/acf-correlograms

22 Apr 201614min

Early Identification of Violent Criminal Gang Members

Early Identification of Violent Criminal Gang Members

This week I spoke with Elham Shaabani and Paulo Shakarian (@PauloShakASU) about their recent paper Early Identification of Violent Criminal Gang Members (also available onarXiv). In this paper, they use social network analysis techniques and machine learning to provide early detection of known criminal offenders who are in a high risk group for committing violent crimes in the future. Their techniques outperform existing techniques used by the police. Elham and Paulo are part of the Cyber-Socio Intelligent Systems (CySIS) Lab.

15 Apr 201627min

[MINI] Fractional Factorial Design

[MINI] Fractional Factorial Design

A dinner party at Data Skeptic HQ helps teach the uses of fractional factorial design for studying 2-way interactions.

8 Apr 201611min

Machine Learning Done Wrong

Machine Learning Done Wrong

Cheng-tao Chu (@chengtao_chu) joins us this week to discuss his perspective on common mistakes and pitfalls that are made when doing machine learning. This episode is filled with sage advice for beginners and intermediate users of machine learning, and possibly some good reminders for experts as well. Our discussion parallels his recent blog postMachine Learning Done Wrong. Cheng-tao Chu is an entrepreneur who has worked at many well known silicon valley companies. His paper Map-Reduce for Machine Learning on Multicore is the basis for Apache Mahout. His most recent endeavor has just emerged from steath, so please check out OneInterview.io.

1 Apr 201625min

Populärt inom Vetenskap

p3-dystopia
paranormalt-med-caroline-giertz
dumma-manniskor
svd-nyhetsartiklar
allt-du-velat-veta
rss-vetenskapligt-talat
kapitalet-en-podd-om-ekonomi
rss-vetenskapspodden
medicinvetarna
rss-ufobortom-rimligt-tvivel
dumforklarat
sexet
det-morka-psyket
bildningspodden
rss-vetenskapsradion-2
rss-vetenskapsradion
halsorevolutionen
rss-i-hjarnan-pa-louise-epstein
rss-spraket
a-kursen