Building the HowTo100M Video Corpus
Data Skeptic, 19 Aug 2019

Video annotation is expensive and time-consuming. As a consequence, the available annotated video datasets are useful but small. Machine-transcribed explainer videos offer an opportunity to rapidly build a useful, if noisy, corpus of "self-annotating" videos, since hosts narrate the actions they are taking on screen.

This episode is a discussion of the HowTo100M dataset, a project that has assembled a video corpus of 136M captioned video clips covering 23k activities.

Related Links

The paper will be presented at ICCV 2019

@antoine77340

Antoine on Github

Antoine's homepage
