Building the howto100m Video Corpus
Data Skeptic19 Aug 2019

Building the howto100m Video Corpus

Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. The availability of machine transcribed explainer videos offers a unique opportunity to rapidly develop a useful, if dirty, corpus of videos that are "self annotating", as hosts explain the actions they are taking on the screen.

This episode is a discussion of the HowTo100m dataset - a project which has assembled a video corpus of 136M video clips with captions covering 23k activities.

Related Links

The paper will be presented at ICCV 2019

@antoine77340

Antoine on Github

Antoine's homepage

Populärt inom Vetenskap

p3-dystopia
svd-nyhetsartiklar
dumma-manniskor
allt-du-velat-veta
kapitalet-en-podd-om-ekonomi
paranormalt-med-caroline-giertz
dumforklarat
rss-ufobortom-rimligt-tvivel
rss-i-hjarnan-pa-louise-epstein
rss-vetenskapsradion
sexet
rss-vetenskapspodden
medicinvetarna
det-morka-psyket
rss-broccolipodden-en-podcast-som-inte-handlar-om-broccoli
barnpsykologerna
rss-vetenskapsradion-2
bildningspodden
rss-spraket
4health-med-anna-sparre