Building the howto100m Video Corpus
Data Skeptic19 Aug 2019

Building the howto100m Video Corpus

Video annotation is an expensive and time-consuming process. As a consequence, the available video datasets are useful but small. The availability of machine transcribed explainer videos offers a unique opportunity to rapidly develop a useful, if dirty, corpus of videos that are "self annotating", as hosts explain the actions they are taking on the screen.

This episode is a discussion of the HowTo100m dataset - a project which has assembled a video corpus of 136M video clips with captions covering 23k activities.

Related Links

The paper will be presented at ICCV 2019

@antoine77340

Antoine on Github

Antoine's homepage

Episoder(589)

Populært innen Vitenskap

fastlegen
rekommandert
jss
tingenes-tilstand
rss-nysgjerrige-norge
sinnsyn
rss-rekommandert
tomprat-med-gunnar-tjomlid
villmarksliv
dekodet-2
rss-paradigmepodden
doktor-fives-podcast
forskningno
pod-britannia
fjellsportpodden
vett-og-vitenskap-med-gaute-einevoll
rss-overskuddsliv
tidlose-historier
abid-nadia-skyld-og-skam
rss-inn-til-kjernen-med-sunniva-rose