Video Recommendations in Industry
Data Skeptic, 26 Dec 2025

In this episode, Kyle Polich sits down with Cory Zechmann, a content curator working in streaming television with 16 years of experience running the music blog "Silence Nogood." They explore the intersection of human curation and machine learning in content discovery, discussing the concept of "algatorial" curation—where algorithms and editorial expertise work together. Key topics include the cold start problem, why every metric is just a "proxy metric" for what users actually want, the challenge of filter bubbles, and the importance of balancing familiarity with discovery. Cory shares insights on why TikTok's algorithm works so well (clean data and massive interaction volume), the crucial role of homepage curation, and how human curators help by contextualizing content, cleaning data, and identifying positive feedback loops that algorithms might miss.

The conversation covers practical challenges like measuring "surprise and delight," the content deluge created by democratized creation tools, and why trust in tech companies is essential for better personalization. Cory emphasizes that discovery is "a good type of friction" and explains how the CODE framework (Capture, Organize, Distill, Express, plus Analysis) guides professional curation work. Looking to the future, they discuss the need for systems thinking that creates narrative connections between content, the potential for conversational AI to help users articulate preferences, and why diverse perspectives beyond engineering are crucial for building effective discovery systems. Resources mentioned include the newsletter "Top Information Retrieval Papers of the Week" and NotebookLM for synthesizing research.

Episodes (601)

seq2seq

A sequence to sequence (or seq2seq) model is a neural architecture used for translation (and other tasks) which consists of an encoder and a decoder. The encoder/decoder architecture has obvious promise...

1 Mar 2019, 21 min

Text Mining in R

Kyle interviews Julia Silge about her path into data science, her book Text Mining with R, and some of the ways in which she's used natural language processing in projects both personal and profession...

22 Feb 2019, 20 min

Recurrent Relational Networks

One of the most challenging NLP tasks is natural language understanding and reasoning. How can we construct algorithms that are able to achieve human level understanding of text and be able to answer ...

15 Feb 2019, 19 min

Text World and Word Embedding Lower Bounds

In the first half of this episode, Kyle speaks with Marc-Alexandre Côté and Wendy Tay about Text World. Text World is an engine that simulates text adventure games. Developers are encouraged to try ...

8 Feb 2019, 39 min

word2vec

Word2vec is an unsupervised machine learning model which is able to capture semantic information from the text it is trained on. The model is based on neural networks. Several large organizations like...

1 Feb 2019, 31 min

Authorship Attribution

In a recent paper, Leveraging Discourse Information Effectively for Authorship Attribution, authors Su Wang, Elisa Ferracane, and Raymond J. Mooney describe a deep learning methodology for predicting whi...

25 Jan 2019, 50 min

Very Large Corpora and Zipf's Law

The earliest efforts to apply machine learning to natural language tended to convert every token (every word, more or less) into a unique feature. While techniques like stemming may have cut the numbe...

18 Jan 2019, 24 min

Semantic search at GitHub

GitHub is many things besides source control. It's a social network, even though not everyone realizes it. It's a vast repository of code. It's a ticketing and project management system. And of course...

11 Jan 2019, 34 min
