DataRec Library for Reproducibility in Recommender Systems


In this episode of Data Skeptic's Recommender Systems series, host Kyle Polich explores DataRec, a new Python library designed to bring reproducibility and standardization to recommender systems research. Guest Alberto Carlo Maria Mancino, a postdoctoral researcher at Politecnico di Bari, Italy, discusses the challenges of dataset management in recommendation research, from version control issues to preprocessing inconsistencies, and explains how DataRec provides automated downloads, checksum verification, and standardized filtering strategies for popular datasets such as MovieLens, Last.fm, and Amazon reviews.

The conversation covers Alberto's research journey through knowledge graphs, graph-based recommenders, privacy considerations, and recommendation novelty. He explains why small modifications in datasets can significantly impact research outcomes, the importance of offline evaluation, and DataRec's vision as a lightweight library that integrates with existing frameworks rather than replacing them. Whether you're benchmarking new algorithms or exploring recommendation techniques, this episode offers practical insights into one of the most critical yet overlooked aspects of reproducible ML research.
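Two of the ideas mentioned above, checksum verification and standardized filtering, can be sketched in a few lines of Python. This is a minimal illustration using only the standard library; it is not DataRec's API. The function names (`sha256_of`, `verify_checksum`, `k_core`) and the toy data are hypothetical, and the k-core filter is used here only as one common example of a filtering strategy. The sketch shows how removing a single interaction can cascade into further removals, which is why small, undocumented preprocessing differences can yield different datasets.

```python
import hashlib
from collections import Counter


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 hex digest of raw dataset bytes."""
    return hashlib.sha256(data).hexdigest()


def verify_checksum(data: bytes, expected: str) -> bool:
    """Check that the bytes match a published digest (hypothetical helper)."""
    return sha256_of(data) == expected


def k_core(interactions, k):
    """Iteratively drop users and items with fewer than k interactions.

    `interactions` is a list of (user, item) pairs. Filtering repeats
    until no more pairs are removed, because dropping an item can push
    a user below the threshold, and vice versa.
    """
    current = list(interactions)
    while True:
        user_counts = Counter(user for user, _ in current)
        item_counts = Counter(item for _, item in current)
        kept = [
            (user, item)
            for user, item in current
            if user_counts[user] >= k and item_counts[item] >= k
        ]
        if len(kept) == len(current):
            return kept
        current = kept


# Toy data, invented for illustration only.
raw = b"a,x\na,y\nb,x\nb,y\nc,x\nc,z\n"
published_digest = sha256_of(raw)  # stands in for a digest shipped with a dataset

interactions = [
    tuple(line.split(",")) for line in raw.decode().splitlines()
]
filtered = k_core(interactions, k=2)
```

In this toy case, item `z` has a single interaction and is dropped first; that leaves user `c` with only one interaction, so `c` is dropped in the next pass. A single-pass filter would have kept `("c", "x")`, giving a different dataset from the same raw file.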

Episodes (589)

Time Series for Good

Bahman Rostami-Tabar, Senior Lecturer in Management Science at Cardiff University, joins us today to talk about his work "Forecasting and its Beneficiaries."

1 Nov 2021, 37 min

Long Term Time Series Forecasting

Alex Mallen, Computer Science student at the University of Washington, and Henning Lange, a Postdoctoral Scholar in Applied Math at the University of Washington, join us today to share their work "Deep Probabilistic Koopman: Long-term Time-Series Forecasting Under Periodic Uncertainties."

25 Oct 2021, 37 min

Fast and Frugal Time Series Forecasting

Fotios Petropoulos, Professor of Management Science at the University of Bath in the U.K., joins us today to talk about his work "Fast and Frugal Time Series Forecasting."

17 Oct 2021, 37 min

Causal Inference in Educational Systems

Manie Tadayon, a PhD graduate from the ECE department at University of California, Los Angeles, joins us today to talk about his work "Comparative Analysis of the Hidden Markov Model and LSTM: A Simulative Approach."

11 Oct 2021, 41 min

Boosted Embeddings for Time Series

Sankeerth Rao Karingula, ML Researcher at Palo Alto Networks, joins us today to talk about his work "Boosted Embeddings for Time Series Forecasting."

Works Mentioned
"Boosted Embeddings for Time Series Forecasting" by Sankeerth Rao Karingula, Nandini Ramanan, Rasool Tahmasbi, Mehrnaz Amjadi, Deokwoo Jung, Ricky Si, Charanraj Thimmisetty, Luisa Polania Cabrera, Marjorie Sayer, Claudionor Nunes Coelho Jr
https://www.linkedin.com/in/sankeerthrao/
https://twitter.com/sankeerthrao3
https://lod2021.icas.cc/

4 Oct 2021, 28 min

Change Point Detection in Continuous Integration Systems

David Daly, Performance Engineer at MongoDB, joins us today to discuss "The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System."

Works Mentioned
"The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System" by David Daly, William Brown, Henrik Ingo, Jim O'Leary, David Bradford

27 Sep 2021, 33 min

Applying k-Nearest Neighbors to Time Series

Samya Tajmouati, a PhD student in Data Science at the University of Science of Kenitra, Morocco, joins us today to discuss her work "Applying K-Nearest Neighbors to Time Series Forecasting: Two New Approaches."

20 Sep 2021, 24 min

Ultra Long Time Series

Dr. Feng Li (@f3ngli) is an Associate Professor of Statistics in the School of Statistics and Mathematics at Central University of Finance and Economics in Beijing, China. He joins us today to discuss his work "Distributed ARIMA Models for Ultra-long Time Series."

13 Sep 2021, 28 min
