DataRec Library for Reproducible in Recommend Systems
Data Skeptic13 Marras

DataRec Library for Reproducible in Recommend Systems

In this episode of Data Skeptic's Recommender Systems series, host Kyle Polich explores DataRec, a new Python library designed to bring reproducibility and standardization to recommender systems research. Guest Alberto Carlo Maria Mancino, a postdoc researcher from Politecnico di Bari, Italy, discusses the challenges of dataset management in recommendation research—from version control issues to preprocessing inconsistencies—and how DataRec provides automated downloads, checksum verification, and standardized filtering strategies for popular datasets like MovieLens, Last.fm, and Amazon reviews.

The conversation covers Alberto's research journey through knowledge graphs, graph-based recommenders, privacy considerations, and recommendation novelty. He explains why small modifications in datasets can significantly impact research outcomes, the importance of offline evaluation, and DataRec's vision as a lightweight library that integrates with existing frameworks rather than replacing them. Whether you're benchmarking new algorithms or exploring recommendation techniques, this episode offers practical insights into one of the most critical yet overlooked aspects of reproducible ML research.

Jaksot(589)

User Perceptions of Problematic Ads

User Perceptions of Problematic Ads

Eric Zeng joins us to discuss his study around understanding bad ads and efforts that can be taken to limit bad ads online. He discussed how he and his co authors scrapped a large amount of ad data, applied a machine learning algorithm, and commensurate statistical results.

25 Heinä 202237min

Political Digital Advertising Analysis

Political Digital Advertising Analysis

NaLette Brodnax, a political scientist and an Assistant Professor in the McCourt School of Public Policy at Georgetown University joins us to discuss her work on analyzing digital advertisements for political campaigns. She used data for electoral campaigns on Facebook to answer questions that help us better understand how digital ads affect the outcome of elections. Click here for additional show notes! Thanks to our sponsor! https://neptune.ai/ Log, store, query, display, organize and compare all your model metadata in a single place

21 Heinä 202235min

Fraud Detection in Crowdfunding Campaigns

Fraud Detection in Crowdfunding Campaigns

18 Heinä 202235min

Artificial Intelligence and Auction Design

Artificial Intelligence and Auction Design

11 Heinä 202243min

Privacy Preference Signals

Privacy Preference Signals

Have you ever wondered what goes on under the hood when you accept a website's cookies? Today, Maximilian Hils, a PhD student in Computer Science, at the University of Innsbruck, Austria, dissects the ad tech industry and the standards put in place to protect users' data. He also shares his thoughts on the use of VPNs as well as other tools that help shield your data from prying eyes on the internet. Click here for additional show notes Thanks to our sponsor: https://clear.ml/ ClearML is an open-source MLOps solution users love to customize, helping you easily Track, Orchestrate, and Automate ML workflows at scale.

4 Heinä 202233min

Neural Architecture Search for CTR Prediction

Neural Architecture Search for CTR Prediction

Ravi Krishna joins us today to talk about his recent work on a differentiable NAS framework for ads CTR prediction. He discussed what CTR prediction is about and why his NAS framework helps in building neural networks for better ads recommendation. Listen to learn about methodology, related literature and his results. Click for additional show notes Thanks to our sponsor: https://astrato.io Astrato is a modern BI and analytics platform built for the Snowflake Data Cloud. A next-generation live query data visualization and analytics solution, empowering everyone to make live data decisions.

27 Kesä 202228min

Algorithmic PPC Management

Algorithmic PPC Management

Effectively managing a large budget of pay per click advertising demands software solutions. When spending multi-million dollar budgets on hundreds of thousands of keywords, an effective algorithmic strategy is required to optimize marketing objectives. In this episode, Nathan Janos joins us to share insights from his work in the ad tech industry. Click for additional show notes Thanks to our sponsor! https://wandb.com/ The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management.

21 Kesä 202243min

Data Skeptic: Ad Tech

Data Skeptic: Ad Tech

Increasingly, people get most if not all of the information they consume online. Alongside the web sites, videos, apps, and other destinations, we're consistently served advertisements alongside the organic content we search for or discover. Targetted ads make it possible for you to discover relevant new products you might otherwise not have heard about. Targetting can also open a pandora's box of ethical considerations. Online advertising is a complex network of automated systems. Algorithms controlling algorithms controlling what we see. This season of Data Skeptic will focus on the applications of data science to digital advertising technology. In this first episode in particular, Kyle shares some of his own personal experiences and insights working in pay-per-click marketing. Click for additional show notes

18 Kesä 202242min

Suosittua kategoriassa Tiede

rss-mita-tulisi-tietaa
utelias-mieli
rss-poliisin-mieli
hippokrateen-vastaanotolla
tiedekulma-podcast
docemilia
rss-duodecim-lehti
rss-lihavuudesta-podcast
filocast-filosofian-perusteet
rss-astetta-parempi-elama-podcast
rss-ylistys-elaimille
mielipaivakirja
radio-antro
rss-totta-vai-tuubaa
rss-tiedetta-vai-tarinaa
rss-ilmasto-kriisissa
rss-luontopodi-samuel-glassar-tutkii-luonnon-ihmeita
rss-lapsuuden-rakentajat-podcast