Tragedy of the (data) commons

The Data Provenance Initiative is a collective of volunteer AI researchers from around the world. They conduct large-scale audits of the massive datasets that power state-of-the-art AI models, with the goal of mapping the landscape of AI training data to improve transparency, documentation, and informed use of data. Their Explorer tool allows users to filter and analyze the training datasets typically used by large language models.

Shayne and Robert are the authors of a new study, Consent in Crisis: The Rapid Decline of the AI Data Commons, the first large-scale, longitudinal audit of the consent protocols for the web domains underlying AI training sets.

Connect with Shayne via his website.

Connect with Robert via his website or on LinkedIn.

Stack Overflow user George Hawkins earned a Populist badge by explaining "How to get base url in angular 5?"

Episodes (922)

Keeping the lights on for open source

Ryan sits down with Chainguard CEO Dan Lorenc to chat about how his team is keeping the foundation of the internet—open source projects—alive by forking archived but widely-used repos to provide secur...

17 March, 29 min

Open source for awkward robots

Ryan is joined by Jan Liphardt, CEO and co-founder of OpenMind, to chat about the rapidly evolving world of humanoid robotics and what it means for humans, why OpenMind is building an open source ope...

13 March, 30 min

Even the chip makers are making LLMs

Ryan welcomes Kari Briski, NVIDIA’s VP of Generative AI Software for Enterprise, to the show to explore how a chip manufacturer got into the model development game. They discuss NVIDIA’s co-design fee...

10 March, 26 min

Building brains for bulldozers

Ryan chats with Kevin Peterson, CTO of Bedrock Robotics, about the evolution of self-driving technology and why robotics is now advancing; how real data is still relevant but simulation becomes essent...

6 March, 24 min

AI-assisted coding needs more than vibes; it needs containers and sandboxes

SPONSORED BY DOCKER. In this sponsored episode, Ryan chats with Mark Cavage, President and COO of Docker, about hardened containers and agent sandboxes. They discuss what it means ...

4 March, 27 min

No need for Ctrl+C when you have MCP

Ryan sits down with David Soria Parra, Member of the Technical Staff at Anthropic and Model Context Protocol co-creator, to talk about the evolution of MCP from local-only to remote connectivity, how security...

2 March, 31 min

To live in an AI world, knowing is half the battle

Ryan welcomes Marcus Fontoura, technical fellow at Microsoft and author of Human Agency in the Digital World, to discuss the intersection of technology, society, and human dignity in a digital-first w...

27 February, 28 min

Dogfood so nutritious it’s building the future of SDLCs

Ryan welcomes Thibault Sottiaux, OpenAI’s engineering lead on Codex, to discuss how the Codex team dogfoods Codex to build Codex, what distinguishes an agentic coding tool from a chat-based code assis...

24 February, 32 min
