AI lab TL;DR |  João Pedro Quintais - Untangling AI Copyright and Data Mining in EU Compliance

AI lab TL;DR | João Pedro Quintais - Untangling AI Copyright and Data Mining in EU Compliance

🔍 In this TL;DR episode, João Quintais (Institute for Information Law) explains the interaction between the AI Act and EU copyright law, focusing on text and data mining (TDM). He unpacks key issues like lawful access, opt-out mechanisms, and transparency obligations for AI developers. João explores challenges such as extraterritoriality and trade secrets, offering insights into how voluntary codes of practice and contractual diligence could help align AI innovation with EU copyright rules.


#AI #ArtificialIntelligence #GenerativeAI


📌 TL;DR Highlights

⏲️[00:00] Intro

⏲️[00:51] Q1-What are the key interactions between the AI Act and EU copyright law, particularly concerning text and data mining (TDM) practices?

⏲️[7:12] Q2-How do you see the transparency obligations of the AI Act shaping the relationship between AI developers and rightsholders?

⏲️[15:22] Q3-Is the extraterritorial reference of Recital 106 of the AI Act enforceable?

⏲️[24:35] Wrap-up & Outro


💭 Q1 - What are the key interactions between the AI Act and EU copyright law, particularly concerning text and data mining (TDM) practices


🗣️ "The AI Act links to Article 4 by obliging general-purpose AI (GPAI) model providers to identify and respect rights reservation mechanisms and disclose a sufficiently detailed summary about the training data used."

🗣️ "The Copyright Directive of 2019 introduces exceptions and limitations for text and data mining (TDM), with Article 3 aimed at research and Article 4 applying broadly but with additional requirements like rights reservation or opt-out mechanisms."

🗣️ "The concept of TDM is so broad that it applies to activities involved in pre-training and training AI models, impacting entities across the value chain, not just model providers."

🗣️ "Entities like Common Crawl or LAION that perform upstream activities like web scraping are not directly regulated by the AI Act but are part of the broader TDM definition under the Copyright Directive."

🗣️ "One debated requirement is the rights reservation or opt-out mechanism for publicly accessible online content."


💭 Q2 - How do you see the transparency obligations of the AI Act shaping the relationship between AI developers and rightsholders?


🗣️ "The transparency provision in Article 53(1)(d) requires GPAI model providers to make publicly available a sufficiently detailed summary of training data, balancing interests like copyright, privacy, and fair competition."

🗣️ "If the summary is too vague, it becomes meaningless; if too detailed, it might infringe on trade secrets—so finding a balance is critical."

🗣️ "The usefulness of the training data summary lies in clarifying whether TDM exception requirements, such as lawful access and respect for opt-outs, have been met."

🗣️ "A significant challenge is ensuring compliance when data sets are obtained from upstream providers not subject to the AI Act, raising questions about responsibility and enforcement."

🗣️ "The Act balances interests, acknowledging the impossibility of listing all copyrighted works used in training due to territorial fragmentation and low originality thresholds."


💭 Q3 - Is the extraterritorial reference of Recital 106 of the AI Act enforceable?


🗣️ "Recital 106 aims to prevent regulatory arbitrage by requiring compliance with EU copyright standards, even for AI models trained outside the EU."

🗣️ "The principle of territoriality in copyright law conflicts with the extraterritorial implications of the AI Act, as copyright rules are typically governed by the location of the activity."

🗣️ "Using contractual obligations and voluntary meta-regulation, such as commitments from upstream providers, offers a more consistent way to enforce compliance than extending the law extraterritorially."

🗣️ "The Act's compliance incentives might still push GPAI providers to align with EU standards to avoid severe sanctions, even if extraterritorial enforcement remains uncertain."

🗣️ "Some suggest contractual obligations or meta-regulation as more practical solutions to ensure upstream compliance with EU law."


📌 About Our Guest

🎙️ João Pedro Quintais | Assistant Professor, Institute for Information Law (IViR)

🌐 Article | Generative AI, Copyright, and the AI Act

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4912701

🌐 Joao Pedro Quintais

https://www.ivir.nl/employee/quintais/


Dr João Pedro Quintais is Assistant Professor at the University of Amsterdam’s Law School, in the Institute for Information Law (IViR). João notably studies how intellectual property law applies to new technologies and the implications of copyright law and its enforcement by algorithms on the rights and freedoms of Internet users, on the remuneration of creators, and on technological development. João is also Co-Managing Editor of the widely read Kluwer Copyright Blog and has published extensively in the area of information law.

Episoder(37)

AI lab TL;DR | Joan Barata - Transparency Obligations for All AI Systems

AI lab TL;DR | Joan Barata - Transparency Obligations for All AI Systems

🔍 In this TL;DR episode, Joan explains how Article 50 of the EU AI Act sets out high-level transparency obligations for AI developers and deployers—requiring users to be informed when they interact w...

10 Des 202517min

AI lab TL;DR | Aline Larroyed - The Fallacy Of The File

AI lab TL;DR | Aline Larroyed - The Fallacy Of The File

🔍 In this episode, Caroline and Alene unravel why the popular idea of “AI memorisation” leads policymakers down the wrong path—and how this metaphor obscures what actually happens inside large langua...

27 Nov 20257min

AI lab TL;DR | Anna Mills and Nate Angell - The Mirage of Machine Intelligence

AI lab TL;DR | Anna Mills and Nate Angell - The Mirage of Machine Intelligence

🔍 In this TL;DR episode, Anna and Nate unpack why calling AI outputs “hallucinations” misses the mark—and introduce “AI Mirage” as a sharper, more accurate metaphor. From scoring alternative terms to...

26 Mai 202520min

AI lab TL;DR | Emmie Hine - Can Europe Lead the Open-Source AI Race?

AI lab TL;DR | Emmie Hine - Can Europe Lead the Open-Source AI Race?

🔍 In this TL;DR episode, Emmie Hine (Yale Digital Ethics Center) makes the case for Europe’s leadership in open-source AI—thanks to strong infrastructure, multilingual data, and regulatory clarity. W...

12 Mai 202511min

AI lab TL;DR | Milton Mueller - Why Regulating AI Misses the Point

AI lab TL;DR | Milton Mueller - Why Regulating AI Misses the Point

🔍 In this TL;DR episode, Milton Mueller (the Georgia Institute of Technology School of Public Policy) argues that what we call “AI” is really just part of a broader digital ecosystem. Instead of vagu...

21 Apr 202518min

AI lab TL;DR | Kevin Frazier - How Smarter Copyright Law Can Unlock Fairer AI

AI lab TL;DR | Kevin Frazier - How Smarter Copyright Law Can Unlock Fairer AI

🔍 In this TL;DR episode, Kevin Frazier (University of Texas at Austin school of Law) outlines a proposal to realign U.S. copyright law with its original goal of spreading knowledge. The discussion in...

7 Apr 202516min

AI lab TL;DR | Paul Keller - A Vocabulary for Opting Out of AI Training and TDM

AI lab TL;DR | Paul Keller - A Vocabulary for Opting Out of AI Training and TDM

🔍 In this TL;DR episode, Paul Keller (The Open Future Foundation) outlines a proposal for a common opt-out vocabulary to improve how EU copyright rules apply to AI training. The discussion introduces...

24 Mar 202515min

Populært innen Vitenskap

fastlegen
rekommandert
tingenes-tilstand
sinnsyn
forskningno
rss-rekommandert
liberal-halvtime
smart-forklart
jss
tomprat-med-gunnar-tjomlid
villmarksliv
fjellsportpodden
rss-paradigmepodden
dekodet-2
pod-britannia
psykopoden
rss-overskuddsliv
tidlose-historier
aldring-og-helse-podden
nevropodden