Skip to main content

Paper | Jiménez, Arkko: AI, Robots.txt
slides-aicontrolws-ai-robotstxt-00

Slides IAB Workshop on AI-CONTROL (aicontrolws) Team
Title Paper | Jiménez, Arkko: AI, Robots.txt
Abstract
Large Language Models (LLMs) and their use of Internet-sourced material present numerous technical, commercial, legal, societal, and ethical challenges. An emerging practice proposes extending the …
Large Language Models (LLMs) and their use of Internet-sourced material present numerous technical, commercial, legal, societal, and ethical challenges. An emerging practice proposes extending the robots.txt file to enable website owners to declare if they wish to "opt-out" from having their site’s content used in training AI models.

This paper explores the topic. We argue that the problem is much broader than the simple opt-out mechanism, given the coming new applications, the many different ways to access training material, different AI techniques, and the need to both facilitate access to training material and enable opting out from it.
State Active
Other versions pdf
Last updated 2024-09-09

slides-aicontrolws-ai-robotstxt-00
Not available as plain text. Download as PDF.