# robots.txt — futureproofwebsite.com # AI-ready directives: this site welcomes well-behaved AI crawlers and points # them at its machine-readable surface (criteria D2). User-agent: * Allow: / Disallow: /subscribe Disallow: /admin # Content-Signal — how AI systems may use this content (content-signals.org). # We welcome search indexing and AI answer-grounding; we reserve training. Content-Signal: search=yes, ai-input=yes, ai-train=no # Reputable AI crawlers — explicitly welcomed for indexing and answering. User-agent: GPTBot Allow: / User-agent: ClaudeBot Allow: / User-agent: PerplexityBot Allow: / User-agent: Google-Extended Allow: / # Training-only crawlers — disallowed to enforce our Content-Signal ai-train=no. # These feed AI training corpora (and Bytespider is abusive); they don't drive the # live AI answers/citations the crawlers above do, so blocking them costs no visibility. User-agent: CCBot Disallow: / User-agent: Applebot-Extended Disallow: / User-agent: Bytespider Disallow: / # Point crawlers at the structured surfaces. Sitemap: https://futureproofwebsite.com/sitemap.xml # AI guidance file: https://futureproofwebsite.com/llms.txt