Skip to content

Instantly share code, notes, and snippets.

@henshaw
Last active April 16, 2024 23:33
Show Gist options
  • Star 3 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save henshaw/aa8b68ad8b7f897c709bd0ef4fd03b48 to your computer and use it in GitHub Desktop.
Save henshaw/aa8b68ad8b7f897c709bd0ef4fd03b48 to your computer and use it in GitHub Desktop.
Disallow all pages from being trained for LLMs by all major GenAI bots
User-agent: AdsBot-Google
Disallow: /
User-agent: Amazonbot
Disallow: /
User-agent: Anthropic-ai
Disallow: /
User-agent: AwarioRssBot
Disallow: /
User-agent: AwarioSmartBot
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: ChatGPT-User
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: Cohere-ai
Disallow: /
User-agent: DataForSeoBot
Disallow: /
User-agent: FacebookBot
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: GPTBot
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: Magpie-crawler
Disallow: /
User-agent: Omgili
Disallow: /
User-agent: Omgilibot
Disallow: /
User-agent: Peer39_crawler
Disallow: /
User-agent: Peer39_crawler/1.0
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: YouBot
Disallow: /
@henshaw
Copy link
Author

henshaw commented Sep 29, 2023

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment