Skip to content

Instantly share code, notes, and snippets.

@nunosans
Last active June 23, 2024 19:23
Show Gist options
  • Save nunosans/b6f64428fcdc9c0304f5e164c207c911 to your computer and use it in GitHub Desktop.
Save nunosans/b6f64428fcdc9c0304f5e164c207c911 to your computer and use it in GitHub Desktop.
Robots TXT file to disallow AI crawlers
# Disallow the Amazon AI bot.
User-agent: Amazonbot
Disallow: /
# Disallow the Anthropic AI bot.
User-agent: anthropic-ai
Disallow: /
# Disallow the Apple Intelligence bot.
User-Agent: Applebot
Disallow: /
# Disallow the ByteDance bot.
User-agent: Bytespider
Disallow: /
# Disallow the Common Crawl bot.
User-agent: CCBot
Disallow: /
# Disallow the bot for OpenAI LLMs (ChatGPT).
User-agent: ChatGPT
Disallow: /
# Disallow the Anthropic AI bot.
User-agent: Claude-Web
Disallow: /
# Disallow the Anthropic AI bot.
User-agent: ClaudeBot
Disallow: /
# Disallow You bot.
User-agent: Diffbot
Disallow: /
# Disallow the bot for Meta LLMs.
User-Agent: FacebookBot
Disallow: /
# Disallow the bot for Google LLMs (Bard, Gemini).
User-agent: Google-Extended
Disallow: /
# Disallow the OpenAI bot.
User-agent: GPTBot
Disallow: /
# Disallow Hive's bot for image generation (Image2dataset).
User-agent: Image2dataset
Disallow: /
# Disallow Hive's bot for image generation (ImageSift).
User-agent: ImagesiftBot
Disallow: /
# Disallow the bot for webz.io index.
User-agent: Omgili
Disallow: /
# Disallow the bot for webz.io index.
User-agent: Omgilibot
Disallow: /
# Disallow the Perplexity bot.
User-agent: PerplexityBot
Disallow: /
# Disallow the you.com bot.
User-agent: YouBot
Disallow: /
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment