@vnykmshr
Created October 18, 2013 22:31
Who exactly is crawling your site? Find out more at http://danbirken.com/seo/2013/10/17/who-exactly-is-crawling-my-site.html. The author's robots.txt is reproduced here for reference.
User-Agent: Googlebot
Allow: /
User-Agent: Googlebot-Mobile
Allow: /
User-Agent: msnbot
Allow: /
User-Agent: bingbot
Allow: /
# Adsense
User-Agent: Mediapartners-Google
Allow: /
# Blekko
User-Agent: ScoutJet
Allow: /
User-Agent: Yandex
Allow: /
User-agent: baiduspider
Allow: /
User-agent: DuckDuckBot
Allow: /
# CommonCrawl
User-agent: ccbot
Allow: /
User-Agent: *
Disallow: /
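
The file above is a whitelist: each named crawler gets `Allow: /`, and the final `User-Agent: *` group disallows everyone else. As a rough way to sanity-check that reading, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `is_allowed` and the crawler name `SomeRandomBot` are made up for illustration and are not part of the original gist.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt from this gist, embedded verbatim so the sketch is self-contained.
ROBOTS_TXT = """\
User-Agent: Googlebot
Allow: /
User-Agent: Googlebot-Mobile
Allow: /
User-Agent: msnbot
Allow: /
User-Agent: bingbot
Allow: /
# Adsense
User-Agent: Mediapartners-Google
Allow: /
# Blekko
User-Agent: ScoutJet
Allow: /
User-Agent: Yandex
Allow: /
User-agent: baiduspider
Allow: /
User-agent: DuckDuckBot
Allow: /
# CommonCrawl
User-agent: ccbot
Allow: /
User-Agent: *
Disallow: /
"""


def is_allowed(user_agent: str, path: str = "/") -> bool:
    """Hypothetical helper: ask Python's parser whether user_agent may fetch path."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for agent in ("Googlebot", "bingbot", "ccbot", "SomeRandomBot"):
        print(agent, is_allowed(agent, "/some/page.html"))
```

Under Python's parser, the listed crawlers are reported as allowed and the unlisted `SomeRandomBot` falls through to the `*` group and is refused. Keep in mind that robots.txt is advisory: it only restrains crawlers that choose to honor it, and individual crawlers may interpret edge cases differently from this parser.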