Skip to content

Instantly share code, notes, and snippets.

@mingderwang
Created November 19, 2019 04:05
Show Gist options
  • Star 0 You must be signed in to star a gist
  • Fork 0 You must be signed in to fork a gist
  • Save mingderwang/6f8719fd5197f77bdb9d6425f72df3c4 to your computer and use it in GitHub Desktop.
Save mingderwang/6f8719fd5197f77bdb9d6425f72df3c4 to your computer and use it in GitHub Desktop.
from autocad
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used: http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/robotstxt.html
User-agent: *
Disallow: /
# Only allow search engine crawler bots from google, bing, baidu, yahoo, yandex, goo, naver , ask .
# Only image, mobile, and regular search engine crawler bots allowed.
# The user-agent values are not case-sensitive and can be ‘grouped’, per developers.google.com.
# Google
# http://www.mcanerin.com/EN/search-engine/robots-txt.asp
User-agent: Googlebot
User-agent: googlebot-image
User-agent: googlebot-mobile
# Microsoft Bing
# http://www.mcanerin.com/EN/search-engine/robots-txt.asp
User-agent: MSNBot
User-agent: Bingbot
# Yahoo
# http://www.mcanerin.com/EN/search-engine/robots-txt.asp
User-agent: Slurp
User-agent: yahoo-mmcrawler
# Ask
# http://www.mcanerin.com/EN/search-engine/robots-txt.asp
User-agent: Teoma
# Baidu (China)
# Info: http://www.baidu.com/search/spider.htm
User-agent: baiduspider
User-agent: baiduspider-image
# Yandex (Russia)
# http://www.mcanerin.com/EN/search-engine/robots-txt.asp
User-agent: Yandex
# Goo (Japan)
# Info (Japanese): http://help.goo.ne.jp/help/article/704/
# Info (English): http://help.goo.ne.jp/help/article/853/
User-agent: moget
User-agent: ichiro
# Naver (Korea)
# Info: http://help.naver.com/customer/etc/webDocument02.nhn
User-agent: naverbot
User-agent: yeti
User-agent: Twitterbot
#Seer Interative Bot
User-agent: SeerBot/1.0
User-agent: seer-interactive-tseo
Crawl-delay: 10
Allow: /
#sitemap index file
sitemap: https://knowledge.autodesk.com/sitemap-index.xml
# Directories
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /profiles/
Disallow: /scripts/
Disallow: /themes/
Disallow: /sites/
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /filter/tips/
Disallow: /node/add/
Disallow: /search/
# Files
Disallow: /CHANGELOG.txt
Disallow: /cron.php
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /INSTALL.sqlite.txt
Disallow: /install.php
Disallow: /INSTALL.txt
Disallow: /LICENSE.txt
Disallow: /MAINTAINERS.txt
Disallow: /update.php
Disallow: /UPGRADE.txt
Disallow: /xmlrpc.php
# Paths (clean URLs)
Disallow: /admin
Disallow: /taxonomy
Disallow: /comment/reply
Disallow: /filter/tips
Disallow: /node/add
Disallow: /search?
Disallow: /user
Disallow: /user/register/
Disallow: /user/password/
Disallow: /user/login/
Disallow: /user/logout/
Disallow: /ajax/adsk/products/all/getting-started
Disallow: /all-products
Disallow: /downloads
Disallow: /get-ready
Disallow: /network-license-management
Disallow: /ooooops
# Paths (no clean URLs)
Disallow: /?q=admin
Disallow: /?q=taxonomy
Disallow: /?q=comment/reply
Disallow: /?q=filter/tips
Disallow: /?q=node/add
Disallow: /?q=search&
Disallow: /?q=user
Disallow: /?q=user/password/
Disallow: /?q=user/register/
Disallow: /?q=user/login/
Disallow: /?q=user/logout/
Disallow: /?q=ajax/adsk/products/all/getting-started
Disallow: /?q=all-products
Disallow: /?q=downloads
Disallow: /?q=get-ready
Disallow: /?q=network-license-management
Disallow: /?q=ooooops
Disallow: /?q=admin/
Disallow: /?q=comment/reply/
Disallow: /?q=filter/tips/
Disallow: /?q=node/add/
Disallow: /?q=search/
# Content
Disallow: /*/caas/blog
Disallow: /*/caas/discussion
Disallow: /*/caas/documentation
Disallow: /*/caas/education-curriculum
Disallow: /*/caas/mne-help
Disallow: /gps/
# Developer Documentation Content
Disallow: /*/cloudhelp/2015/ENU/Maya-SDK/
# Community preview pages
Disallow: /community/article/
Disallow: /community/screencast/
# CSS and JS
Allow: /sites/default/files/css
Allow: /sites/default/files/js
# AKNSITE-9638
Disallow: /?sort=
Disallow: /signin?
Disallow: /?aid=
Disallow: /?_ga=
Disallow: /?st=
Disallow: /?v=%
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment