@yireo
Created July 18, 2014 10:55
Example robots.txt file for MageBridge environments
# Example robots.txt for MageBridge.
# Joomla! is assumed to be in the root /
# Magento is assumed to be installed in /magento/
# MageBridge Root Menu-Item is assumed to have a URL /shop/
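# Adapting the paths (a sketch, not part of the original example): if your
# setup differs from the assumptions above, change the path prefixes to match.
# The folder /store/ and the URL /webshop/ below are hypothetical names,
# shown commented out so they have no effect on crawlers:
#   Disallow: /store/app/          (instead of /magento/app/)
#   Disallow: /webshop/checkout/   (instead of /shop/checkout/)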
User-agent: *
Disallow: /administrator/
Disallow: /bin/
Disallow: /cache/
Disallow: /cli/
Disallow: /components/
Disallow: /includes/
Disallow: /installation/
Disallow: /language/
Disallow: /layouts/
Disallow: /libraries/
Disallow: /logs/
Disallow: /media/
Disallow: /modules/
Disallow: /plugins/
Disallow: /templates/
Disallow: /tmp/
## Do not crawl development files and folders: CVS, svn directories and dump files
Disallow: /CVS
Disallow: /*.svn$
Disallow: /*.idea$
Disallow: /*.sql$
Disallow: /*.tgz$
## GENERAL MAGENTO SETTINGS
## Do not crawl Magento admin page
Disallow: /magento/admin/
## Do not crawl common Magento technical folders
Disallow: /magento/app/
Disallow: /magento/downloader/
Disallow: /magento/errors/
Disallow: /magento/includes/
Disallow: /magento/lib/
Disallow: /magento/pkginfo/
Disallow: /magento/shell/
Disallow: /magento/var/
## Do not crawl common Magento files
Disallow: /magento/api.php
Disallow: /magento/cron.php
Disallow: /magento/cron.sh
Disallow: /magento/error_log
Disallow: /magento/get.php
Disallow: /magento/install.php
Disallow: /magento/LICENSE.html
Disallow: /magento/LICENSE.txt
Disallow: /magento/LICENSE_AFL.txt
Disallow: /magento/README.txt
Disallow: /magento/RELEASE_NOTES.txt
## Do not crawl checkout and user account pages
Disallow: /shop/checkout/
Disallow: /shop/onestepcheckout/
Disallow: /shop/customer/
Disallow: /shop/customer/account/
Disallow: /shop/customer/account/login/
## Do not crawl subcategory pages that are sorted or filtered
Disallow: /*?dir*
Disallow: /*?dir=desc
Disallow: /*?dir=asc
Disallow: /*?limit=all
Disallow: /*?mode*
## Do not crawl links with session IDs
Disallow: /*?SID=
## Do not crawl search pages and non-SEO-optimized catalog links
Disallow: /shop/catalogsearch/
Disallow: /shop/catalog/product_compare/
Disallow: /shop/catalog/category/view/
Disallow: /shop/catalog/product/view/
Crawl-delay: 2