# Modified 5/6/2010 | PAB
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used:    http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/wc/robots.html
#
# For syntax checking, see:
# http://www.sxw.org.uk/computing/robots/check.html

User-agent: *
# Directories
Disallow: /includes/
Disallow: /misc/
Disallow: /modules/
Disallow: /profiles/
Disallow: /scripts/
Disallow: /sites/
Disallow: /themes/
# Paths
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /contact/
Disallow: /logout/
Disallow: /node/add/
Disallow: /search/
# Files (wildcard patterns)
Disallow: /*.js$
Disallow: /*.gif$
Disallow: /*.jpg$
Disallow: /*.png$
Disallow: /*.swf$
Disallow: /*.doc$
# User pages and root query-string URLs
Disallow: /user/*
Disallow: /?*

# Internet Archive Wayback Machine
User-agent: ia_archiver
Disallow: /

# Yahoo Pipes
# http://pipes.yahoo.com/pipes/docs?doc=troubleshooting
User-agent: Yahoo Pipes 1.0
Disallow: /

User-agent: Yahoo Pipes 2.0
Disallow: /

# http://help.yandex.ru/webmaster/?id=996567
User-agent: Yandex
Disallow: /

# http://www.baidu.com/search/spider.htm
User-agent: Baiduspider
Disallow: /

# http://www.guruji.com/en/WebmasterFAQ.html
User-agent: GurujiBot
Disallow: /

# http://www.gnu.org/software/wget/manual/html_node/Robot-Exclusion.html
User-agent: wget
Disallow: /

# http://help.naver.com/customer_webtxt_02.jsp
User-agent: naverbot
Disallow: /

# http://help.naver.com/customer_webtxt_02.jsp
User-agent: yeti
Disallow: /

# http://www.google.com/support/webmasters/bin/answer.py?answer=40360
# http://help.yahoo.com/l/us/yahoo/search/webcrawler/slurp-02.html
# http://www.bing.com/community/blogs/webmaster/archive/2008/06/03/robots-exclusion-protocol-joining-together-to-provide-better-documentation.aspx
# http://googlewebmastercentral.blogspot.com/2008/06/improving-on-robots-exclusion-protocol.html