# Sample robots.txt file (make sure the filename is ALL LOWERCASE on Linux/Unix systems)
# This file should go in your web site's ROOT directory
# The root directory is where your site's main /index.html file would be found
# It is usually found in /yourhomedir/public_html/ or /yourhomedir/httpdocs
# Where "yourhomedir" is your user account's name

# This says to apply these settings to ALL search engine spiders/crawlers
User-agent: *
# These settings will keep spiders from crawling your unwanted pages
# This assumes that your OSC install is in your web site's ROOT directory
# i.e.: http://www.yoursite.com/index.php <- Use if this brings up your OSC main page
Disallow: /admin
Disallow: /sugarcrm
Disallow: /weberp
Disallow: /phpsurveyor
Disallow: /osc_sync
Disallow: /catalog_osc
Disallow: /catalog/includes
Disallow: /catalog/account.php
Disallow: /catalog/advanced_search.php
Disallow: /catalog/checkout_shipping.php
Disallow: /catalog/create_account.php
Disallow: /catalog/login.php
Disallow: /catalog/password_forgotten.php
Disallow: /catalog/popup_image.php
Disallow: /catalog/shopping_cart.php
Disallow: /catalog/contact_us.php
Disallow: /catalog/product_reviews_write.php
Disallow: /catalog/cookie_usage.php
Disallow: /catalog/images
Disallow: /dev
Disallow: /shop_tmp
Disallow: /shopm131
Disallow: /atmailopen
Disallow: /atmailopen_1.03
# Feel free to add any other pages on your site that you don't want to be indexed by
# the search engines.
# PLEASE NOTE: Any pages that you list here should be secured by other means if you
# don't want people to be able to view them, as some malicious users (BadBots) will look at a
# robots.txt file to try to find "hidden" or "secret" areas of web sites to find
# confidential information.
# Just uncomment a line or add new ones as you see fit.
# Disallow: /private
# Disallow: /hidden
# for magento
Disallow: /shop/index.php/
Disallow: /shop/*?
Disallow: /shop/*.js$
Disallow: /shop/*.css$
Disallow: /shop/checkout/
Disallow: /shop/tag/
Disallow: /shop/catalogsearch/
Disallow: /shop/review/
Disallow: /shop/app/
Disallow: /shop/downloader/
Disallow: /shop/js/
Disallow: /shop/lib/
Disallow: /shop/media/
Disallow: /shop/*.php$
Disallow: /shop/pkginfo/
Disallow: /shop/report/
Disallow: /shop/skin/
Disallow: /shop/var/
Disallow: /shop/catalog/
Disallow: /shop/customer/
Disallow: /shop/*SID=
Disallow: /shop/*condition=
Disallow: /shop/*manufacturer=
Disallow: /shop/*mode=
Disallow: /shop/*p=
Disallow: /shop/*price=
Disallow: /shop/*order=
Disallow: /shop/*cat=

Sitemap: http://www.mediamixhobby.com.sg/shop/sitemap.xml

# IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES
# THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE.
# I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID
# OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS
# AND NOT EVEN BROWSING THE SITE.
User-agent: Googlebot-Image
Disallow: /

User-agent: ia_archiver
Disallow: /

User-agent: msnbot
Disallow: /
#Crawl-delay: 20

User-agent: MJ12bot
Disallow: /

User-agent: Yandex
Disallow: /
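
# A hedged example, not part of the original rules: to slow a crawler down
# instead of blocking it outright, a group like the one below could replace
# that crawler's "Disallow: /" group above. Crawl-delay is a non-standard
# directive, so support varies by crawler (Googlebot, for one, ignores it),
# and the value 20 is simply the figure from the commented-out line in the
# msnbot group, not a recommendation.
# User-agent: msnbot
# Crawl-delay: 20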