Robots.txt File - No Website Should Be Without One

Jerry West

Jerry’s previous experience as a software tester for WordPerfect and Novell gives him the foundation needed to dominate the SEO market. This is because he tests every approach, taking nothing for truth until the data proves it so. He’s been one of the most sought-after SEO professionals in the industry since 1996. Jerry is also a regular contributor to Webmaster World and PubCon, key events in the SEO community.

Coding Tips

Robots.txt File – No Website Should Be Without One

by Jerry West
Updated April 15, 2005

The Robots Exclusion Protocol -- Robots.txt File
by Jerry West

When a search engine spider or robot visits a web site if first checks for the presence of a robots.txt file. If this file is found, the search engine spider or robot will analyze the contents of the file for:

User-agent: *
Disallow: /

The Robots Exclusion Protocol is a method that allows website administrators to indicate which parts of their site should NOT be visited by a search engine robot.

There can only be one robots.txt file per domain. If you have users with sub-domains you must either merge all information to the one robots.txt file or instruct your users to use the Robots Meta Tag.

The robots.txt file is case sensitive and you should use all lowercase letters.

What To Put Into the robots.txt file

The "robots.txt" file usually contains a record looking like this:

User-agent: *
Disallow: /cgi-bin/
Disallow: /temp/
Disallow: /images/

In the above example, three directories are excluded. You need to separate the "Disallow" line for each directory.

A good source is: The Robots Text Pages.

If you wish to check the syntax of your robots.txt file, visit:

The Robot.txt Syntax Checker

Robots.txt File Facts

if it is present, search engines will obey it
without a robots.txt file Google will not index your site as deep
you cannot exclude "bad sites" using a robots.txt file as bad sites ignore the file
exclude your images folder to not allow the search engines (like Yahoo! and Google) to grab your images for their image directory

------
© 2000 - 2005, WebMarketingNow.com
Jerry West is the Director of Internet Marketing for WebMarketingNow. He has been consulting on the web since 1996 and has assisted hundreds of companies gain an upper-hand over their competition. Visit Web Marketing Now for the latest in marketing tips that are tested and proven.

The above article can be reproduced on your site or e-zine as long as the signature file.

Article Search Phrases: robots.txt, robots txt, exclude search engines, disallow search engines

Top 10 Tips

Search Engine Marketing
Looking for expert search engine marketing advice and SEO tips?
Speed Optimization
Speed of your web site — it is THE most important aspect of your site
WordPress
Want a website that’s as easy to update as writing and sending an email?
Taglines & Slogans
This is a list of some of the top slogans major corporations have used over the years.
Search Commands
You were looking for certain information on the web. After several attempts...
Words Strengthen
Words are the raw material from which you construct your presentation.
Google Sandbox – What to Do
Go to any Internet marketing forum and one of the topics is sure to be whether or not there is a “sandbox” at Google
Motivation Factor Index Quiz
What has kept you from succeeding in your life and career? The following quiz is designed
Time Management
Most small business owners and mid-sized company marketing managers have to wear multiple marketing hats all at once.
Retargeting
Retargeting is a form of online advertising keep your brand in front of bounced traffic after they leave your website.

Products

Seo Revolution

Google Best Practices

Link Building

Fresh PubDate

Link Privacy

Kitchen Table Copy

What We Offer

SEO

Web

Business

Life

Hosting

Domains

Marketing Tools

Design

Software

WordPress Plugins

Jerry West

Meet Our Team

Our Partners

Career with WebMarketingNow

Coding Tips

Robots.txt File – No Website Should Be Without One