The Proper Way To Use The robots.txt File Update

In my last article about the robots.txt file I spelled the file name wrong: it should have been robots.txt, not robot.txt. The corrected article reads as follows:

When optimizing a web site, most webmasters don’t consider using the robots.txt file. This is a very important file for your site. It lets spiders and crawlers know what they can and cannot index, which helps keep them out of folders that you do not want indexed, such as the admin or stats folder.

Here is a list of the fields that you can include in a robots.txt file and their meanings:

1) User-agent: In this field you name the specific robot the access policy applies to, or use “*” to cover all robots. This is explained further in the examples.

2) Disallow: In this field you specify the files and folders to leave out of the crawl.

3) #: The # character marks the start of a comment.
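
To make the three items above concrete, here is a minimal Python sketch that splits a robots.txt file into its fields and drops the comments. The parse_fields helper and the sample rules are made up for illustration; real crawlers use their own parsers.

```python
def parse_fields(text):
    """Return (field, value) pairs, skipping comments and blank lines."""
    fields = []
    for raw in text.splitlines():
        # Everything after a '#' is a comment and is ignored.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        name, value = line.split(":", 1)
        fields.append((name.strip().lower(), value.strip()))
    return fields


# Hypothetical sample file, not taken from a real site.
SAMPLE = """\
# keep crawlers out of the private folders
User-agent: *
Disallow: /admin/   # the admin folder
Disallow: /stats/
"""

fields = parse_fields(SAMPLE)
print(fields)
```

Only the User-agent and Disallow lines survive; both comments are discarded before the fields are read.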

Here are some examples of a robots.txt file:

User-agent: *
Disallow:

The above would let all spiders index all content.

Here is another example:

User-agent: *
Disallow: /cgi-bin/

The above would block all spiders from indexing the cgi-bin directory.
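
If you want to check rules like the two sets above before uploading them, one option is Python's standard urllib.robotparser module. This is only a sketch: the is_allowed helper, the example.com URLs, and the "AnyBot" agent name are assumptions for illustration, and a given search engine's own parser may differ in details.

```python
from urllib.robotparser import RobotFileParser

# The two rule sets discussed above.
ALLOW_ALL = """\
User-agent: *
Disallow:
"""

BLOCK_CGI = """\
User-agent: *
Disallow: /cgi-bin/
"""


def is_allowed(rules, agent, url):
    """Parse robots.txt text and ask whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(rules.splitlines())
    return parser.can_fetch(agent, url)


CGI_URL = "http://www.example.com/cgi-bin/search.cgi"   # hypothetical URL
HOME_URL = "http://www.example.com/index.html"          # hypothetical URL

open_site = is_allowed(ALLOW_ALL, "AnyBot", CGI_URL)
cgi_blocked = not is_allowed(BLOCK_CGI, "AnyBot", CGI_URL)
home_open = is_allowed(BLOCK_CGI, "AnyBot", HOME_URL)
print(open_site, cgi_blocked, home_open)
```

An empty Disallow value leaves everything open, while /cgi-bin/ blocks only URLs whose path starts with that folder.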

User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/

In the above example, googlebot can index everything, while all other spiders cannot index admin.php or the cgi-bin, admin, and stats directories. Notice that you can block single files like admin.php.
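
The per-robot behavior can be checked the same way with Python's standard urllib.robotparser module. In this sketch the example.com address, the file names under each folder, and the "OtherBot" agent are hypothetical; only the rules themselves come from the example above.

```python
from urllib.robotparser import RobotFileParser

RULES = """\
User-agent: googlebot
Disallow:

User-agent: *
Disallow: /admin.php
Disallow: /cgi-bin/
Disallow: /admin/
Disallow: /stats/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

SITE = "http://www.example.com"  # hypothetical site
PATHS = ["/admin.php", "/cgi-bin/form.cgi", "/admin/", "/stats/", "/index.html"]

# For each robot, record whether each path may be fetched.
results = {
    agent: {path: parser.can_fetch(agent, SITE + path) for path in PATHS}
    for agent in ("Googlebot", "OtherBot")
}

for agent, verdicts in results.items():
    for path, allowed in verdicts.items():
        print(f"{agent:10} {path:20} {'allowed' if allowed else 'blocked'}")
```

Googlebot matches its own section and is allowed everywhere; any other robot falls through to the “*” section and is kept out of the four listed paths.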

About The Author

Jimmy Whisenhunt is the webmaster at VIP Enterprises http://www.vipenterprises.org.
vipenter@vipenterprises.org