Search Engine Optimization & Social Media Optimization Updates: November 2009

Robots.txt and Its Importance

6:48 PM

<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>

Today's Inspirational Quote:

"It is more important to know where you are going than to get there quickly. Do not mistake activity for achievement."

-- Mabel Newcomber

<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>
Greetings!
How is it going?

It is great, isn’t it, guys, when search engines frequently visit your website and index your pages? You have a great deal to be glad about (for reasons you already know!). However, there are cases, rare though they are, when you would want a search engine NOT to index a few pages of your website, either for technical reasons or personal ones.

Technically speaking, if you have two versions of a page (one for viewing in the browser and another for printing), you would rather have the printing version excluded from crawling; otherwise you risk a duplicate content penalty. Also, if you happen to have sensitive data on your site that you do not want the world to see, you will prefer that search engines do not index it. Additionally, you may want to save some bandwidth by keeping robots away from images, stylesheets and JavaScript, and for this you need a way to tell spiders to keep away from ‘these’ items.

It’s here that the ‘robots.txt’ file comes to your rescue!

Robots exclusion standard – ‘Robots.txt’

Many search engines use programs called robots to locate web pages for indexing. These programs are not limited to a pre-defined list of web pages; instead they follow links on the pages they find, which makes them a form of intelligent agent. The process of following links is called spidering, wandering, or gathering. Once they have a page or document, the parsing and indexing of the page begins.

If a site owner wishes to give instructions to web robots about which pages to index and which pages NOT to index, he must place a text file called robots.txt in the root of the website hierarchy (e.g. www.example.com/robots.txt). Robots that follow the instructions FIRST look for and fetch this file to find out whether the site owner wants to restrict them from indexing certain pages. If this file doesn't exist, web robots assume that the site owner is happy for all pages to be indexed.
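To make this concrete, here is a minimal sketch of how a well-behaved robot might check a robots.txt file before fetching a page. It uses Python's standard `urllib.robotparser` module. The file content, the robot name `ExampleBot` and the page URLs are made-up examples, not taken from any real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for www.example.com (an assumption for illustration).
ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
Disallow: /private/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text, as a polite robot would do before crawling."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical robot name; the "*" rules apply to it.
allowed_home = parser.can_fetch("ExampleBot", "http://www.example.com/index.html")
allowed_print = parser.can_fetch("ExampleBot", "http://www.example.com/print/page1.html")

print(allowed_home)   # True: no rule covers /index.html
print(allowed_print)  # False: /print/ is disallowed
```

A real robot would download the file from the site root first; the text is hard-coded here only so the sketch runs without a network connection.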

‘Block or remove pages from being indexed by using a robots.txt file’

IMPORTANT: All respectable robots will respect the directives in a robots.txt file, although some may interpret them differently. A robots.txt file is in no way enforceable, and some spammers and other troublemakers may choose to ignore it. Password-protecting confidential information is recommended.

By definition, robots.txt, or the Robots Exclusion Standard (also known as the Robots Exclusion Protocol), is a convention for asking web spiders and other web robots not to access all or part of a website that is otherwise publicly viewable.

Creation of a Robots.txt file

To create a ‘robots.txt’ file, check this:

http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449

Generate ‘robots.txt’ files using the links given below:

http://www.mcanerin.com/EN/search-engine/robots-txt.asp

http://www.howrank.com/Robots.txt-Tool.php

Rule 1: Make sure it is named exactly ‘robots.txt’ (all lowercase).

Rule 2: This file must be uploaded to the root accessible directory of your site, not a subdirectory (i.e. http://www.mysite.com/robots.txt but NOT http://www.mysite.com/stuff/robots.txt).

Only if you follow the above two rules will search engines interpret the instructions contained in the file. Deviate from this, and "robots.txt" becomes nothing more than a regular text file.
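The two rules above boil down to one fixed location per site. As a small sketch (the helper name `robots_txt_url` is my own, not part of any standard), this is how you could work out where robots will look for the file, starting from any page URL:

```python
from urllib.parse import urlsplit, urlunsplit


def robots_txt_url(page_url: str) -> str:
    """Return the one place robots look for the file: the root of the site."""
    parts = urlsplit(page_url)
    # Keep the scheme and host, replace the path with /robots.txt,
    # and drop any query string or fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_txt_url("http://www.mysite.com/stuff/page.html"))
# http://www.mysite.com/robots.txt
```

Whatever subdirectory the page lives in, the answer is always the root, which is why a copy uploaded to /stuff/ is never read.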

Note-worthy Notes:

Robots.txt is a text (not HTML) file you put on your site.
Robots.txt is by no means mandatory unless you want to keep a few pages out of search engines.
Search engines generally obey what they are asked not to do, but you should NOT trust them blindly.
It is important to clarify that robots.txt is not a way of preventing search engines from crawling your site (i.e. it is not a firewall or a kind of password protection); putting up a robots.txt file is something like putting a note “Please, do not enter” on an unlocked door.
If you have really sensitive data, it is NOT RECOMMENDED to rely on robots.txt.
In the original REP, directory paths start at the root for that web server host, generally with a leading slash (/). This path is treated as a right-truncated substring match, i.e. it has an implied right wildcard.
You need a robots.txt file only if your site includes content that you don't want search engines to index. If you want search engines to index everything in your site, you don't need a robots.txt file (not even an empty one).
For websites with multiple subdomains, each subdomain must have its own robots.txt file. If example.com had a robots.txt file but a.example.com did not, the rules that would apply for example.com would not apply to a.example.com.
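The ‘implied right wildcard’ mentioned in the notes above is easy to misread, so here is a rough sketch of it. This is a simplified illustration of prefix matching only, not a full robots.txt parser, and the function name `is_disallowed` and the sample rules are hypothetical.

```python
def is_disallowed(path: str, disallow_prefixes: list[str]) -> bool:
    """Original-REP style matching: a rule matches every path that starts with it."""
    # An empty Disallow value means "nothing is disallowed", so empty rules are skipped.
    return any(rule and path.startswith(rule) for rule in disallow_prefixes)


rules = ["/print", "/private/"]  # hypothetical Disallow values

print(is_disallowed("/print/page1.html", rules))         # True
print(is_disallowed("/printers.html", rules))            # True: "/print" also matches "/printers.html"
print(is_disallowed("/public/print/page1.html", rules))  # False: the match starts at the root
```

Note the second result: leaving off the trailing slash in "/print" blocks more than the /print/ directory, which may or may not be what you intended.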

Note: The concept and structure of robots.txt were developed more than a decade ago. If you are interested in learning more, visit http://www.robotstxt.org/ or go straight to the Standard for Robot Exclusion, because in this article we deal only with the most important aspects of a robots.txt file. For more information you may also read: http://www.searchtools.com/robots/robots-exclusion-protocol.html

Thanks!

V-Empower Inc: Robots Topic Today 26-Nov

SEO – Session Part - XI

2:03 PM

<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>

Today's Inspirational Quote:

"The one who asks questions doesn't lose his way."

-- African Proverb

<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>~<>

Moving forward, today we’ll be learning Ways to count Index links:

…Oh wait a second!

D’you know what ‘Index links’ are?

Index links: the count of links or pages of your website that have been indexed (noted down, recognized or picked up) by the search engines.

‘The more index links you have, the greater the chance of your site being listed in search engines.’

Note-worthy note: Make sure your website is built in a search-engine-friendly format so that search engines have easy access to all the links in it. You must have read one of our previous sessions, ‘SiteMaps’, which explains how sitemaps help search engines index all your pages. It is indeed very, very important. Go back and check if you’ve missed it… Now!

Well, here we stop the history and proceed with the future. Let's learn ways to count the links of your website indexed by 3 of the world’s major Search Engines: Google, Yahoo & Bing.

Way 1:

For Google:

Step1: Go to http://www.google.com/

Step2: Type this: ‘site:v-empower.com’ and click the ‘Search’ button.

Format: site:domain.com

NOTE: Do NOT put a space before or after the colon.

For Yahoo:

Step1: Go to http://siteexplorer.search.yahoo.com/

Step2: Type only the domain name of your website.
Example: if your website’s URL is http://www.v-empower.com/

Type: v-empower.com
Step3: The ‘Pages’ count is your index link count.

For Bing:

Step1: Go to http://www.bing.com/

Step2: Type this: ‘site:v-empower.com’ and click the ‘Search’ button.

Format: site:domain.com

NOTE: Do NOT put a space before or after the colon.
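If you check several websites, typing these queries by hand gets tedious. Below is a rough sketch that builds the ‘site:’ query and a search link for Google and Bing. The helper names and the search URL patterns in `SEARCH_URLS` are my own assumptions for illustration; adjust them if the engines use a different address.

```python
from urllib.parse import quote_plus, urlsplit

# Assumed search URL patterns (not taken from the steps above).
SEARCH_URLS = {
    "google": "https://www.google.com/search?q=",
    "bing": "https://www.bing.com/search?q=",
}


def bare_domain(url_or_domain: str) -> str:
    """Reduce 'http://www.v-empower.com/' to 'v-empower.com'."""
    text = url_or_domain.strip()
    # urlsplit only recognises the host when the text contains '//'.
    host = urlsplit(text if "//" in text else "//" + text).netloc
    return host[4:] if host.startswith("www.") else host


def site_query(url_or_domain: str) -> str:
    """Build the query with no space before or after the colon."""
    return "site:" + bare_domain(url_or_domain)


def site_search_url(engine: str, url_or_domain: str) -> str:
    """Build a link that runs the 'site:' query on the chosen engine."""
    return SEARCH_URLS[engine] + quote_plus(site_query(url_or_domain))


print(site_query("http://www.v-empower.com/"))
# site:v-empower.com
print(site_search_url("google", "v-empower.com"))
# https://www.google.com/search?q=site%3Av-empower.com
```

The colon shows up as %3A in the link because it is URL-encoded; the search engine still receives ‘site:v-empower.com’.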

Way 2:

Using the ‘SEO toolbar’ – follow the below steps:

Step1: Right-click on the ‘Quirk Icon’ (shown as circled ‘q’ in your bar)

Step2: Now select the ‘Show Indexed Pages’ option.

Step3: Choose a Search Engine.

Step4: A new page will open up in your browser listing all the indexed links of your website by that particular Search Engine.

NOTE: You may click option ‘All’ if you want to know index link count for all the available Search Engines – Each Search Engine result is displayed in a separate window.

Image below will help you understand my point better:

Well, that’s all for today, Optimizers! See you tomorrow…

Keep counting till then!

V-Empower Inc.: SEO Training Session Part XI - 26-Nov


SEO – Session Part X

4:08 PM

SEO Tool Bar - Backlinks for Google, Yahoo and Bing

~~**~~

“Pause to Ponder: Those who can't laugh at themselves leave the job to others.”

~~**~~

Continuing from yesterday, I will now show and explain how the SEO toolbar looks when installed in your browser, what information it gives, and where.

SEO TOOLBAR:

Ways to count backlinks:

Way 1:

For Google:

Step1: Go to http://www.google.com/

Step2: Type this: ‘link:v-empower.com’ and click the ‘Search’ button.

Format: link:www.domainname.com

NOTE: Do NOT put a space before or after the colon.

For Yahoo:

Step1: Go to http://siteexplorer.search.yahoo.com/

Step2: Type only the domain name of your website.
Example: if your website’s URL is http://www.v-empower.com/

Type: v-empower.com
Step3: The ‘Inlinks’ count is your backward links count.

For Bing:

Step1: Go to http://www.bing.com/

Step2: Type ‘Link: http://www.domain.com/’

Example: if your website’s URL is http://www.v-empower.com/

Type: Link: http://www.v-empower.com/

NOTE: Put a space after the colon only in Bing.
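The spacing rules differ between Google and Bing, so here is a small sketch that writes the backlink query for you, following only the steps described above. The function name `backlink_query` is hypothetical, and the sketch only builds the text to type; it does not check how each engine treats the operator.

```python
def backlink_query(engine: str, target: str) -> str:
    """Build the text to type into the search box, following the steps above."""
    engine = engine.lower()
    if engine == "google":
        # Google: no space before or after the colon.
        return "link:" + target
    if engine == "bing":
        # Bing: one space after the colon, capitalised as in the steps above.
        return "Link: " + target
    raise ValueError(f"no rule described for engine: {engine}")


print(backlink_query("google", "v-empower.com"))
# link:v-empower.com
print(backlink_query("bing", "http://www.v-empower.com/"))
# Link: http://www.v-empower.com/
```

Yahoo is left out on purpose: there you type only the domain name into Site Explorer and read the ‘Inlinks’ count.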
---------------------------------------------------------
Hint of humor a day – keeps the tensions away!

Sardar jee’s Interview:
Interviewer: How does an electric motor run?

Santa: Dhhuuuurrrrrrrrrr. ....

Interviewer shouts (amazed & angry): Stop it !!

Santa: Dhhuurrrr dhup dhup dhup...
---------------------------------------------------------

Way 2:

After installing the ‘quirk toolbar’, place it anywhere on the page according to your convenience. Then follow the below steps:

Step1: Right-click on the ‘Quirk Icon’ (shown as circled ‘q’ in your bar – check above extreme left)

Step2: Now select ‘Show Backward Links’ (the last option)

Step3: Click on ‘Domain’ option from the displayed ones.

Step4: Choose a Search Engine.

Step5: A new page will open up in your browser showing your website’s backward links.

NOTE: You may click option ‘All’ if you want to know back link count for all the available Search Engines – Each Search Engine result is displayed in a separate window.

The image below will help you understand my point better:

Well guys… That’s all for today.

We will learn counting ‘Index links’ tomorrow…

V-Empower Inc: SEO Training Session Part X

