google webcrawler | Malav Shukla

Googlebot, Google’s web Crawler

November 16, 2009 at 4:47 am | Posted in SEO, SEO-Basic, Uncategorized | Leave a comment
Tags: googel spider, google, google indexer, google webcrawler, googlebot, search engine, search engine optimization, SEO, webcrawler

Googlebot is Google’s web crawling robot, which finds and retrieves pages on the web and hands them off to the Google indexer. It’s easy to imagine Googlebot as a little spider scurrying across the strands of cyberspace, but in reality Googlebot doesn’t traverse the web at all. It functions much like your web browser, by sending a request to a web server for a web page, downloading the entire page, and then handing it off to Google’s indexer.

Googlebot consists of many computers requesting and fetching pages much more quickly than you can with your web browser. In fact, Googlebot can request thousands of different pages simultaneously. To avoid overwhelming web servers, or crowding out requests from human users, Googlebot deliberately makes requests of each individual web server more slowly than it’s capable of doing.

Googlebot finds pages in two ways:

Through an add URL form, http://www.google.com/addurl.html, and through finding links by crawling the web.
Allows rapid access to documents that contain user query terms.

To improve search performance, Google ignores (doesn’t index) common words called stop words (such as the, is, on, or, of, how, why, as well as certain single digits and single letters). Stop words are so common that they do little to narrow a search, and therefore they can safely be discarded. The indexer also ignores some punctuation and multiple spaces, as well as converting all letters to lowercase, to improve Google’s performance.

Create a free website or blog at WordPress.com.
Entries and comments feeds.

Malav Shukla

Googlebot, Google’s web Crawler

Archives

Category

Email Subscription

Malav Shukla | SEO Specialist

Delicious

Follow Me on Twitter

Get Posts in Feeds!!!

Tags