Hack

Using htaccess to open the door for the Google bot / spider

October 09, 07 by ReniX
advertisment

Thanks for visiting! If you're new here, you may want to subscribe to my Rss Feed. This blog posts regular Internet news, updates for apps, security, ideas, hacks, quick fixes and everything about hi-tech. Go ahead, subscribe to our feed!

Search engine bots, commonly known as “spiders”, are standard tools used by search engines to find and index web pages throughout the internet. In order for a spider to find and index a web page it must first discover it through manual submission on their site or more commonly a link from a site that has already been indexed by the spider. A common problem occurring with large scale dynamic sites today is that the links used throughout are query string heavy. This makes it difficult for a search engine spider to index deep within these large-scale sites because of specific precautions the spiders take when dealing with these types of URLs.

An example of a NON search friendly URL

http://www.mysite.com/page.php?v=value1&p=value2&c=value3

An example of a search friendly URL

http://www.mysite.com/page-value1-value2-value3.html

This white paper will detail the use of the . htaccess file for Apache based web servers to take dynamic, query-heavy URLs and turn them into clean, efficient, search engine friendly URL structures.

What are the benefits of search friendly URLs

Read the rest of this entry »

Close
E-mail It