Using htaccess to open the door for the Google bot / spider
October 09, 07 by ReniXThanks for visiting! If you're new here, you may want to subscribe to my Rss Feed. This blog posts regular Internet news, updates for apps, security, ideas, hacks, quick fixes and everything about hi-tech. Go ahead, subscribe to our feed!

Search engine bots, commonly known as “spiders”, are standard tools used by search engines to find and index web pages throughout the internet. In order for a spider to find and index a web page it must first discover it through manual submission on their site or more commonly a link from a site that has already been indexed by the spider. A common problem occurring with large scale dynamic sites today is that the links used throughout are query string heavy. This makes it difficult for a search engine spider to index deep within these large-scale sites because of specific precautions the spiders take when dealing with these types of URLs.
An example of a NON search friendly URL
http://www.mysite.com/page.php?v=value1&p=value2&c=value3
An example of a search friendly URL
http://www.mysite.com/page-value1-value2-value3.html
This white paper will detail the use of the . htaccess file for Apache based web servers to take dynamic, query-heavy URLs and turn them into clean, efficient, search engine friendly URL structures.
Glad to see you here and Thank You for your visit. My name is Lirent, born in February 1983 in a small city in Europe. This is my blog about Hacks, Tips and Free Resourse for the WordWideWeb.





