11th giugno 2009 by WebAir

How to Avoid Duplicate Pages?
Una delle Penalità che Google può dare ad un sito è basata sulla eventuale presenza di Pagine Duplicate.
Cosa fare nel caso in cui un sito abbia pagine duplicate ovvero pagine che differiscono solo per l’url ma che presentano lo stesso contenuto?
Ecco una lista di consigli su come evitare tale problema e quindi migliorare l’indicizzazione delle pagine di un sito su Google.
One of the Penalties that Google can give to a site is based on the eventual presence of Duplicate Pages.
What to do if a website has duplicate pages or pages that differ only in the url but with the same content?
If you search your websites or other websites on google you can find different pageg with the same content. The difference it’s just the url, for example:
yoursite.com www.yoursite.com yoursite.com/index.htm
or wiki pages…
wikiname.com/product1wikiname.com/article/product1wikiname.com/category/product1
We must avoid this situation because Google takes his time to check this duplicate pages and not for new pages or new contents on our website.
Here is a list of Tips on how to avoid this problem and thus improve the indexing of pages of a site to Google.
(click on photos for large version)
Use Google WebMaster Tools
If you do not already know them, are tools provided by Google to help webmasters in the process of indexing their websites.
URL > http://www.google.com/webmasters/tools/
After the login you can select your website from the list and manage it.
In the Side Menu there’s the Diagnostic submenu, this provides you suggestions and warning about you website. For example, Duplicate Meta Descriptions and Duplicate Title Tags. The Diagnostic Tool provides you the url with the duplicate tags so you can check it and solve the problem.
Delete the Duplicate Pages from the XML Sitemap
Check your Sitemap and if there’s the URL of a Duplicate Page, delete it and be sure that only the original versione of the page remains on the XML Sitemap!
Do not do this
<url> <loc>http://www.yourname.it/</loc> <priority>1.00</priority> <lastmod>2009-12-27T14:22:55+00:00</lastmod> <changefreq>daily</changefreq> </url> <url> <loc>http://www.yourname.it/index.html</loc> <priority>0.80</priority> <lastmod>2009-03-11T14:22:55+00:00</lastmod> <changefreq>daily</changefreq> </url>
...
and do this
<url> <loc>http://www.yourname.it/</loc> <priority>1.00</priority> <lastmod>2009-12-27T14:22:55+00:00</lastmod> <changefreq>daily</changefreq> </url> <url> <loc>http://www.yourname.it/category1.html</loc> <priority>0.80</priority> <lastmod>2009-03-11T14:22:55+00:00</lastmod> <changefreq>daily</changefreq> </url>
...
The Error of the Index Page
In a webpage we have the Main menu with the link for the home page, the Logo and the Heading element H1. These have the link for the home page; now remember that you can see the home page in different ways:
yoursite.com
www.yoursite.com
yoursite.com/index.htm
The Tip it’s don’t do this:
<a href="index.html" title"your title" name="your-title">
but do this
<a href="/" title"your title" name="your-title">
…and also in this case, remove index.html (index.php, or index.asp, etc…) from your XML Sitemap and insert only www.yoursite.com
HTTP 301 Status Code
If you have one or more pages with the same contents you can choose to redirect them to the original page using the 301 Status code, a simple and useful redirect.
The message that receives Googlebot it’s “Moved Permanently“, so it will visit only the original page.
You can use it in different ways:
Redirection with META Refresh (the easiest way!)
<META HTTP-EQUIV="REFRESH" CONTENT="0; URL=http://www.website-name.com/original-page.html">
Redirection with Javascript
<html> <head> <script type="text/javascript"> window.location.href='http://www.website-name.com/'; </script> </head> <body> This page has moved to <a href="http://www.website-name.com/">http://www.website-name.com/</a> </body> </html>
HTTP 301 Redirect in PHP
<?php
// Permanent redirection header("HTTP/1.1 301 Moved Permanently"); header("Location: http://www.website-name.com/"); exit(); ?>
For the complete list of the Permanent Redirect Methods (perl, cold fusion, asp, etc…) >Permanent Redirect with HTTP 301
…but the 301 Redirect it’s not the only possible redirect!
The Canonical Page
If you have duplicate pages you can choose to add in the head section this code:
<link rel="canonical" href="http://www.website-name.com/original-page.htm"/>
You can use it for relative or absolut links. The canonical link it’s a suggestion for Googlebot, not a directive.
Remember: use it only if you can’t delete the duplicate pages or the content of these. The content of the pages must be identical!
Duplicate Pages and Robots.txt
An other simple tip to solve the problem of the duplicate pages! In robots.txt you can choose the directories that Googlebot does not follow.
How do this? It’s simple!
User-Agent: * Disallow: /directory/subdirectory/ Disallow: /directory/file.html Allow: /
In this way Googlebot doe’s NOT follow /directory/subdirectory/ and /directory/file.html but follows the others. With Google Webmaster Tools you can automatically generate your robots.txt in a few clicks.
For more informations about robots.txt visit the official website: http://www.robotstxt.org/
Comments/Suggestions are welcome!
Follow us on Twitter for Extra-News and Resources!












This article really helped me. Thank you! This is an awesome resource for identifying and remedying problems and pitfalls we all have with our web properties, from time to time. Even if someone were to have a doctorate in Internet marketing, if one were available, it would be impossible to keep up with all of the changes and the technological advances…not to mention the constant algorithmic changes Google dreams up and implements. Well done!
Professor John P. J. Zajaros, Sr.
The Internet Marketing Quest Revealed
The Ultimate Internet Image
SEO Tips – How to Avoid Duplicate Pages?…
One of the Penalties that Google can give to a site is based on the eventual presence of Duplicate Pages.
What to do if a website has duplicate pages or pages that differ only in the url but with the same content?
…
Bookmarked your article, thanks! regards, pp
Superb Blog here.. I been using Akisment for my wordpress blog and wondering if it does a good work of protecting spam as many pass through? A reply would be helpful mate.
I found your blog on google and read a few of your other posts. I just added you to my Google News Reader. Keep up the good work. Look forward to reading more from you in the future.
Optimization is a key aspect of getting your site noticed. Thanks for the info.
if there are duplicated contents… Before Google will implement penalty, google will look first the site age. In that way,search engine can tell who posted first…
Search engines constantly works towards refining their technology to crawl the web more severely and return progressively more relevant results to the users. The higher a Website ranks in the Search Engines more users will visit the website it means you will get high traffic. In other words you can say that Search Engine ranking.
great tips brother, thanks for share. its very useful.
HaI had considered the apparently simple ways a search engine like Google works. The issue is that even though a spider indexes your page numerous times, it still takes a tonne of effort on your part in order to get a page to become interesting to the big G. I guess this adds to my understanding of search engine optimization!
I think this PR 6 health related article directory is very relevant
Hi. I treasured to drop you a quick note to verbalize my thanks. I’ve been observing your blog for a month or so and have plucked up a heap of fabulous information as well as enjoyed the way you’ve structured your site. I am seeking to run my own blog however I think its too general and I would like to focus more on smaller topics.
Very impressive website, as well as some good ideas in your post. I’ll be back her for sure. Thanks for the good content.
This is a great blog. I have just started with affiliate marketing and am after all the information that i can get! I will be checking back soon. I have just opened a clickbank account and will report back to you guys on whether its easy to do any online job!Any advice is appreciated.
Hey I found your website by mistake on msn while hunting for something totally irrelevant but I am very glad that I did, You have just captured yourself another subscriber.
Nice brief and this fill someone in on helped me alot in my college assignement. Gratefulness you on your information.
Could you go into more detail on this? Btw, the advice you gave me is really good.
Dressers Furniture
Thank you for the SEO insight! I have a question, hoping someone here could help me. I have tried ‘on page seo’ for my blog, and getting backlinks. But I am still not getting any visible results! Do you have any other advice for backlinking? I tried what I could understand already. Thanks again!!
Just checking in and paying my compliments to another marketer who posts about search engine listings.
I usually go online most of the time and read. I am fascinated about SEO at its been 3 months since I started reading. And right now, I what I know are few basic and still learning until now. I found your post very interesting.Very informative. Visit this page
If you want to have more information about Blackhat World and Google,check out this page
This is a cool blog from blackhat world. Very nice and informative.Check it out.,Click here
I love the information you posted here. Its a very good site. I am learning seo by myself. This kind of information really helps a lot. Thanks!Check out my blog about blackhat world.Click here
You got a very nice post on your blog. I am learning a lot from it. I found this blog interesting also.Check this out..Click here
I want to know more about this stuff.I am to know everything about it. Check out my blog..Click here
Your post is one of my favorites.Very informative.Thanks so much for sharing. Looking for a next post soon..Click here
This is a very good source of information especially for those who wants to learn more about this stuff. I will recommend this post to my friends. And also check out my blog.Visit here
This is a nice post. Thank you for sharing.Check out my Blog.Visit here. Thank you
Can you please tell me more about SEO?I happened to read this blog and it is very interesting.Click here.
This is cool post. I am learning a lot from it.Also check out this one.Click here.
You know it’s posts like this that can easily spur people on to master about this. I found it to be pretty informative. I will be coming back here for more reading as I really enjoyed this!
I could not have said it better myself. Remember to celebrate your victories- even the small ones!
Hey I came across your blog by luck on feedburner while trying to find something really unrelated but I am very pleased that I did, You have just captured yourself another subscriber.
Thanks for this great article. i found your blog on bing and find it is very useful.
I am quite sure that becoming an internet marketer is an easy and efficient way to earn money and based on our effort it promotes you to the extreme.