Finding Duplicate Content with Free Tools

by Shimon Sandler on July 6, 2009

Checking for duplicate content is part of the process of analyzing a website. The most common types of duplicate content to look for is in Title tags, Meta descriptions, product descriptions, dynamic URLs, and Canonicalization.
Fighting off the multi-headed monster of duplicate content
Many websites reuse the same Title tag for their entire site. That’s duplicate content.

Many ecommerce sites resell products from manufacturers will often use the product descriptions provided by the manufacturer. Hence, an abundance of duplicate content.

Top 6 Duplicate Content Mistakes:
1. Circular Navigation – multiple paths though a website.
2. Print-Friendly Pages – Fix: block search engine’s from printer friendly pages.
3. Inconsistent Linking
4. Product Only Pages – with nothing differentiating them from other pages.
5. Transparent Serving
6. Bad Cloaking

There are various ways to check for Duplicate Content. You can do a manual check, and/or use tools to check for dupe content. There are a ton of fee-based SEO tools out there that bundle a duplicate content tool into their suite of tools. However for this post, I’m just going to stick with just the free tools.

Free Tools

Copyscape will check to see if any other site contains the same text as your page. This is great to see all the other ecommerce sites that have the same product description.

Yahoo Explorer is a great free tool to find duplicate content. Make sure you see the results for Pages. Then, Export results to TSV. Open in Excel.

The spreadsheet will contain the TITLE tag, URL, SIZE, and FORMAT. Sort the data by descending or ascending for the Title tag. This will enable you to quickly scan the document for batches of duplicate content.

Xenu Link Sleuth is a popular free tool. This is an application download. Besides providing the same data as Yahoo Explorer, the Xenu Link Sleuth can detect broken links. Then, export to Excel. Sort data as needed.

Google Webmaster Tools. Go to Diagnostics, HTML suggestions, Duplicate title tags – next screen is a list, with a hyperlink to “Download this table”.

Additional Reading:
How to find your website Duplicate Content Issues
How to Remedy Duplicate Content
Google Dupe Content
Site Explorer is a First Step
Duplicate Content Issues

Photo credit: kajojak

Share and Enjoy:
  • StumbleUpon
  • Sphinn
  • del.icio.us
  • Reddit
  • Facebook
  • Propeller
  • TwitThis

No related posts.

{ 10 comments… read them below or add one }

Jenny Stradling July 7, 2009 at

I always use CopyScape, but here`s an interesting little factoid for you. One client of mine has a e commerce website with multiple product pages. At one point all of his pages were ranking well (both PR and in the SERPs) except for one page. We couldn`t figure it out! I ran the page through CopyScape and it came back with dup content. I asked him to rewrite the content, explaining it was the reason for the penalty. He explained to me that the content came directly from the product manufacturer and it was a requirement to include in order to sell the product. Long story short, he had to contact the manufacturer to request to use unique content, but as soon as he changes it, tah dah! The page was ranking within weeks. Something to keep in mind with reseller clients! My client never even thought twice about the manufacturer`s product description reading as dup content for the search engines!

Reply

Sarah September 4, 2009 at

I prefer to use PlagiarismDetect when checking plagiarism.

Reply

billyberue September 6, 2009 at

I am very careful about duplicate content. Quite interesting about the commerce store above. Wouldn’t that be indicative of most e-commerce sites then…meaning the description of the products?

Reply

SEO Link Company December 16, 2009 at

I really watch out on creating a tag, I sometimes tend to forget that I already used them, or duplicate content. i’m careful now.

Reply

Pamelia & SEO Services April 21, 2011 at

Generally I use copyscape to detect plag!

Reply

Bangladesh Outsourching July 8, 2011 at

I always use CopyScape &I think CopyScape is the best for duplicate content.

Reply

Carbon Accounting Software July 11, 2011 at

Nice tool copyscape i am daily checking the post before posting and after posting it, to check whether some body is copying my content

Nice Post

Nancy

Reply

Assurance auto non paiement September 14, 2011 at

i use also copyscape to detect plag

Reply

david November 27, 2011 at

Thanks for your post. It’s very helpful for a newbie like me :) .
As for balise title, how can we count the number of duplicate titles of a website?

Reply

????? ??????? December 12, 2011 at

I use copyscape too. But i need a free tool to do it.

Reply

Leave a Comment

*

Previous post:

Next post: