Duplicate Content and SEO: Is There Really a Penalty?

Posted on in Blog

There are thousands of factors that impact your website’s organic performance. One of the factors long thought to be high on Google’s list is duplicate content. Having identical or similar pages causes keyword cannibalization, reduces your domain authority and can negatively impact user experience. Luckily, duplicate content issues are often easy to resolve once you know how to find them.

What is Duplicate Content?

Duplicate content is any text, photo or other HTML element that appears in more than one place on the internet. That includes all URLs, whether they’re pages on your site or elsewhere on the web. It’s often best to break duplicate content down into two categories:

  • Internal duplicate content – More than one page on a domain has identical or similar content. This often happens when “duplicating” pages to make additional product, service or blog pages.
  • External duplicate content – More than one page on multiple domains has similar content. This is usually the result of plagiarism or, in some cases, an organization replicating site content and using a new domain without redirecting users from the old domain.

Why is Having Duplicate Content an Issue for SEO?

Duplicate content negatively impacts two audiences. Admittedly, one is an algorithm, but both search engines and site users are tripped up when your site has multiple pages with the same content.

Related: The Big, Bad List of SEO Terms You Need to Know

SEO, Duplicate Content and Google

You may have heard of Google’s duplicate content penalty, wherein Google actively penalizes sites with duplicate content issues. There’s good news; that’s a myth. The bad news? It’s still bad. Search engines like Google won’t know which page to index, which page should rank for organic search results, or which duplicate page URL to attribute page authority and link equity to.

For site owners, it’s important to remember that Google avoids showing more than one version of your content on the SERP. Duplicate content reduces the value of all the duplicated pages, not just the newest version.

How to Find Duplicate Content and Fix It

It’s important to identify both internal and external duplicate content issues. It’s usually best to dig into internal duplicates first because you’ll be able to take action yourself to resolve the problem.

Use Site Search

You can quickly find similar or duplicate content using a simple site search. In a Google search bar, type in site: yourdomain.com and then a keyword, such as:

site:oneupweb.com SEO

This will return only results from your domain and include any URLs that contain the keyword. Using a site search is best for longer tail keywords; as you can see, a short-term keyword on a topic you cover extensively will return a lot of results!

Screenshot of a site search on SEO topics at Oneupweb.com.

Use a Dedicated Tool

We use Screaming Frog for most of our in-depth site crawling work. This paid tool includes a handy duplicate content tool that identifies all troublesome pages and other SEO optimization opportunities like duplicate page titles, meta descriptions and more.

You can also use a free online duplicate site checker like Siteliner. It will analyze up to 250 pages of your domain for free. Depending on the size of your site, this might be all you need, or it can offer up a snapshot of duplicate content out of this sample size.

How to Avoid Duplicate Content in the First Place

You can create duplicate content accidentally. As much as 29% of all web content could be considered duplicate.

URL parameters – Any URL variation, including those containing click tracking code, can contribute to duplicate content problems. The addition of these code snippets impacts both the original URL and any other URL variations. Check out the following URLs:

oneupweb.com/blog/duplicate-content?#texthow

and

oneupweb.com/blog/duplicate-content?utm_source=fb&utm_medium=feed&utm_campaign=bedliners&utm_id

Both URLs refer to the same page and could cause that content to be considered duplicate.

HTTP and HTTPS pages – Some domains include both non-secure HTTP and secure HTTPS protocols. If both page versions are live, search engines will consider it duplicate content.

Copied or reproduced content – Many ecommerce sites struggle with duplicate content if they include product descriptions on multiple pages. Using “pre-packaged” product descriptions can also result in duplicate content issues, with multiple sites displaying identical copy pulled from a centralized catalog provided by the distributor or manufacturer.

Fixing Duplicate Content Problems

Resolving duplicate content issues comes down to pointing users and search engines to the right URL. In most cases, the right URL is the most recent, informative and robust page; consider analyzing each page version to see which ranks for the most organic keywords. In most cases, this will also be the URL that captures the most organic sessions.

Once you know where to point search engines, here’s how to fix duplicate content.

Use a 301 Redirects

 This will redirect – literally point – all traffic from “duplicate” URLs to the “right” URL. This will resolve the issue and is often the best solution. Most Content Management Systems (CMS) offer a redirection tool. In WordPress, log in to your dashboard, go to Tools and select “Redirection.”

Screenshot of the Tool panel in a WordPress CMS.

Canonical Tags

 Also known as rel=”canonical,” is a piece of code or tag that can be added to the HTML of any webpage. Implementing the rel=”canonical” tag is like slipping a note to search engines that says, “Hey, this page exists, but you should really give credit to the OC (original content) over there.”

Add Self-Referential Canonical Tags to Pages

 If you’re putting out great work, it’s only a matter of time until scrapers try to steal it. Site scrapers take content from other websites to use on their domains. This is the most common form of external duplicate content, but it’s easy to fight. Simply add a canonical tag to the page that references the same page’s URL. This is often enough to deter scrapers and if they do port over your entire code, the tag will ensure your URL gets credit as the source.

Let Us Do the Duplicate Digging

A technical site audit from Oneupweb is an excellent way to identify common issues like duplicate content. Our SEO experts dive deep into your domain to identify and prioritize tweaks that will make your website perform perfectly. To get the most out of your digital assets, get in touch or call 231-922-9977 today to get started.

Up Next

Search engine results pages (SERPs) are always changing. Google is constantly testing new SERP features to provide users with information more efficiently and often without clicking through a result to a domain. The no-click SERP has changed the way we search the internet, but how big of an impact do SERP features have on important...

Read More