Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Duplicate content penalty
-
when moz crawls my site they say I have 2x the pages that I really have & they say I am being penalized for duplicate content. I know years ago I had my old domain resolve over to my new domain. Its the only thing that makes sense as to the duplicate content but would search engines really penalize me for that? It is technically only on 1 site. My business took a significant sales hit starting early July 2013, I know google did and algorithm update that did have SEO aspects. I need to resolve the problem so I can stay in business
-
Thx Jane- No I wasn't aware of that. I don't get it because I put canonical tags right under the Head and I used the code below to do it. I will check again but am unsure how to fix it
I don't even know how to fix coding on the Http://cheaptubes.com site. It seems like when I add content to the canonical site it updates all of them. Thx for pointing out errors, you are giving me something to fix and improve.
-
Hi again,
Are you aware that you have a canonical tag on http://cheaptubes.com that points to a non-existent URL? i.e. http://i.imgur.com/yEd2377.png
http://www.cheaptubes.com/default.html
If http://cheaptubes.com/ 301 redirected to http://www.cheaptubes.com/, this would resolve the issue.
Are you aware that the www version of your site shows for a brand search (https://www.google.co.uk/search?q=cheaptubes.com&oq=cheaptubes.com&aqs=chrome..69i58j69i60l2j69i57j69i60j0.3367j0j4&sourceid=chrome&es_sm=91&ie=UTF-8) but that the canonical tags on each page point to the non-www version, e.g. http://i.imgur.com/P7Tizsv.png and http://i.imgur.com/lhTA95w.png?
The canonical tag on the www.cheaptubes.com/ page also points to http://www.cheaptubes.com/default.html. Sorry to show so many errors, but it doesn't look like canonicalisation has been implemented properly here.
-
Thx Jane - I may not have put a canonical tag on that page yet but its the same for every page. I can't access the http://cheaptubes.com but I can access the canonical version to publish. I did put canonical tags on most of my other pages such as the SWNTs page but it still shows a non canonical version when moz crawls it. Perhaps a 301 from http://cheaptubes.com to the canonical page? I'm just not sure how to handle it.
-
Hi Mike,
Are you saying that there is a canonical tag on http://cheaptubes.com/cntmaterialsafetydatasheet.htm, pointing to http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm? This would solve the duplicate content problem, but I do not see a canonical tag on either of those pages...
-
Hi Everyone - I'm hoping you can help me out again. I have a functional 301 on cheaptubesinc.com. that cleared about 1/2 my dup content penalty on the moz crawl this week. As you can see in the results below, I still have 57 pages with dup content according to Moz.
57 Duplicate Page Content
13 4XX (Client Error)
57 Duplicate Page TitleI checked and I think it is mostly a canonical problem. I do have Rel Canonical tags on all my pages. When I clicked on the 1st one it appears that is the case, see below
cheaptubes.com carbon nanotubes msds
http://www.cheaptubes.com/cntmaterialsafetydatasheet.htm29414872001 duplicate
cheaptubes.com carbon nanotubes msds
http://cheaptubes.com/cntmaterialsafetydatasheet.htm25My question is, do I need another 301 from http://cheaptubes.com to the canonical version? I'ld rather not since I had to fight with network solutions for a week for them to add the / after .com so my other pages would work. Is this a penalty I should still be concernedabout given that I have the rel canonical tags? Please let me know your thoughts on thisMike
-
Thanks Everyone. I got the old url cheaptubesinc.com 301'd to cheaptubes.com this week. Of course network solutions left off the / after .com and before the page name so that only the home page would 301 and they could try to sell me more 301s, it cost $60 for 1 and I have 48 pages on my site. I called and emailed them all week and they kept saying they had done it right and and they couldn't force google to change the links. I then realized if I typed www.cheaptubesinc/graphene.htm that it didn't work because it 301'd to www.cheaptubes.comgraphene.htm. They were argumentative with me even though I was polite with them even though I didn't want to be. I finally got a tech on the phone who said he would add the slash and ask his boss for forgiveness. However given the history of having the domain parked and pointed before and that not working over time & now this, I think my best bet is to transfer my domains to someone else. I heard bluehost is good. My concern is if they were that unethical in our dealings and the boss was argumentative in emails than they could go in an remove the slash at any time.
I also found a ton of code errors right at the top of my pages. I now know it was from putting up temporary messages but not checking to make sure the code was clean. The woman I bought my them from (6.5 years ago I paid her $60 and she still helps me for free, what difference between her & NS) notice open H1's & P elements at the top of the pages. I was still ranking well for acronyms but missing out on the long keywords since last july which caused my sales to drop off. I figure I lost at least 150K in sales because I neglected my website and didn't clean up the code on my pages a painful lesson I won't soon forget. On tuesday, when I searched single walled carbon nanotubes I had to go 8 pages back in google to find my page. By week's end I was #8 on page 1 and ahead of sigma aldrich a major materials supplier.
Thank you so much for your help everyone, it is sincerely appreciated
Mike
-
Thank you Oleg - I did put the tag
into the head right below the robots & google bot code on every page. I mistakenly deleted some very old non updated pages. Thx to moz, i have a list of the pages and will contact hosting co to 301 it. I think I ultimately have to 301 each page. I had moz recrawl my site last night but it said it dropped from 100 duplicate content penalties to 89, an improvement but not the one I hoped for. I did have a client tell my the site was down today, contacted network solutions and they said it was up now but they had an outage last night. Perhaps it affected the moz recrawl,but I can't know that. I also want to change the names of the pages as an interim measure before I update the site to newer format. Should I create new optimized by name pages first and then get on the phone with tech support and 301 them all to the knew pages? seems logical but so did deleting old pages until moz couldn't find them, then i realized the bots will count it against me rather than the housekeeping that it was.
Mike
Mike
-
-
ok, got it, thank you so much Jane
-
Hi,
You don't need to redirect at all (with a 301 or otherwise) if the canonical tag is in place. So don't worry about that at all - both URLs can load together if the canonical tag points Google from the "duplicate" to the "correct / canonical" one. Sorry if that wasn't clear.
I am not sure the frequency of Moz's crawling or if you can force a refresh, I'm sorry.
-
Thx Jane
The problem is I can't simply 301 it because I'm not on apache. I can do the canonical tag. Of course I've already gone in and changed it over to the tag + refresh but server is down so it won't publish right now. I was trying to get it done ahead of moz crawling my site today. Is there a way to get moz to recrawl it after the changes are updated or do I need to wait another week?
-
Hi,
Hard to say, but it definitely won't have helped. As Bryan says, you've split authority between over twice the number of pages the site should have, and Google can take action against sites that produce a large amount of duplicate content. I'd get the canonical tags in place (and thoroughly check they're set up right, as it can be a mess if they're implemented incorrectly) and check on progress over two or three weeks. If you see nothing happen, I'd say your reason for dropping could be something else.
-
Hi again,
The canonical tag sounds like the right way to go for you.
Regarding the meta refresh method of redirection - this works perfectly for users... it was always the case that search engines did not honour this as a redirect though. This may have changed in the recent past (and realistically, it should have - a lot of people used this tactic for redirection and Google should understand that it shows a moved page). However, it is generally thought that the meta refresh does not pass all authority (as noted here), and this thread shows a Googler advising against it (this is a post from 2010 though).
Honestly, with the canonical tag, you don't need to do the refresh / redirection - this will take care of the issue
Cheers,
Jane
-
Hi there,
I'll answer these one at a time as there are a few responses to go through.
default.htm is the home page as created by the CMS, but you want to either use that URL or www.cheaptubes.com as the home page, not both.
The solution is a 301 or the canonical tag so that home page content does not appear on both URLs.
-
Hi Jane, Oleg, & Bryan
I checked with the woman who designed my theme (she is awesome). She offered the following suggestions which seem like the way to go for me. Are there any negatives that I'm not aware of with the options below?
Since you are still using FrontPage, just open your site, locate the appropriate pages, and type the following into the head area:
If you are on a Windows server, your web host can do the 301 redirect for you. You will tell them the name of the old pages and the name of the new pages and they will do the rest.
An easy alternate is for you to do the redirect yourself with an easy tag that goes into the head area of the old pages. This tag is called a redirect and redirects from the old page to the new one.
URL="http://www.newsite.com/newurl.html">
Google, Bing, and Yahoo all recognize the meta tag for the redirect and will adjust their indexing accordingly. I will usually leave an old page on the server for about 3 months to give the search engines time to catch up. Then I can delete the page.
You can, of course, get more "bang for your buck" by using both the canonical link and the meta refresh at the same time.
URL="http://www.newsite.com/newurl.html">
I like the last one, am going to try that unless you think its a flawed strategy.
Thanks for your help
Mike
-
Hi Jane
How do I change to canonical url's if I can't do a 301?
Mike
-
so how do I use the canonical tag since i can't 301 it?
-
It certainly could. Google sees the www. version as a 2nd website, so essentially you're splitting your 'ranking authority' between 2 webpages.
-
Thanks Oleg
I can't 301 because I'm not using apache, still on frontpage. I know its old, getting out my abacus now : )
-
To sum up...
- 301 redirect all non-www urls to www versions (since it has a higher page authority) and add canonicals to all pages with the www version of the url
- For all lower case / upper case page duplicates... pick one, set a canonical tag and 301 to the chosen case, make sure all your links point to the correct url case.
- 301 redirect default.htm to your root domain - http://www.cheaptubes.com
-
does the 2 versions problem help to explain why my sales started dropping significantly after the google july 4th update? I know there were some SEO penalties in that update. I also know a friendly competitor who saw a similar drop starting in early July.
-
Hi Jane
Thank you so much. I am reviewing the link you provided. I don't think I can 301 redirect because it is done in front page, not apache. I have tried for years to find another platform but failed. I spent years trying to figure out drupal, even ordered several books but no luck. I tried concrete 5 and just using HTML 5 editor like coffee cup. I keep struggling with getting them to work. I've bought themes to use but can't get them operational.
I thought default.htm was supposed to be the home page, is that incorrect?
Mike
-
Hi again,
Yep - your non-www and www pages are both resolving... e.g. http://cheaptubes.com/ and http://www.cheaptubes.com/ bring up the same content. Also, http://cheaptubes.com/default.htm and http://www.cheaptubes.com/default.htm is also a duplicate of the home page.
Internally, I am seeing the same thing, e.g. http://www.cheaptubes.com/carbon-nanotubes-prices.htm and http://cheaptubes.com/carbon-nanotubes-prices.htm - same page, one on the www subdomain ("www." is a subdomain like any other, just with an extremely common name) and one just sitting on the root.
The solution here is either to 301 redirect the non-www version of the site to the www version for every page, or to use the canonical tag to point from the non-preferred versions to the "canonical" versions. More information on this is available here.
You also have a situation where upper-case URLs will resolve as well as lower case ones, e.g. http://www.cheaptubes.com/SWNTs.htm and http://www.cheaptubes.com/swnts.htm (as well as http://cheaptubes.com/swnts.htm!).
URLs should only be allowed to resolve with one case, preferably lower. The upper / mixed case should 301 redirect to the proper version.
Essentially, the "two versions of the site" issue is the biggest problem, with all pages being available on at least two URLs - one with www and one without. There are other tidiness issues like /default.htm bringing up the home page as well.
Does this make sense? Let me know if this is not clear.
Best,
Jane
-
also on a page that moz ranks as an "F", I still rank high in organic results, see the results from when I searched for MWNTs below, I was 1st organic result. If long form, multi walled carbon nanotubes I fell to 6th or 7th but still on the first page.
-
Thank you Oleg, Bryan, & Jane. I am a rookie when it comes to web development but my pages always ranked well because enough Moz tips sunk in. See the alert from last weeks crawl below.
Pages with High Priority Issues
98Duplicate Page Content24XX (Client Error)If we look at the home page, it has 3 URLs, see belowURLPage AuthorityLinking Root DomainsExternal Link CountInternal Link CountStatus CodeDuplicate URLsDownload Duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com3322611882003 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://www.cheaptubes.com/default.htm2422622001 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com2931502002 of 3 duplicates
cheaptubes.com the source for carbon nanotubes home page
http://cheaptubes.com/default.htm2410462003 of 3 duplicates Does this help?-Mike
-
Hi there,
This could definitely be a case of both non-www and www URLs resolving, but I'd like to echo the guys above me and ask for more information - if you could share the actual examples, either on here or in a private message, it would be easier to find why Moz has found twice the number of URLs your site should have.
Thanks,
Jane
-
+Really need more information
If you have URLs constructed dynamically depending on where the user navigates from this could also be an issue, but I would expect more than 2x the pages.
-
Could you share more details?
What do the duplicate content examples look like? http vs https? www. vs non-www?
If the content is replicated on 2 domains, yes that is duplicate content and you should consolidate to one site via 301 redirects.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Duplicate content with tagging and categories
Hello, Moz is showing that a site has duplicate content - which appears to be because of tags and categories. It is a relatively new site, with only a few blog publications so far. This means that the same articles are displayed under a number of different tags and categories... Is this something I should worry about, or just wait until I have more content? The 'tag' and 'category' pages are not really pages I would expect or aim for anyone to find in google results anyway. Would be glad to here any advice / opinions on this Thanks!
On-Page Optimization | | wearehappymedia1 -
Duplicate Content Re: Product listing body copy on Website, Amazon & Ebay - issues ?
Hi Is it ok to have identical product body copy on market/platform listings same as the websites product listings ? In this case the products are the websites/own brand products (all pages canonicalised), so i take it shouldn't cause any issues or are you supposed to differentiate the product body copy on marketplace listings ? Im asking re seo reasons All Best Dan
On-Page Optimization | | Dan-Lawrence0 -
Duplicate Content - Blog Rewriting
I have a client who has requested a rewrite of 250 blog articles for his IT company. The blogs are dispersed on a variety of platforms: his own website's blog, a business innovation website, and an IT website. He wants to have each article optimised with keyword phrases and then posted onto his new website thrice weekly. All of this is in an effort to attract some potential customers to his new site and also to establish his company as a leader in its field. To what extent would I need to rewrite each article so as to avoid duplicating the content? Would there even be an issue if I did not rewrite the articles and merely optimised them with keywords? Would the articles need to be completely taken by all current publishers? Any advice would be greatly appreciated.
On-Page Optimization | | StoryScout0 -
Duplicate content on partner site
I have a trade partner who will be using some of our content on their site. What's the best way to prevent any duplicate content issues? Their plan is to attribute the content to us using rel=author tagging. Would this be sufficient or should I request that they do something else too? Thanks
On-Page Optimization | | ShearingsGroup0 -
Duplicate Content on Event Pages
My client has a pretty popular service of event listings and, in hope of gathering more events, they opened up the platform to allow users to add events. This works really well for them and they are able to garner a lot more events this way. The major problem I'm finding is that many event coordinators and site owners will take the copy from their website and copy and paste it, duplicating a lot of the content. We have editor picks that contain a lot of unique content but the duplicate content scares me. It hasn't hurt our page ranking (we have a page ranking of 7) but I'm wondering if this is something that we should address. We don't have the manpower to eliminate all the duplication but if we cut down the duplication would we experience a significant advantage over people posting the same event?
On-Page Optimization | | mattdinbrooklyn0 -
Does schema.org assist with duplicate content concerns
The issue of duplicate content has been well documented and there are lots of articles suggesting to noindex archive pages in WordPress powered sites. Schema.org allows us to mark-up our content, including marking a components URL. So my question simply, is no-indexing archive (category/tag) pages still relevant when considering duplicate content? These pages are in essence a list of articles, which can be marked as an article or blog posting, with the url of the main article and all the other cool stuff the scheme gives us. Surely Google et al are smart enough to recognise these article listings as gateways to the main content, therefore removing duplicate content concerns. Of course, whether or not doing this is a good idea will be subjective and based on individual circumstances - I'm just interested in whether or not the search engines can handle this appropriately.
On-Page Optimization | | MarkCA0 -
Is content aggregation good SEO?
I didn't see this topic specifically addressed here: what's the current thinking on using content aggregation for SEO purposes? I'll use flavors.me as an example. Flavors.me lets you set up a domain that pulls in content from a variety of services (Twitter, YouTube, Flickr, RSS, etc.). There's also a limited ability to publish unique content as well. So let's say that we've got MyDomain.com set up, and most of the content is being drawn in from other services. So there's blog posts from WordPress.com, videos from YouTube, a photo gallery from Flickr, etc. How would Google look at this scenario? Is MyDomain.com simply scraped content from the other (more authoritative) sources? Is the aggregated content perceived to "belong" to MyDomain.com or not? And most importantly, if you're aggregating a lot of content related to Topic X, will this content aggregation help MyDomain.com rank for Topic X? Looking forward to the community's thoughts. Thanks!
On-Page Optimization | | GOODSIR0 -
Sliders and Content Above the Fold
I was just inspecting a wire frame that is going out to a client and realized that the slider may interfere with the "content above the fold." Can't believe this had not struck me on others. If the Header has basic business info, etc. in it and you place a slider to display images in the area just beneath the Header or slightly down from it, does that decrease the amount of content seen a being above the fold? Or, is content above the fold established by virtue of H1,2, 3, etc.?
On-Page Optimization | | RobertFisher0