• seohunters9

        See all notifications

        Skip to content
        Moz logo Menu open Menu close
        • Products
          • Moz Pro
          • Moz Pro Home
          • Moz Local
          • Moz Local Home
          • STAT
          • Moz API
          • Moz API Home
          • Compare SEO Products
          • Moz Data
        • Free SEO Tools
          • Domain Analysis
          • Keyword Explorer
          • Link Explorer
          • Competitive Research
          • MozBar
          • More Free SEO Tools
        • Learn SEO
          • Beginner's Guide to SEO
          • SEO Learning Center
          • Moz Academy
          • MozCon
          • Webinars, Whitepapers, & Guides
        • Blog
        • Why Moz
          • Digital Marketers
          • Agency Solutions
          • Enterprise Solutions
          • Small Business Solutions
          • The Moz Story
          • New Releases
        • Log in
        • Log out
        • Products
          • Moz Pro

            Your all-in-one suite of SEO essentials.

          • Moz Local

            Raise your local SEO visibility with complete local SEO management.

          • STAT

            SERP tracking and analytics for enterprise SEO experts.

          • Moz API

            Power your SEO with our index of over 44 trillion links.

          • Compare SEO Products

            See which Moz SEO solution best meets your business needs.

          • Moz Data

            Power your SEO strategy & AI models with custom data solutions.

          Let your business shine with Listings AI
          Moz Local

          Let your business shine with Listings AI

          Learn more
        • Free SEO Tools
          • Domain Analysis

            Get top competitive SEO metrics like DA, top pages and more.

          • Keyword Explorer

            Find traffic-driving keywords with our 1.25 billion+ keyword index.

          • Link Explorer

            Explore over 40 trillion links for powerful backlink data.

          • Competitive Research

            Uncover valuable insights on your organic search competitors.

          • MozBar

            See top SEO metrics for free as you browse the web.

          • More Free SEO Tools

            Explore all the free SEO tools Moz has to offer.

          NEW Keyword Suggestions by Topic
          Moz Pro

          NEW Keyword Suggestions by Topic

          Learn more
        • Learn SEO
          • Beginner's Guide to SEO

            The #1 most popular introduction to SEO, trusted by millions.

          • SEO Learning Center

            Broaden your knowledge with SEO resources for all skill levels.

          • On-Demand Webinars

            Learn modern SEO best practices from industry experts.

          • How-To Guides

            Step-by-step guides to search success from the authority on SEO.

          • Moz Academy

            Upskill and get certified with on-demand courses & certifications.

          • MozCon

            Save on Early Bird tickets and join us in London or New York City

          Unlock flexible pricing & new endpoints
          Moz API

          Unlock flexible pricing & new endpoints

          Find your plan
        • Blog
        • Why Moz
          • Digital Marketers

            Simplify SEO tasks to save time and grow your traffic.

          • Small Business Solutions

            Uncover insights to make smarter marketing decisions in less time.

          • Agency Solutions

            Earn & keep valuable clients with unparalleled data & insights.

          • Enterprise Solutions

            Gain a competitive edge in the ever-changing world of search.

          • The Moz Story

            Moz was the first & remains the most trusted SEO company.

          • New Releases

            Get the scoop on the latest and greatest from Moz.

          Surface actionable competitive intel
          New Feature

          Surface actionable competitive intel

          Learn More
        • Log in
          • Moz Pro
          • Moz Local
          • Moz Local Dashboard
          • Moz API
          • Moz API Dashboard
          • Moz Academy
        • Avatar
          • Moz Home
          • Notifications
          • Account & Billing
          • Manage Users
          • Community Profile
          • My Q&A
          • My Videos
          • Log Out

        The Moz Q&A Forum

        • Forum
        • Questions
        • My Q&A
        • Users
        • Ask the Community

        Welcome to the Q&A Forum

        Browse the forum for helpful insights and fresh discussions about all things SEO.

        1. Home
        2. SEO Tactics
        3. Technical SEO
        4. Duplicate content and http and https

        Moz Q&A is closed.

        After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

        Duplicate content and http and https

        Technical SEO
        10
        16
        34827
        Loading More Posts
        • Watching

          Notify me of new replies.
          Show question in unread.

        • Not Watching

          Do not notify me of new replies.
          Show question in unread if category is not ignored.

        • Ignoring

          Do not notify me of new replies.
          Do not show question in unread.

        • Oldest to Newest
        • Newest to Oldest
        • Most Votes
        Reply
        • Reply as question
        Locked
        This topic has been deleted. Only users with question management privileges can see it.
        • hawkvt1
          hawkvt1 last edited by

          Within my Moz crawl report, I have a ton of duplicate content caused by identical pages due to identical pages of http and https URL's.

          For example:

          http://www.bigcompany.com/accomodations

          https://www.bigcompany.com/accomodations

          The strange thing is that 99% of these URL's are not sensitive in nature and do not require any security features.  No credit card information, booking, or carts.  The web developer cannot explain where these extra URL's came from or provide any further information.

          Advice or suggestions are welcome!  How do I solve this issue?

          THANKS MOZZERS

          1 Reply Last reply Reply Quote 0
          • Dr-Pete
            Dr-Pete Staff @GrowthLedge last edited by

            Hard to tell without knowing the site, but it's possible there are external links to "https" versions of the pages. At this point, Google is going to increase the pressure to secure sites, and later this year Chrome will start warning users about all non-secure pages, so it may be worth making the move.

            1 Reply Last reply Reply Quote 0
            • GrowthLedge
              GrowthLedge last edited by

              I'm reading this response and this is happening on my site as well.  How did this happen in the first place?  I have duplicate content because of https and http copies of all my web pages.  If I type https://www.mywebsite.com I can't get to my site.  Could this be coming from my hosting company?  I've set up my site to simply be http://www.mywebsite.com.  I'm a little worried to change my robots.txt and I would love to know how this happened in the first place.

              Dr-Pete 1 Reply Last reply Reply Quote 0
              • Dr-Pete
                Dr-Pete Staff @ajiabs last edited by

                If Google detects both http: and https: versions, they've started to automatically pick the https: version, but that's not consistent yet. In general, I think it's still important to set strong canonicalization signals. Google still separates your http: and https: sites in Google Search Console, too, so even they haven't quite made up their minds.

                In general, Google is pushing sites toward https:, but that's a somewhat complex decision that depends on more than just SEO. If you're using https: and the https: URLs are indexed, then you should treat those as canonical and suppress the http: URLs, in most cases.

                1 Reply Last reply Reply Quote 0
                • ajiabs
                  ajiabs Subscriber last edited by

                  Hate to respond to a 3 year old thread. But does this solution needs to be updated?

                  Is there any change in response now, as Google is favoring https for most pages. Does google still consider http and https as two different sites? If so which one should be suppressed - http or https?

                  Aji

                  Dr-Pete 1 Reply Last reply Reply Quote 0
                  • GTGshops
                    GTGshops @Dr-Pete last edited by

                    Hi,

                    I'm still having problems with redirecting. I only have 1 duplicate page with https and http, that I want to redirect but it's the homepage.

                    i want to redirect: https://www.domain.com to http://www.domain.com

                    But keep the rest of the pages the same (half http and the other half https).

                    How do i do this?

                    1 Reply Last reply Reply Quote 0
                    • hawkvt1
                      hawkvt1 @hawkvt1 last edited by

                      Anytime Rand!  I only have two simple rules:

                      1.  Talking business on ski days is not allowed

                      2.  Entry into Vermont requires a pound of Seattle's best french roast coffee.  In return, you  receive some fantastic Vermont maple syrup.

                      Simple rules to live by LOL

                      Thanks again for all of your help...

                      Peter

                      1 Reply Last reply Reply Quote 5
                      • randfish
                        randfish @hawkvt1 last edited by

                        Thanks dude! If I make it to Vermont, I might look you up 🙂

                        1 Reply Last reply Reply Quote 2
                        • hawkvt1
                          hawkvt1 @JamesNorquay last edited by

                          Thanks James..

                          Sorry, I was using Big Company as an example and just being generic.

                          The real URL if interested is www.hawkresort.com

                          1 Reply Last reply Reply Quote 0
                          • hawkvt1
                            hawkvt1 @Dr-Pete last edited by

                            I would personally like to thank everyone that responded with an answer.  Man O Man, the best part of belonging to SEOMOZ is the community forum.  It's incredibly valuable, being able to ask a question and reach out to such talent as all of you.

                            If anyone ever gets up to Killington or Okemo skiing, the beer is on me!  I live right between both ski areas, about 8 miles to either mountain..

                            Thanks again.

                            randfish hawkvt1 2 Replies Last reply Reply Quote 3
                            • Dr-Pete
                              Dr-Pete Staff last edited by

                              I think Harald and James covered the bases here, but a couple of comments on Harald's reply:

                              (1) Definitely check this. A common cause of indexed https: pages is that a secure section of your site is being crawled (like a shopping cart), and you're using relative navigation links (like ) - when a crawler or visitor hits the nav link from a secure page, the relative link grabs the https: In most cases, you may want to NOINDEX secure pages. Shopping carts and checkout pages have no business in the search index, IMO.

                              [(2)-(5) I believe this does work, but it's very tricky, so please be careful. If anyone has linked to the https: pages, you'll lose the link-juice this way (you'll just cut those pages off). I honestly don't think it's a good choice for most sites.

                              (8) I actually believe the 301-redirect is simpler in most cases.

                              As James said, sitewide canonical tags (or on the affect pages, if they're isolated) will also work.](/contact.php)

                              hawkvt1 GTGshops 2 Replies Last reply Reply Quote 3
                              • mediabase
                                mediabase @Kotkov last edited by

                                Hi Serge, I came to know about the "robots_ssl.txt" from the website  http://www.seoworkers.com/seo-articles-tutorials/robots-and-https.html

                                1 Reply Last reply Reply Quote 0
                                • GKLA
                                  GKLA last edited by

                                  I would check your server for a https folder.

                                  add a robots.txt file in the root of the https folder:

                                  User-agent: *
                                  Disallow:/

                                  My guess is that the spider is following a link somewhere within your site that links to a https:// url.  The spider is than re-indexing the entire site using https://

                                  My 2 cents for what its worth.

                                  1 Reply Last reply Reply Quote 0
                                  • Kotkov
                                    Kotkov last edited by

                                    Harald, " robots_ssl.txt " where did you get that?

                                    mediabase 1 Reply Last reply Reply Quote 0
                                    • mediabase
                                      mediabase last edited by

                                      Hello Hawkvt1, Fisrt of all I want to tell you that  the protocols (http/https) are different, they are considered two separate sites, so there’s a good chance to get penalized for duplicate content. If the search engine discovers two identical pages, generally it would take the page it saw first and ignore the other pages.The solutions are described below:

                                      S__olutions:

                                      1. Be smart about the site structure:  to keep the engines from crawling and indexing HTTPS pages, structure the website so that HTTPs are only accessible through a form submission (log-in, sign-up, or payment pages). The common mistake is making these pages available via a standard link (happens when you are either ignorant or  not aware that the secure version of the site is being crawled and indexed).
                                      2. Use Robots.txt file to control which pages will be crawled and indexed
                                      3. Use.htaccess file. Here’s how to do this:
                                      4. Create a file names robots_ssl.txt in your root.
                                      5. Add the following code to your .htaccessRewriteCond %{SERVER_PORT} 443 [NC]RewriteRule ^robots.txt$ robots_ssl.txt [L]
                                      6. Remove yourdomain.com:443 from the webmaster tools if the pages have already been crawled
                                      7. For dynamic pages like php, try< ?phpif ($_SERVER["SERVER_PORT"] == 443){echo “< meta name=” robots ” content=” noindex,nofollow ” > “;}?>
                                      8. Dramatic solution (may not always be possible): 301 redirect the HTTPS pages to the HTTP pages – with hopes that the link juice will transfer over.

                                      For more information please refer to this link :

                                      http://www.seomoz.org/ugc/solving-duplicate-content-issues-with-http-and-https

                                      I'm sure that your problem is solved.

                                      1 Reply Last reply Reply Quote 9
                                      • JamesNorquay
                                        JamesNorquay last edited by

                                        You could implement the canonical tag onto the HTTP version of the website.

                                        Another problem when having a quick look at this website is that all your title tags are the same with the brand term at the front, this is not advisable at all you want to put the brand term at the end of the title and your generic terms first.

                                        I would look at getting an SEO audit done to fix the issues with the website.

                                        hawkvt1 1 Reply Last reply Reply Quote 1
                                        • 1 / 1
                                        • First post
                                          Last post

                                        Browse Questions

                                        Explore more categories

                                        • Moz Tools

                                          Chat with the community about the Moz tools.

                                        • SEO Tactics

                                          Discuss the SEO process with fellow marketers

                                        • Community

                                          Discuss industry events, jobs, and news!

                                        • Digital Marketing

                                          Chat about tactics outside of SEO

                                        • Research & Trends

                                          Dive into research and trends in the search industry.

                                        • Support

                                          Connect on product support and feature requests.

                                        • See all categories

                                        Related Questions

                                        • SAIM_Marketing

                                          Duplicate Content and Subdirectories

                                          duplicate content subdirectory directories

                                          Hi there and thank you in advance for your help! I'm seeking guidance on how to structure a resources directory (white papers, webinars, etc.) while avoiding duplicate content penalties. If you go to /resources on our site, there is filter function. If you filter for webinars, the URL becomes /resources/?type=webinar We didn't want that dynamic URL to be the primary URL for webinars, so we created a new page with the URL /resources/webinar that lists all of our webinars and includes a featured webinar up top. However, the same webinar titles now appear on the /resources page and the /resources/webinar page. Will that cause duplicate content issues? P.S. Not sure if it matters, but we also changed the URLs for the individual resource pages to include the resource type. For example, one of our webinar URLs is /resources/webinar/forecasting-your-revenue Thank you!

                                          Technical SEO | | SAIM_Marketing
                                          0
                                        • davedon

                                          How to change 302 redirect from http to https

                                          Hi gang. Our site currently has a 302 redirect from the HTTP version of the homepage to the HTTPS version of the homepage. I understand this really should be changed to a 301 redirect but I'm having a little trouble figuring out exactly how this should be done. Some places on the internet are telling me I can edit our htaccess file to specify the type of redirect, however our htaccess file seems to be missing some of the information in theirs. Can anyone tell me what needs to be changed in the htaccess file - or if there's a simpler way to change the 302 to a 301? Many thanks 🙂 htaccess: BEGIN WordPress RewriteEngine On RewriteBase / RewriteRule ^index.php$ - [L] RewriteCond %{REQUEST_FILENAME} !-f RewriteCond %{REQUEST_FILENAME} !-d RewriteRule . /index.php [L] END WordPress EXPIRES CACHING ExpiresActive On ExpiresByType image/jpg "access plus 6 months" ExpiresByType image/jpeg "access plus 6 months" ExpiresByType image/gif "access plus 6 months" ExpiresByType image/png "access plus 6 months" ExpiresByType text/css "access plus 10 days" ExpiresByType application/pdf "access plus 10 days" ExpiresByType application/x-shockwave-flash "access plus 10 days" ExpiresByType image/x-icon "access plus 6 months" ExpiresDefault "access plus 2 days" EXPIRES CACHING

                                          Technical SEO | | davedon
                                          0
                                        • Lina500

                                          How does Google view duplicate photo content?

                                          Now that we can search by image on Google and see every site that is using the same photo, I assume that Google is going to use this as a signal for ranking as well. Is that already happening? I ask because I have sold many photos over the years with first-use only rights, where I retain the copyright. So I have photos on my site that I own the copyright for that are on other sites (and were there first). I am not sure if I should make an effort to remove these photos from my site or if I can wait another couple years.

                                          Technical SEO | | Lina500
                                          0
                                        • TIM_DOTCOM

                                          Handling of Duplicate Content

                                          I just recently signed and joined the a-moz.groupbuyseo.org system. During the initial report for our web site it shows we have lots of duplicate content. The web site is real estate based and we are loading IDX listings from other brokerages into our site. If though these listings look alike, they are not. Each has their own photos, description and addresses. So why are they appear as duplicates – I would assume that they are all too closely related. Lots for Sale primarily – and it looks like lazy agents have 4 or 5 lots and input the description the same. Unfortunately for us, part of the IDX agreement is that you cannot pick and choose which listings to load and you cannot change the content. You are either all in or you cannot use the system. How should one manage duplicate content like this? Or should we ignore it? Out of 1500+ listings on our web site it shows 40 of them are duplicates.

                                          Technical SEO | | TIM_DOTCOM
                                          0
                                        • zeepartner

                                          Robots.txt on http vs. https

                                          We recently changed our domain from http to https. When a user enters any URL on http, there is an global 301 redirect to the same page on https. I cannot find instructions about what to do with robots.txt. Now that https is the canonical version, should I block the http-Version with robots.txt? Strangely, I cannot find a single ressource about this...

                                          Technical SEO | | zeepartner
                                          0
                                        • ashishb01

                                          Which Sitemap to keep - Http or https (or both)

                                          Hi, Just finished upgrading my site to the ssl version (like so many other webmasters now that it may be a ranking factor). FIxed all links, CDN links are now secure, etc and 301 Redirected all pages from http to https. Changed property in Google Analytics from http to https and added https version in Webmaster Tools. So far, so good. Now the question is should I add the https version of the sitemap in the new HTTPS site in webmasters or retain the existing http one? Ideally switching over completely to https version by adding a new sitemap would make more sense as the http version of the sitemap would anyways now be re-directed to HTTPS. But the last thing i can is to get penalized for duplicate content. Could you please suggest as I am still a rookie in this department. If I should add the https sitemap version in the new site, should i delete the old http one or no harm retaining it.

                                          Technical SEO | | ashishb01
                                          0
                                        • DHS_SH

                                          Duplicate Content Issues on Product Pages

                                          Hi guys Just keen to gauge your opinion on a quandary that has been bugging me for a while now. I work on an ecommerce website that sells around 20,000 products. A lot of the product SKUs are exactly the same in terms of how they work and what they offer the customer. Often it is 1 variable that changes. For example, the product may be available in 200 different sizes and 2 colours (therefore 400 SKUs available to purchase). Theese SKUs have been uploaded to the website as individual entires so that the customer can purchase them, with the only difference between the listings likely to be key signifiers such as colour, size, price, part number etc. Moz has flagged these pages up as duplicate content. Now I have worked on websites long enough now to know that duplicate content is never good from an SEO perspective, but I am struggling to work out an effective way in which I can display such a large number of almost identical products without falling foul of the duplicate content issue. If you wouldnt mind sharing any ideas or approaches that have been taken by you guys that would be great!

                                          Technical SEO | | DHS_SH
                                          0
                                        • ericmccarty

                                          How much to change to avoid duplicate content?

                                          Working on a site for a dentist.  They have a long list of services that they want us to flesh out with text.  They provided a bullet list of services, we're trying to get 1 to 2 paragraphs of text for each. Obviously, we're not going to write this off the top of our heads.  We're pulling text from other sources and trying to rework. The question is, how much rephrasing do we have to do to avoid a duplicate content penalty?  Do we make sure there are changes per paragraph, sentence, or phrase? Thanks! Eric

                                          Technical SEO | | ericmccarty
                                          0

                                        Get started with Moz Pro!

                                        Unlock the power of advanced SEO tools and data-driven insights.

                                        Start my free trial
                                        Products
                                        • Moz Pro
                                        • Moz Local
                                        • Moz API
                                        • Moz Data
                                        • STAT
                                        • Product Updates
                                        Moz Solutions
                                        • SMB Solutions
                                        • Agency Solutions
                                        • Enterprise Solutions
                                        • Digital Marketers
                                        Free SEO Tools
                                        • Domain Authority Checker
                                        • Link Explorer
                                        • Keyword Explorer
                                        • Competitive Research
                                        • Brand Authority Checker
                                        • Local Citation Checker
                                        • MozBar Extension
                                        • MozCast
                                        Resources
                                        • Blog
                                        • SEO Learning Center
                                        • Help Hub
                                        • Beginner's Guide to SEO
                                        • How-to Guides
                                        • Moz Academy
                                        • API Docs
                                        About Moz
                                        • About
                                        • Team
                                        • Careers
                                        • Contact
                                        Why Moz
                                        • Case Studies
                                        • Testimonials
                                        Get Involved
                                        • Become an Affiliate
                                        • MozCon
                                        • Webinars
                                        • Practical Marketer Series
                                        • MozPod
                                        Connect with us

                                        Contact the Help team

                                        Join our newsletter
                                        Moz logo
                                        © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                                        • Accessibility
                                        • Terms of Use
                                        • Privacy

                                        Looks like your connection to Moz was lost, please wait while we try to reconnect.