• seohunters9

        See all notifications

        Skip to content
        Moz logo Menu open Menu close
        • Products
          • Moz Pro
          • Moz Pro Home
          • Moz Local
          • Moz Local Home
          • STAT
          • Moz API
          • Moz API Home
          • Compare SEO Products
          • Moz Data
        • Free SEO Tools
          • Domain Analysis
          • Keyword Explorer
          • Link Explorer
          • Competitive Research
          • MozBar
          • More Free SEO Tools
        • Learn SEO
          • Beginner's Guide to SEO
          • SEO Learning Center
          • Moz Academy
          • MozCon
          • Webinars, Whitepapers, & Guides
        • Blog
        • Why Moz
          • Digital Marketers
          • Agency Solutions
          • Enterprise Solutions
          • Small Business Solutions
          • The Moz Story
          • New Releases
        • Log in
        • Log out
        • Products
          • Moz Pro

            Your all-in-one suite of SEO essentials.

          • Moz Local

            Raise your local SEO visibility with complete local SEO management.

          • STAT

            SERP tracking and analytics for enterprise SEO experts.

          • Moz API

            Power your SEO with our index of over 44 trillion links.

          • Compare SEO Products

            See which Moz SEO solution best meets your business needs.

          • Moz Data

            Power your SEO strategy & AI models with custom data solutions.

          Let your business shine with Listings AI
          Moz Local

          Let your business shine with Listings AI

          Learn more
        • Free SEO Tools
          • Domain Analysis

            Get top competitive SEO metrics like DA, top pages and more.

          • Keyword Explorer

            Find traffic-driving keywords with our 1.25 billion+ keyword index.

          • Link Explorer

            Explore over 40 trillion links for powerful backlink data.

          • Competitive Research

            Uncover valuable insights on your organic search competitors.

          • MozBar

            See top SEO metrics for free as you browse the web.

          • More Free SEO Tools

            Explore all the free SEO tools Moz has to offer.

          NEW Keyword Suggestions by Topic
          Moz Pro

          NEW Keyword Suggestions by Topic

          Learn more
        • Learn SEO
          • Beginner's Guide to SEO

            The #1 most popular introduction to SEO, trusted by millions.

          • SEO Learning Center

            Broaden your knowledge with SEO resources for all skill levels.

          • On-Demand Webinars

            Learn modern SEO best practices from industry experts.

          • How-To Guides

            Step-by-step guides to search success from the authority on SEO.

          • Moz Academy

            Upskill and get certified with on-demand courses & certifications.

          • MozCon

            Save on Early Bird tickets and join us in London or New York City

          Unlock flexible pricing & new endpoints
          Moz API

          Unlock flexible pricing & new endpoints

          Find your plan
        • Blog
        • Why Moz
          • Digital Marketers

            Simplify SEO tasks to save time and grow your traffic.

          • Small Business Solutions

            Uncover insights to make smarter marketing decisions in less time.

          • Agency Solutions

            Earn & keep valuable clients with unparalleled data & insights.

          • Enterprise Solutions

            Gain a competitive edge in the ever-changing world of search.

          • The Moz Story

            Moz was the first & remains the most trusted SEO company.

          • New Releases

            Get the scoop on the latest and greatest from Moz.

          Surface actionable competitive intel
          New Feature

          Surface actionable competitive intel

          Learn More
        • Log in
          • Moz Pro
          • Moz Local
          • Moz Local Dashboard
          • Moz API
          • Moz API Dashboard
          • Moz Academy
        • Avatar
          • Moz Home
          • Notifications
          • Account & Billing
          • Manage Users
          • Community Profile
          • My Q&A
          • My Videos
          • Log Out

        The Moz Q&A Forum

        • Forum
        • Questions
        • My Q&A
        • Users
        • Ask the Community

        Welcome to the Q&A Forum

        Browse the forum for helpful insights and fresh discussions about all things SEO.

        1. Home
        2. SEO Tactics
        3. Intermediate & Advanced SEO
        4. PDF for link building - avoiding duplicate content

        Moz Q&A is closed.

        After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

        PDF for link building - avoiding duplicate content

        Intermediate & Advanced SEO
        4
        14
        3138
        Loading More Posts
        • Watching

          Notify me of new replies.
          Show question in unread.

        • Not Watching

          Do not notify me of new replies.
          Show question in unread if category is not ignored.

        • Ignoring

          Do not notify me of new replies.
          Do not show question in unread.

        • Oldest to Newest
        • Newest to Oldest
        • Most Votes
        Reply
        • Reply as question
        Locked
        This topic has been deleted. Only users with question management privileges can see it.
        • BobGW
          BobGW last edited by

          Hello,

          We've got an article that we're turning into a PDF. Both the article and the PDF will be on our site. This PDF is a good, thorough piece of content on how to choose a product.

          We're going to strip out all of the links to our in the article and create this PDF so that it will be good for people to reference and even print. Then we're going to do link building through outreach since people will find the article and PDF useful.

          My question is, how do I use rel="canonical" to make sure that the article and PDF aren't duplicate content?

          Thanks.

          1 Reply Last reply Reply Quote 0
          • Marcus_Miller
            Marcus_Miller @BobGW last edited by

            Hey Bob

            I think you should forget about any kind of perceived conventions and have whatever you think works best for your users and goals.

            Again, look at unbounce, that is a custom landing page with a homepage link (to share the love) but not the general site navigation.

            They also have a footer to do a bit more link love but really, do what works for you.

            Forget conventions - do what works!

            Hope that helps
            Marcus

            1 Reply Last reply Reply Quote 0
            • BobGW
              BobGW @BobGW last edited by

              I see, thanks! I think it's important not to have the ecommerce navigation on the page promoting the pdf. What would you say is ideal as far as the graphical and navigation components of the page with the PDF on it - what kind of navigation and graphical header should I have on it?

              1 Reply Last reply Reply Quote 0
              • Marcus_Miller
                Marcus_Miller @BobGW last edited by

                Yep, check the HTTP headers with webbug or there are a bunch of browser plugins that will let you see the headers for the document.

                That said, I would push to drive the links to the page though rather than the document itself and just create a nice page that houses the document and make that the link target.

                You could even make the PDF link only available by email once they have singed up or some such as canonical is only a directive and you would still be better getting those links flooding into a real page on the site.

                You could even offer up some HTML to make this easier for folks to link to that linked to your main page. If you take a look at any savvy infographics etc folks will try to draw a link into a page rather than the image itself for the very same reasons.

                If you look at something like the Noobs Guide to Online Marketing from Unbounce then you will see something like this as the suggested linking code:

                [](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

                [The Noob Guide to Online Marketing - Infographic](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

                [](<strong>http://unbounce.com/noob-guide-to-online-marketing-infographic/</strong>)

                Unbounce – The DIY Landing Page Platform

                So, the image is there but the link they are pimping is a standard page:

                http://unbounce.com/noob-guide-to-online-marketing-infographic/

                They also cheekily add an extra homepage link in as well with some keywords and the brand so if folks don't remove that they still get that benefit.

                Ultimately, it means that when links flood into the site they benefit the whole site rather than just promote one PDF.

                Just my tuppence! 
                Marcus

                1 Reply Last reply Reply Quote 0
                • BobGW
                  BobGW @Marcus_Miller last edited by

                  Thanks for the code Marcus.

                  Actually, the pdf is what people will be linking to. It's a guide for websites. I think the PDF will be much easier to promote than the article.I assume so anyway.

                  Is there a way to make sure my canonical code in htaccess is working after I insert the code?

                  Thanks again,

                  Bob

                  Marcus_Miller BobGW 3 Replies Last reply Reply Quote 0
                  • Marcus_Miller
                    Marcus_Miller last edited by

                    Hey Bob

                    There is a much easier way to do this and simply have your PDFs that you don't want indexed in a folder that you block access to in robots.txt. This way you can just drop PDFs into articles and link to them knowing full well these pages will not be indexed.

                    Assuming you had a PDF called article.pdf in a folder called pdfs/ then the following would prevent indexation.

                    User-agent: * Disallow: /pdfs/

                    Or to just block the file itself:

                    User-agent: *
                    Disallow: /pdfs/yourfile.pdf Additionally, There is no reason not to add the canonical link as well and if you find people are linking directly to the PDF then having this would ensure that the equity associated with those links was correctly attributed to the parent page (always a good thing).

                    Header add Link '<http: www.url.co.uk="" pdfs="" article.html="">; </http:> rel="canonical"'

                    Generally, there are better ways to block indexation than with robots.txt but in the case of PDFs, we really don't want these files indexed as they make for such poor landing pages (no navigation) and we certainly want to remove any competition or duplication between the page and the PDF so in this case, it makes for a quick, painless and suitable solution.

                    Hope that helps!
                    Marcus

                    BobGW 1 Reply Last reply Reply Quote 2
                    • BobGW
                      BobGW @BobGW last edited by

                      Thanks ThompsonPaul,

                      Say the pdf is located at

                      domain.com/pdfs/white-papers.pdf

                      and the article that I want to rank is at

                      domain.com/articles/article.html

                      do I simply add this to my htaccess file?:

                      Header add Link "<http: www.domain.com="" articles="" article.html="">; rel="canonical""</http:>

                      1 Reply Last reply Reply Quote 0
                      • ThompsonPaul
                        ThompsonPaul @BobGW last edited by

                        You can insert the canonical header link using your site's .htaccess file, Bob. I'm sure Hostgator provides access to the htaccess file through ftp (sometimes you have to turn on "show hidden files") or through the file manager built into your cPanel.

                        Check tip #2 in this recent SEOMoz blog article for specifics:
                        seomoz.org/blog/htaccess-file-snippets-for-seos

                        Just remember too - you will want to do the same kind of on-page optimization for the PDF as you do for regular pages.

                        • Give it a good, descriptive, keyword-appropriate, dash-separated file name. (essential for usability as well, since it will become the title of the icon when saved to someone's desktop)
                        • Fill out the metadata for the PDF, especially the Title and Description. In Acrobat it's under File -> Properties -> Description tab (to get the meta-description itself, you'll need to click on the Additional Metadata button)

                        I'd be tempted to build the links to the html page as much as possible as those will directly help ranking, unlike the PDF's inbound links which will have to pass their link juice through the canonical, assuming you're using it. Plus, the visitor will get a preview of the PDF's content and context from the rest of your site which which may increase trust and engender further engagement..

                        Your comment about links in the PDF got kind of muddled, but you'll definitely want to make certain there are good links and calls to action back to your website within the PDF - preferably on each page. Otherwise there's no clear "next step" for users reading the PDF back to a purchase on your site. Make sure to put Analytics tracking tags on these links so you can assess the value of traffic generated back from the PDF - otherwise the traffic will just appear as Direct in your Analytics.

                        Hope that all helps;

                        Paul

                        1 Reply Last reply Reply Quote 2
                        • BobGW
                          BobGW @BobGW last edited by

                          Can I just use htaccess?

                          See here: http://www.seomoz.org/blog/how-to-advanced-relcanonical-http-headers

                          We only have one pdf like this right now and we plan to have no more than five.

                          Say the pdf is located at

                          domain.com/pdfs/white-papers.pdf

                          and the article that I want to rank is at

                          domain.com/articles/article.pdf

                          do I simply add this to my htaccess file?:

                          Header add Link "<http: www.domain.com="" articles="" article.pdf="">; rel="canonical""</http:>

                          1 Reply Last reply Reply Quote 0
                          • BobGW
                            BobGW @BobGW last edited by

                            How do I know if I can do an HTTP header request? I'm using shared hosting through hostgator.

                            1 Reply Last reply Reply Quote 0
                            • DoRM
                              DoRM @BobGW last edited by

                              PDF seem to not rank as well as other normal webpages.  They still rank do not get me wrong, we have over 100 pdf pages that get traffic for us. The main version is really up to you, what do you want to show in the search results.  I think it would be easier to rank for a normal webpage though.  If you are doing a rel="canonical"  it will pass most of the link juice, not all but most.

                              1 Reply Last reply Reply Quote 0
                              • DoRM
                                DoRM @BobGW last edited by

                                PDF seem to not rank as well as other normal webpages.  They still rank do not get me wrong, we have over 100 pdf pages that get traffic for us. The main version is really up to you, what do you want to show in the search results.  I think it would be easier to rank for a normal webpage though.  If you are doing a rel="canonical"  it will pass most of the link juice, not all but most.

                                1 Reply Last reply Reply Quote 1
                                • BobGW
                                  BobGW @DoRM last edited by

                                  Thank you DoRM,

                                  I assume that the PDF is what I want to be the main version since that is what I'll be marketing, but I could be wrong? What if I get backlinks to both pages, will both sets of backlinks count?

                                  DoRM BobGW ThompsonPaul 6 Replies Last reply Reply Quote 0
                                  • DoRM
                                    DoRM last edited by

                                    Indicate the canonical version of a URL by responding with the Link rel="canonical" HTTP header. Addingrel="canonical" to the head section of a page is useful for HTML content, but it can't be used for PDFs and other file types indexed by Google Web Search. In these cases you can indicate a canonical URL by responding with the Link rel="canonical" HTTP header, like this (note that to use this option, you'll need to be able to configure your server):

                                    Link: <http: www.example.com="" downloads="" white-paper.pdf="">; rel="canonical"</http:> 
                                    

                                    Google currently supports these link header elements for Web Search only.

                                    You can read more her http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139394

                                    BobGW 1 Reply Last reply Reply Quote 1
                                    • 1 / 1
                                    • First post
                                      Last post

                                    Browse Questions

                                    Explore more categories

                                    • Moz Tools

                                      Chat with the community about the Moz tools.

                                    • SEO Tactics

                                      Discuss the SEO process with fellow marketers

                                    • Community

                                      Discuss industry events, jobs, and news!

                                    • Digital Marketing

                                      Chat about tactics outside of SEO

                                    • Research & Trends

                                      Dive into research and trends in the search industry.

                                    • Support

                                      Connect on product support and feature requests.

                                    • See all categories

                                    Related Questions

                                    • EdenPrez

                                      Will I be flagged for duplicate content by Google?

                                      Hi Moz community, Had a question regarding duplicate content that I can't seem to find the answer to on Google. My agency is working on a large number of franchisee websites (over 40) for one client, a print franchise, that wants a refresh of new copy and SEO. Each print shop has their own 'microsite', though all services and products are the same, the only difference being the location. Each microsite has its own unique domain. To avoid writing the same content over and over in 40+ variations, would all the websites be flagged by Google for duplicate content if we were to use the same base copy, with the only changes being to the store locations (i.e. where we mention Toronto print shop on one site may change to Kelowna print shop on another)? Since the print franchise owns all the domains, I'm wondering if that would be a problem since the sites aren't really competing with one another. Any input would be greatly appreciated. Thanks again!

                                      Intermediate & Advanced SEO | | EdenPrez
                                      0
                                    • ycnetpro101

                                      Duplicate content in Shopify - subsequent pages in collections

                                      Hello everyone! I hope an expert in this community can help me verify the canonical codes I'll add to our store is correct. Currently, in our Shopify store, the subsequent pages in the collections are not indexed by Google, however the canonical URL on these pages aren't pointing to the main collection page (page 1), e.g. The canonical URL of page 2, page 3 etc are used as canonical URLs instead of the first page of the collections. I have the canonical codes attached below, it would be much appreciated if an expert can urgently verify these codes are good to use and will solve the above issues? Thanks so much for your kind help in advance!! -----------------CODES BELOW--------------- <title><br /> {{ page_title }}{% if current_tags %} – tagged "{{ current_tags | join: ', ' }}"{% endif %}{% if current_page != 1 %} – Page {{ current_page }}{% endif %}{% unless page_title contains shop.name %} – {{ shop.name }}{% endunless %}<br /></title>
                                      {% if page_description %} {% endif %} {% if current_page != 1 %} {% else %} {% endif %}
                                      {% if template == 'collection' %}{% if collection %}
                                      {% if current_page == 1 %} {% endif %}
                                      {% if template == 'product' %}{% if product %} {% endif %}
                                      {% if template == 'collection' %}{% if collection %} {% endif %}

                                      Intermediate & Advanced SEO | | ycnetpro101
                                      0
                                    • GhillC

                                      Same site serving multiple countries and duplicated content

                                      Hello! Though I browse MoZ resources every day, I've decided to directly ask you a question despite the numerous questions (and answers!) about this topic as there are few specific variants each time: I've a site serving content (and products) to different countries built using subfolders (1 subfolder per country). Basically, it looks like this:
                                      site.com/us/
                                      site.com/gb/
                                      site.com/fr/
                                      site.com/it/
                                      etc. The first problem was fairly easy to solve:
                                      Avoid duplicated content issues across the board considering that both the ecommerce part of the site and the blog bit are being replicated for each subfolders in their own language. Correct me if I'm wrong but using our copywriters to translate the content and adding the right hreflang tags should do. But then comes the second problem: how to deal with duplicated content when it's written in the same language? E.g. /us/, /gb/, /au/ and so on.
                                      Given the following requirements/constraints, I can't see any positive resolution to this issue:
                                      1. Need for such structure to be maintained (it's not possible to consolidate same language within one single subfolders for example),
                                      2. Articles from one subfolder to another can't be canonicalized as it would mess up with our internal tracking tools,
                                      3. The amount of content being published prevents us to get bespoke content for each region of the world with the same spoken language. Given those constraints, I can't see a way to solve that out and it seems that I'm cursed to live with those duplicated content red flags right up my nose.
                                      Am I right or can you think about anything to sort that out? Many thanks,
                                      Ghill

                                      Intermediate & Advanced SEO | | GhillC
                                      0
                                    • L_M_SEO

                                      Deep linking with redirects & building SEO

                                      Hi there. I'm using deep linking with unique URL's that redirect to our website homepage or app (depending on whether the user accesses the link from an iphone or computer) as a way to track attribution and purchases. I'm wondering whether using links that redirect negatively affects our SEO? Is the homepage still building SEO rank despite the redirects? I appreciate your time & thanks for your help.

                                      Intermediate & Advanced SEO | | L_M_SEO
                                      0
                                    • nchlondon

                                      Directory with Duplicate content? what to do?

                                      Moz keeps finding loads of pages with duplicate content on my website. The problem is its a directory page to different locations. E.g if we were a clothes shop we would be listing our locations: www.sitename.com/locations/london www.sitename.com/locations/rome www.sitename.com/locations/germany The content on these pages is all the same, except for an embedded google map that shows the location of the place. The problem is that google thinks all these pages are duplicated content. Should i set a canonical link on every single page saying that www.sitename.com/locations/london is the main page? I don't know if i can use canonical links because the page content isn't identical because of the embedded map. Help would be appreciated. Thanks.

                                      Intermediate & Advanced SEO | | nchlondon
                                      0
                                    • 360eight-SEO

                                      News sites & Duplicate content

                                      Hi SEOMoz I would like to know, in your opinion and according to 'industry' best practice, how do you get around duplicate content on a news site if all news sites buy their "news" from a central place in the world? Let me give you some more insight to what I am talking about. My client has a website that is purely focuses on news. Local news in one of the African Countries to be specific. Now, what we noticed the past few months is that the site is not ranking to it's full potential. We investigated, checked our keyword research, our site structure, interlinking, site speed, code to html ratio you name it we checked it. What we did pic up when looking at duplicate content is that the site is flagged by Google as duplicated, BUT so is most of the news sites because they all get their content from the same place. News get sold by big companies in the US (no I'm not from the US so cant say specifically where it is from) and they usually have disclaimers with these content pieces that you can't change the headline and story significantly, so we do have quite a few journalists that rewrites the news stories, they try and keep it as close to the original as possible but they still change it to fit our targeted audience - where my second point comes in. Even though the content has been duplicated, our site is more relevant to what our users are searching for than the bigger news related websites in the world because we do hyper local everything. news, jobs, property etc. All we need to do is get off this duplicate content issue, in general we rewrite the content completely to be unique if a site has duplication problems, but on a media site, im a little bit lost. Because I haven't had something like this before. Would like to hear some thoughts on this. Thanks,
                                      Chris Captivate

                                      Intermediate & Advanced SEO | | 360eight-SEO
                                      0
                                    • JustinTaylor88

                                      Duplicate internal links on page, any benefit to nofollow

                                      Link spam is naturally a hot topic amongst SEO's, particularly post Penguin. While digging around forums etc, I watched a video blog from Matt Cutts posted a while ago that suggests that Google only pays attention to the first instance of a link on the page As most websites will have multiple instances of a links (header, footer and body text), is it beneficial to nofollow the additional instances of the link? Also as the first instance of a link will in most cases be within the header nav, does that then make the content link text critical or can good on page optimisation be pulled from the title attribute? I would appreciate the experiences and thoughts Mozzers thoughts on this thanks in advance!

                                      Intermediate & Advanced SEO | | JustinTaylor88
                                      0
                                    • Indexxess

                                      Link Building Ideas for a health site

                                      Hi, I am trying to rank a health related website. This is the url: www.ridpiles.com Domain age is 1 year 6 months. Done Directory submissions Blog Comments + Forum posts Done Social Bookmarks Article submissions (Not much) I have done competitor analysis. All of my competitors are just had links from directories and some link exchanges. They got links from quality sites like Yahoo dir. I know my site is far better than my competitors and has 100% unique content. I have submitted to yahoo directory inclusion, but still no luck i hadn't accepted into it. I am planning to go for a sponsered review but dont know, weather the link will be valuable for that much of money. I was left with Guest Blogging. I see this is the only option for me to build links. But i have a very tough competiton, i must compete with most reputed sites like webmd.com etc, i need to get more good links. But i cant get what other ways to get authoritative links. If Guest blogging is the only option for me, how many posts do i need to do daily? And can someone suggest me good Guest blogging sites? Anyhelp would be appreciated.

                                      Intermediate & Advanced SEO | | Indexxess
                                      0

                                    Get started with Moz Pro!

                                    Unlock the power of advanced SEO tools and data-driven insights.

                                    Start my free trial
                                    Products
                                    • Moz Pro
                                    • Moz Local
                                    • Moz API
                                    • Moz Data
                                    • STAT
                                    • Product Updates
                                    Moz Solutions
                                    • SMB Solutions
                                    • Agency Solutions
                                    • Enterprise Solutions
                                    • Digital Marketers
                                    Free SEO Tools
                                    • Domain Authority Checker
                                    • Link Explorer
                                    • Keyword Explorer
                                    • Competitive Research
                                    • Brand Authority Checker
                                    • Local Citation Checker
                                    • MozBar Extension
                                    • MozCast
                                    Resources
                                    • Blog
                                    • SEO Learning Center
                                    • Help Hub
                                    • Beginner's Guide to SEO
                                    • How-to Guides
                                    • Moz Academy
                                    • API Docs
                                    About Moz
                                    • About
                                    • Team
                                    • Careers
                                    • Contact
                                    Why Moz
                                    • Case Studies
                                    • Testimonials
                                    Get Involved
                                    • Become an Affiliate
                                    • MozCon
                                    • Webinars
                                    • Practical Marketer Series
                                    • MozPod
                                    Connect with us

                                    Contact the Help team

                                    Join our newsletter
                                    Moz logo
                                    © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
                                    • Accessibility
                                    • Terms of Use
                                    • Privacy

                                    Looks like your connection to Moz was lost, please wait while we try to reconnect.