Skip to content
    Moz logo Menu open Menu close
    • Products
      • Moz Pro
      • Moz Pro Home
      • Moz Local
      • Moz Local Home
      • STAT
      • Moz API
      • Moz API Home
      • Compare SEO Products
      • Moz Data
    • Free SEO Tools
      • Domain Analysis
      • Keyword Explorer
      • Link Explorer
      • Competitive Research
      • MozBar
      • More Free SEO Tools
    • Learn SEO
      • Beginner's Guide to SEO
      • SEO Learning Center
      • Moz Academy
      • MozCon
      • Webinars, Whitepapers, & Guides
    • Blog
    • Why Moz
      • Digital Marketers
      • Agency Solutions
      • Enterprise Solutions
      • Small Business Solutions
      • The Moz Story
      • New Releases
    • Log in
    • Log out
    • Products
      • Moz Pro

        Your all-in-one suite of SEO essentials.

      • Moz Local

        Raise your local SEO visibility with complete local SEO management.

      • STAT

        SERP tracking and analytics for enterprise SEO experts.

      • Moz API

        Power your SEO with our index of over 44 trillion links.

      • Compare SEO Products

        See which Moz SEO solution best meets your business needs.

      • Moz Data

        Power your SEO strategy & AI models with custom data solutions.

      Track AI Overviews in Keyword Research
      Moz Pro

      Track AI Overviews in Keyword Research

      Try it free!
    • Free SEO Tools
      • Domain Analysis

        Get top competitive SEO metrics like DA, top pages and more.

      • Keyword Explorer

        Find traffic-driving keywords with our 1.25 billion+ keyword index.

      • Link Explorer

        Explore over 40 trillion links for powerful backlink data.

      • Competitive Research

        Uncover valuable insights on your organic search competitors.

      • MozBar

        See top SEO metrics for free as you browse the web.

      • More Free SEO Tools

        Explore all the free SEO tools Moz has to offer.

      NEW Keyword Suggestions by Topic
      Moz Pro

      NEW Keyword Suggestions by Topic

      Learn more
    • Learn SEO
      • Beginner's Guide to SEO

        The #1 most popular introduction to SEO, trusted by millions.

      • SEO Learning Center

        Broaden your knowledge with SEO resources for all skill levels.

      • On-Demand Webinars

        Learn modern SEO best practices from industry experts.

      • How-To Guides

        Step-by-step guides to search success from the authority on SEO.

      • Moz Academy

        Upskill and get certified with on-demand courses & certifications.

      • MozCon

        Save on Early Bird tickets and join us in London or New York City

      Access 20 years of data with flexible pricing
      Moz API

      Access 20 years of data with flexible pricing

      Find your plan
    • Blog
    • Why Moz
      • Digital Marketers

        Simplify SEO tasks to save time and grow your traffic.

      • Small Business Solutions

        Uncover insights to make smarter marketing decisions in less time.

      • Agency Solutions

        Earn & keep valuable clients with unparalleled data & insights.

      • Enterprise Solutions

        Gain a competitive edge in the ever-changing world of search.

      • The Moz Story

        Moz was the first & remains the most trusted SEO company.

      • New Releases

        Get the scoop on the latest and greatest from Moz.

      Surface actionable competitive intel
      New Feature

      Surface actionable competitive intel

      Learn More
    • Log in
      • Moz Pro
      • Moz Local
      • Moz Local Dashboard
      • Moz API
      • Moz API Dashboard
      • Moz Academy
    • Avatar
      • Moz Home
      • Notifications
      • Account & Billing
      • Manage Users
      • Community Profile
      • My Q&A
      • My Videos
      • Log Out

    The Moz Q&A Forum

    • Forum
    • Questions
    • Users
    • Ask the Community

    Welcome to the Q&A Forum

    Browse the forum for helpful insights and fresh discussions about all things SEO.

    1. Home
    2. SEO Tactics
    3. Intermediate & Advanced SEO
    4. Why do people put xml sitemaps in subfolders? Why not just the root? What's the best solution?

    Moz Q&A is closed.

    After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

    Why do people put xml sitemaps in subfolders? Why not just the root? What's the best solution?

    Intermediate & Advanced SEO
    2
    3
    4540
    Loading More Posts
    • Oldest to Newest
    • Newest to Oldest
    • Most Votes
    Reply
    • Reply as question
    Log in to reply
    This topic has been deleted. Only users with question management privileges can see it.
    • McTaggart
      McTaggart last edited by

      Just read this: "The location of a Sitemap file determines the set of URLs that can be included in that Sitemap. A Sitemap file located at http://example.com/catalog/sitemap.xml can include any URLs starting with http://example.com/catalog/ but can not include URLs starting with http://example.com/images/." here: http://www.sitemaps.org/protocol.html#location

      Yet surely it's better to put the sitemaps at the root so you have:
      (a) http://example.com/sitemap.xml 
      http://example.com/sitemap-chocolatecakes.xml
      http://example.com/sitemap-spongecakes.xml 
      and so on...

      OR this kind of approach - 
      (b) http://example/com/sitemap.xml
      http://example.com/sitemap/chocolatecakes.xml and 
      http://example.com/sitemap/spongecakes.xml

      I would tend towards (a) rather than (b) - which is the best option?

      Also, can I keep the structure the same for sitemaps that are subcategories of other sitemaps - for example - for a subcategory of http://example.com/sitemap-chocolatecakes.xml I might create http://example.com/sitemap-chocolatecakes-cherryicing.xml - or should I add a sub folder to turn it into http://example.com/sitemap-chocolatecakes/cherryicing.xml

      Look forward to reading your comments - Luke

      1 Reply Last reply Reply Quote 0
      • McTaggart
        McTaggart last edited by

        Thanks Angular Marketing, and Everett... very helpful feedback and much appreciated. Luke
        
        1 Reply Last reply Reply Quote 1
        • HiveDigitalInc
          HiveDigitalInc last edited by

          Consider this:  "The location of a Sitemap file determines the set of URLs that can be included in that Sitemap. A Sitemap file located at http://example.com/catalog/sitemap.xml can include any URLs starting with http://example.com/catalog/ but can not include URLs starting with http://example.com/images/." here: http://www.sitemaps.org/protocol.html#location

          B would not be an acceptable approach as http://example.com/sitemap/chocolatecakes.xml could only contain a sitemap of content located in http://example.com/sitemap.   For this same reason, you couldn't create sitemaps in subfolder directories...

          This is the best approach from those options you mentioned...

          (a) http://example.com/sitemap.xml 
          http://example.com/sitemap-chocolatecakes.xml 
          http://example.com/sitemap-spongecakes.xml
          http://example.com/sitemap-chocolatecakes-cherryicing.xml

          It is worth noting that you can have a sitemap of sitemaps.. so for example.

          http://example.com/sitemap.xml could contain links to http://example.com/sitemap-cakes,  http://example.com/sitemap-articles, etc..
          http://example.com/sitemap-cakes.xml could contain links to http://example.com/sitemap-chocolatecakes.xml, http://example.com/sitemap-vanilla-cakes.xml, etc..

          Try not to over-complicate things by trying to create sub-category sitemaps, etc.. Unless you have an exorbitant amount of sub-category pages, or have directories/sections managed by different cms, etc.

          You generally see large sites will have a separate sitemap based on content type (company pages, category pages, product pages, blog pages)

          1 Reply Last reply Reply Quote 3
          • 1 / 1
          • First post
            Last post

          Got a burning SEO question?

          Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.


          Start my free trial


          Browse Questions

          Explore more categories

          • Moz Tools

            Chat with the community about the Moz tools.

          • SEO Tactics

            Discuss the SEO process with fellow marketers

          • Community

            Discuss industry events, jobs, and news!

          • Digital Marketing

            Chat about tactics outside of SEO

          • Research & Trends

            Dive into research and trends in the search industry.

          • Support

            Connect on product support and feature requests.

          • See all categories

          Related Questions

          • fablau

            What's the best way to noindex pages but still keep backlinks equity?

            Hello everyone, Maybe it is a stupid question, but I ask to the experts... What's the best way to noindex pages but still keep backlinks equity from those noindexed pages? For example, let's say I have many pages that look similar to a "main" page which I solely want to appear on Google, so I want to noindex all pages with the exception of that "main" page... but, what if I also want to transfer any possible link equity present on the noindexed pages to the main page? The only solution I have thought is to add a canonical tag pointing to the main page on those noindexed pages... but will that work or cause wreak havoc in some way?

            Intermediate & Advanced SEO | | fablau
            3
          • kchandler

            What Happens If a Hreflang Sitemap Doesn't Include Every Language for Missing Translated Pages?

            As we are building a hreflang sitemap for a client, we are correctly implementing the tag across 5 different languages including English. However, the News and Events section was never translated into any of the other four languages. There are also a few pages that were translated into some but not all of the 4 languages. Is it good practice to still list out the individual non-translated pages like on a regular sitemap without a hreflang tag? Should the hreflang sitemap include the hreflang tag with pages that are missing a few language translations (when one or two language translations may be missing)? We are uncertain if this inconsistency would create a problem and we would like some feedback before pushing the hreflang sitemap live.

            Intermediate & Advanced SEO | | kchandler
            0
          • esiow2013

            May know what's the meaning of these parameters in .htaccess?

            Begin HackRepair.com Blacklist RewriteEngine on Abuse Agent Blocking RewriteCond %{HTTP_USER_AGENT} ^BlackWidow [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Bolt\ 0 [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Bot\ mailto:[email protected] [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} CazoodleBot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^ChinaClaw [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Custo [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Default\ Browser\ 0 [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^DIIbot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^DISCo [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} discobot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Download\ Demon [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^eCatch [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ecxi [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^EirGrabber [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^EmailCollector [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^EmailSiphon [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^EmailWolf [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Express\ WebPictures [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^ExtractorPro [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^EyeNetIE [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^FlashGet [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^GetRight [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^GetWeb! [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Go!Zilla [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Go-Ahead-Got-It [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^GrabNet [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Grafula [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} GT::WWW [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} heritrix [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^HMView [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} HTTP::Lite [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} HTTrack [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ia_archiver [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} IDBot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} id-search [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} id-search.org [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Image\ Stripper [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Image\ Sucker [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Indy\ Library [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^InterGET [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Internet\ Ninja [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^InternetSeer.com [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} IRLbot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ISC\ Systems\ iRc\ Search\ 2.1 [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Java [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^JetCar [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^JOC\ Web\ Spider [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^larbin [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^LeechFTP [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} libwww [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} libwww-perl [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Link [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} LinksManager.com_bot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} linkwalker [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} lwp-trivial [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Mass\ Downloader [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Maxthon$ [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} MFC_Tear_Sample [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^microsoft.url [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Microsoft\ URL\ Control [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^MIDown\ tool [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Mister\ PiX [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Missigua\ Locator [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Mozilla.*Indy [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Mozilla.NEWT [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^MSFrontPage [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Navroad [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^NearSite [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^NetAnts [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^NetSpider [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Net\ Vampire [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^NetZIP [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Nutch [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Octopus [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Offline\ Explorer [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Offline\ Navigator [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^PageGrabber [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} panscient.com [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Papa\ Foto [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^pavuk [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} PECL::HTTP [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^PeoplePal [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^pcBrowser [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} PHPCrawl [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} PleaseCrawl [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^psbot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^RealDownload [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^ReGet [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Rippers\ 0 [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} SBIder [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^SeaMonkey$ [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^sitecheck.internetseer.com [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^SiteSnagger [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^SmartDownload [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Snoopy [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Steeler [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^SuperBot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^SuperHTTP [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Surfbot [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^tAkeOut [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Teleport\ Pro [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Toata\ dragostea\ mea\ pentru\ diavola [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} URI::Fetch [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} urllib [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} User-Agent [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^VoidEYE [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Web\ Image\ Collector [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Web\ Sucker [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Web\ Sucker [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} webalta [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebAuto [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^[Ww]eb[Bb]andit [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} WebCollage [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebCopier [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebFetch [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebGo\ IS [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebLeacher [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebReaper [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebSauger [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Website\ eXtractor [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Website\ Quester [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebStripper [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebWhacker [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WebZIP [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} Wells\ Search\ II [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} WEP\ Search [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Wget [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Widow [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WWW-Mechanize [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^WWWOFFLE [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Xaldon\ WebSpider [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} zermelo [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^Zeus [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ^(.)Zeus.Webster [NC,OR]
            RewriteCond %{HTTP_USER_AGENT} ZyBorg [NC]
            RewriteRule ^. - [F,L] Abuse bot blocking rule end End HackRepair.com Blacklist

            Intermediate & Advanced SEO | | esiow2013
            1
          • edlondon

            Google Not Indexing XML Sitemap Images

            Hi Mozzers, We are having an issue with our XML sitemap images not being indexed. The site has over 39,000 pages and 17,500 images submitted in GWT.  If you take a look at the attached screenshot, 'GWT Images - Not Indexed', you can see that the majority of the pages are being indexed - but none of the images are. The first thing you should know about the images is that they are hosted on a content delivery network (CDN), rather than on the site itself. However, Google advice suggests hosting on a CDN is fine - see second screenshot, 'Google CDN Advice'.  That advice says to either (i) ensure the hosting site is verified in GWT or (ii) submit in robots.txt.  As we can't verify the hosting site in GWT, we had opted to submit via robots.txt. There are 3 sitemap indexes: 1) http://www.greenplantswap.co.uk/sitemap_index.xml, 2) http://www.greenplantswap.co.uk/sitemap/plant_genera/listings.xml and 3) http://www.greenplantswap.co.uk/sitemap/plant_genera/plants.xml. Each sitemap index is split up into often hundreds or thousands of smaller XML sitemaps. This is necessary due to the size of the site and how we have decided to pull URLs in.  Essentially, if we did it another way, it may have involved some of the sitemaps being massive and thus taking upwards of a minute to load. To give you an idea of what is being submitted to Google in one of the sitemaps, please see view-source:http://www.greenplantswap.co.uk/sitemap/plant_genera/4/listings.xml?page=1. Originally, the images were SSL, so we decided to reverted to non-SSL URLs as that was an easy change.  But over a week later, that seems to have had no impact.  The image URLs are ugly... but should this prevent them from being indexed? The strange thing is that a very small number of images have been indexed - see http://goo.gl/P8GMn. I don't know if this is an anomaly or whether it suggests no issue with how the images have been set up - thus, there may be another issue. Sorry for the long message but I would be extremely grateful for any insight into this.  I have tried to offer as much information as I can, however please do let me know if this is not enough. Thank you for taking the time to read and help. Regards, Mark Oz6HzKO rYD3ICZ

            Intermediate & Advanced SEO | | edlondon
            0
          • danng

            XML Sitemap Index Percentage (Large Sites)

            Hi all I'm wanting to find out from those who have experience dealing with large sites (10s/100s of millions of pages). What's a typical (or highest) percentage of indexed pages vs. submitted pages you've seen? This information can be found in webmaster tools where Google shows you the pages submitted & indexed for each of your sitemap. I'm trying to figure out whether, The average index % out there There is a ceiling (i.e. will never reach 100%) It's possible to improve the indexing percentage further Just to give you some background, sitemap index files (according to schema.org) have been implemented to improve crawl efficiency and I'm wanting to find out other ways to improve this further. I've been thinking about looking at the URL parameters to exclude as there are hundreds (e-commerce site) to help Google improve crawl efficiency and utilise the daily crawl quote more effectively to discover pages that have not been discovered yet. However, I'm not sure yet whether this is the best path to take or I'm just flogging a dead horse if there is such a ceiling or if I'm already at the average ballpark for large sites. Any suggestions/insights would be appreciated. Thanks.

            Intermediate & Advanced SEO | | danng
            0
          • nicole.healthline

            Soft 404's from pages blocked by robots.txt -- cause for concern?

            We're seeing soft 404 errors appear in our google webmaster tools section on pages that are blocked by robots.txt (our search result pages). Should we be concerned? Is there anything we can do about this?

            Intermediate & Advanced SEO | | nicole.healthline
            4
          • RichBestSEO

            Is there any negative SEO effect of having comma's in URL's?

            Hello, I have a client who has a large ecommerce website. Some category names have been created with comma's in - which has meant that their software has automatically generated URL's with comma's in for every page that comes beneath the category in the site hierarchy. eg. 1 : http://shop.deliaonline.com/store/music,-dvd-and-games/dvds-and-blu_rays/ eg. 2 : http://shop.deliaonline.com/store/music,-dvd-and-games/dvds-and-blu_rays/action-and-adventure/ etc... I know that URL's with comma's in look a bit ugly! But is there 'any' SEO reason why URL's with comma's in are any less effective? Kind Regs, RB

            Intermediate & Advanced SEO | | RichBestSEO
            0
          • nicole.healthline

            Tool to calculate the number of pages in Google's index?

            When working with a very large site, are there any tools that will help you calculate the number of links in the Google index? I know you can use site:www.domain.com to see all the links indexed for a particular url. But what if you want to see the number of pages indexed for 100 different subdirectories (i.e. www.domain.com/a, www.domain.com/b)? is there a tool to help automate the process of finding the number of pages from each subdirectory in Google's index?

            Intermediate & Advanced SEO | | nicole.healthline
            0

          Get started with Moz Pro!

          Unlock the power of advanced SEO tools and data-driven insights.

          Start my free trial
          Products
          • Moz Pro
          • Moz Local
          • Moz API
          • Moz Data
          • STAT
          • Product Updates
          Moz Solutions
          • SMB Solutions
          • Agency Solutions
          • Enterprise Solutions
          • Digital Marketers
          Free SEO Tools
          • Domain Authority Checker
          • Link Explorer
          • Keyword Explorer
          • Competitive Research
          • Brand Authority Checker
          • Local Citation Checker
          • MozBar Extension
          • MozCast
          Resources
          • Blog
          • SEO Learning Center
          • Help Hub
          • Beginner's Guide to SEO
          • How-to Guides
          • Moz Academy
          • API Docs
          About Moz
          • About
          • Team
          • Careers
          • Contact
          Why Moz
          • Case Studies
          • Testimonials
          Get Involved
          • Become an Affiliate
          • MozCon
          • Webinars
          • Practical Marketer Series
          • MozPod
          Connect with us

          Contact the Help team

          Join our newsletter
          Moz logo
          © 2021 - 2025 SEOMoz, Inc., a Ziff Davis company. All rights reserved. Moz is a registered trademark of SEOMoz, Inc.
          • Accessibility
          • Terms of Use
          • Privacy

          Looks like your connection to Moz was lost, please wait while we try to reconnect.