Substantial difference between Number of Indexed Pages and Sitemap Pages

Online-Marketing-Guy

Hey there,

I am doing a website audit at the moment.

I've noticed substantial differences between the number of pages indexed (Search Console), the number of pages in the sitemap, and the number I get when I crawl the site with Screaming Frog (see below). Would those discrepancies concern you? The website and its rankings seem fine otherwise.

Total indexed: 2,360 (Search Console)
About 2,920 results (Google search "site:example.com")
Sitemap: 1,229 URLs
Screaming Frog Spider: 1,352 URLs

Cheers,
Jochen

anthonydnelson

Those discrepancies would not concern me, but the numbers you list each measure something different:

Total indexed: 2,360 (Search Console) - this is likely a reasonably accurate count of the pages you have indexed in Google. You could use a tool like URL Profiler to check the index status of specific URLs.

About 2,920 results Google search "site:example.com" - site: search is less accurate and will likely return a different number each time you do it, even if it's just moments apart.

Sitemap: 1,229 URLs - these are URLs you added to a sitemap because they are priority pages you want Google to index and, hopefully, rank. You control this number.

Screaming Frog Spider: 1,352 URLs - Screaming Frog starts on your homepage and crawls the site, attempting to discover as many URLs as possible. If you are not linking to a page, SF won't be able to crawl it. Google, on the other hand, may still have old pages, old URL structures, or pages linked from external websites in its index, and it won't forget them.

A really important question is: how many pages do you have that you want to be indexed? Is Google's index bloated with pages that you want to keep out? Figure these things out, and then adjust your sitemaps, noindex tags, and robots.txt rules as needed.
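One way to start answering that question is to compare the sitemap against the crawl directly. The snippet below is only a rough sketch: it assumes you have the sitemap XML as a string and a plain list of crawled URLs (for example, exported from Screaming Frog). The function names and sample URLs are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard XML sitemaps (sitemaps.org protocol).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def parse_sitemap_urls(xml_text):
    """Return the set of <loc> values found in a sitemap XML string."""
    root = ET.fromstring(xml_text)
    return {loc.text.strip() for loc in root.iter(SITEMAP_NS + "loc") if loc.text}


def compare_url_sets(sitemap_urls, crawled_urls):
    """Split URLs into those in both sources and those in only one."""
    sitemap_urls, crawled_urls = set(sitemap_urls), set(crawled_urls)
    return {
        "in_both": sitemap_urls & crawled_urls,
        # In the sitemap but never reached by the crawler (possibly orphaned).
        "sitemap_only": sitemap_urls - crawled_urls,
        # Linked on the site but missing from the sitemap.
        "crawl_only": crawled_urls - sitemap_urls,
    }


# Hypothetical sample data, for illustration only.
sample_sitemap = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/products</loc></url>
  <url><loc>https://example.com/orphan-page</loc></url>
</urlset>"""

sample_crawl = [
    "https://example.com/",
    "https://example.com/products",
    "https://example.com/blog/old-post",
]

report = compare_url_sets(parse_sitemap_urls(sample_sitemap), sample_crawl)
for bucket, urls in report.items():
    print(bucket, sorted(urls))
```

URLs that show up in "sitemap_only" are worth checking for missing internal links, and URLs in "crawl_only" are candidates to either add to the sitemap or keep out of the index.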

Online-Marketing-Guy

Thanks for your reply, Dmitrii,

We have excluded all query parameters in Search Console, so this shouldn't be an issue. What is also strange is that when I try to scrape the SERPs via a site:example.com search, Google only shows a fraction (about 700) of the 2,920 results.

Cheers,

Jochen

Dmitrii Kustov

Hi there.

I think that as long as rankings are good (especially historically), there is no reason to worry. Google's index often includes pages that are not in the sitemap, for example pages generated with query parameters (domain.com?x=value). Sometimes these pages do not really exist on their own (like filter pages in online stores); they only exist "on the fly".
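To get a feel for how much of the gap could come from parameterized URLs, you could group a list of known URLs by their base address. This is just a minimal sketch, assuming you already have some list of indexed or discovered URLs to feed in; the function names and sample URLs are hypothetical.

```python
from collections import Counter
from urllib.parse import urlsplit, urlunsplit


def strip_query(url):
    """Return the URL without its query string and fragment."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))


def count_parameter_variants(urls):
    """Count how many query-parameter variants exist per base URL."""
    counts = Counter()
    for url in urls:
        if urlsplit(url).query:  # only count URLs that carry parameters
            counts[strip_query(url)] += 1
    return counts


# Hypothetical sample data, for illustration only.
sample_urls = [
    "https://example.com/shop?color=red",
    "https://example.com/shop?color=blue",
    "https://example.com/shop",
    "https://example.com/about",
]

variants = count_parameter_variants(sample_urls)
for base_url, count in variants.items():
    print(base_url, count)
```

A base URL with many variants is a likely source of extra indexed pages that will never appear in the sitemap.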

Hope this makes sense and helps
