Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Google Search console says 'sitemap is blocked by robots?
- 
					
					
					
					
 Google Search console is telling me "Sitemap contains URLs which are blocked by robots.txt." I don't understand why my sitemap is being blocked? My robots.txt look like this: User-Agent: * 
 Disallow:It's a WordPress site, with Yoast SEO installed. Is anyone else having this issue with Google Search console? Does anyone know how I can fix this issue? 
- 
					
					
					
					
 Nice happy to hear that do you work with Greg Reindel? He is a good friend I looked at your IP that is why I ask? Tom 
- 
					
					
					
					
 I agree with David Hey is your dev Greg Reindel? If so you can call me for help PM me here for my info. Thomas Zickell 
- 
					
					
					
					
 Hey guys, I ended up disabling the sitemap option from YoastSEO, then installed the 'Google (XML) sitemap' plug-in. I re-submitted the sitemap to Google last night, and it came back with no issues. I'm glad to finally have this sorted out. Thanks for all the help! 
- 
					
					
					
					
 Hi Christian, The current robots.txt shouldn't be blocking those URLs. Did you or someone else recently change the robots.txt file? If so, give Google a few days to re-crawl your site. Also, can you check what happens when you do a fetch and render on one of the blocked posts in Search Console? Do you have issues there? Cheers, David 
- 
					
					
					
					
 I think you need to make an https robots.txt file if you are running https if running https https://a-moz.groupbuyseo.org/blog/xml-sitemaps `User-agent: * Disallow: /wp-admin/ Allow: /wp-admin/admin-ajax.php` Sitemap: https://domain.com/index-sitemap.xml(that is a https site map)can you send the sitemap URL or run it though deepcrawl Hope this helps? Did you make a new robots.txt file? 
- 
					
					
					
					
 Thanks for the response. Do you think this is a robots.txt issue? Or could this be caused by the YoastSEO plugin? Do you know if this plug-in works with YoastSEO together? Or will it cause issues? 
- 
					
					
					
					
 Thank you for the response. I just scanned the site using 'Screaming frog'. Under Internal>Directives there were zero 'no index' links. I also check for '404 errors', server 505 errors, or anything 'blocked by robots.txt'. Google search console is still showing me that there are URL's being blocked by my sitemap. (I added a screenshot of this). When I click through, it tells me that the 'post sitemap' has over +300 warnings. I have just deleted the YoastSEO plugin, and I am now re-installing it. hopefully, this fixes the issue. 
- 
					
					
					
					
 No, you do not need to change or plug-in what is happening is Webmaster tools is telling you that you have no index or no follow were robots xTag somewhere on your URLs inside your sitemap. Run your site through Moz, screaming frog Seo spider or deepcrawl and look for no indexed URLs. webmaster tools/search console is telling you that you have no index URLs inside of your XML sitemap not that you robots.txt is blocking it. This would be set in the Yoast plugin. one way to correct it is to look for noindex URLs & filter them inside Yoast so they are not being presented to the crawlers. If you would like you can turn off the sitemap on Yoast and turn it back on if that does not work I recommend completely removing the plug-in and reinstalling it - https://kb.yoast.com/kb/how-can-i-uninstall-my-plugin/
- https://kinsta.com/blog/uninstall-wordpress-plugin/
 Can you send a screenshot of what you're seeing? When you see it in Google Webmaster tools are you talking about the XML sitemap itself mean no indexed because all XML sitemaps are no indexed. Please add this to your robots.txt `User-agent:* Disallow:/wp-admin/ Allow:/wp-admin/admin-ajax.php` Sitemap: http://www.website.com/sitemap_index.xmlI hope this is of help, Tom 
- 
					
					
					
					
 Hi, Use this plugin https://wordpress.org/plugins/wp-robots-txt/ it will remove previous robots.txt and set simple wordpress robots.txt and wait for a day problem can be solved. Also watch this video on the same @ https://www.youtube.com/watch?v=DZiyN07bbBM Thanks 
Browse Questions
Explore more categories
- 
		
		Moz ToolsChat with the community about the Moz tools. 
- 
		
		SEO TacticsDiscuss the SEO process with fellow marketers 
- 
		
		CommunityDiscuss industry events, jobs, and news! 
- 
		
		Digital MarketingChat about tactics outside of SEO 
- 
		
		Research & TrendsDive into research and trends in the search industry. 
- 
		
		SupportConnect on product support and feature requests. 
Related Questions
- 
		
		
		
		
		
		Google image search filter tabs and how to rank on them
 I have noticed Google image search has included suggestion tabs (e.g,. design, nature... when searching background) on the top of the image search. Technical SEO | | Mike555
 Are there specific meta tags I can add into my images so that my images will show up on each tab?
 Do those filters just show content based on image keywords or something else? IRme7gQ0
- 
		
		
		
		
		
		Robots.txt & meta noindex--site still shows up on Google Search
 I have set up my robots.txt like this: User-agent: * Technical SEO | | RoxBrock
 Disallow: / and I have this meta tag in my on a Wordpress site, set up with SEO Yoast name="robots" content="noindex,follow"/> I did "Fetch as Google" on my Google Search Console My website is still showing up in the search results and it says this: "A description for this result is not available because of this site's robots.txt" This site has not shown up for years and now it is ranking above my site that I want to rank for this keyword. How do I get Google to ignore this site? This seems really weird and I'm confused how a site with little content, that has not been updated for years can rank higher than a site that is constantly updated and improved.1
- 
		
		
		
		
		
		Is sitemap required on my robots.txt?
 Hi, I know that linking your sitemap from your robots.txt file is a good practice. Ok, but... may I just send my sitemap to search console and forget about adding ti to my robots.txt? That's my situation: 1 multilang platform which means... ... 2 set of pages. One for each lang, of course But my CMS (magento) only allows me to have 1 robots.txt file So, again: may I have a robots.txt file woth no sitemap AND not suffering any potential SEO loss? Thanks in advance, Juan Vicente Mañanas Abad Technical SEO | | Webicultors0
- 
		
		
		
		
		
		Good alternatives to Xenu's Link Sleuth and AuditMyPc.com Sitemap Generator
 I am working on scraping title tags from websites with 1-5 million pages. Xenu's Link Sleuth seems to be the best option for this, at this point. Sitemap Generator from AuditMyPc.com seems to be working too, but it starts handing up, when a sitemap file, the tools is working on,becomes too large. So basically, the second one looks like it wont be good for websites of this size. I know that Scrapebox can scrape title tags from list of url, but this is not needed, since this comes with both of the above mentioned tools. I know about DeepCrawl.com also, but this one is paid, and it would be very expensive with this amount of pages and websites too (5 million ulrs is $1750 per month, I could get a better deal on multiple websites, but this obvioulsy does not make sense to me, it needs to be free, more or less). Seo Spider from Screaming Frog is not good for large websites. So, in general, what is the best way to work on something like this, also time efficient. Are there any other options for this? Thanks. Technical SEO | | blrs120
- 
		
		
		
		
		
		Why isn't my homepage number #1 when searching my brand name?
 Hi! So we recently (a month ago) lunched a new website, we have great content that updates everyday, we're active on social platforms, and we did all that's possible, at the moment, when it comes to on site optimization (a web developer will join our team this month and help us fix all the rest). When I search for our brand name all our social profiles come up first, after them we have a few inner pages from our different news sections, but our homepage is somewhere in the 2nd search page... What may be the reason for that? Is it just a matter of time or is there a problem with our homepage I'm unable to find? Thanks! Technical SEO | | Orly-PP0
- 
		
		
		
		
		
		Block Domain in robots.txt
 Hi. We had some URLs that were indexed in Google from a www1-subdomain. We have now disabled the URLs (returning a 404 - for other reasons we cannot do a redirect from www1 to www) and blocked via robots.txt. But the amount of indexed pages keeps increasing (for 2 weeks now). Unfortunately, I cannot install Webmaster Tools for this subdomain to tell Google to back off... Any ideas why this could be and whether it's normal? I can send you more domain infos by personal message if you want to have a look at it. Technical SEO | | zeepartner0
- 
		
		
		
		
		
		Blank pages in Google's webcache
 Hello all, Is anybody experiencing blanck page's in Google's 'Cached' view? I'm seeing just the page background and none of the content for a couple of my pages but when I click 'View Text Only' all of teh content is there. Strange! I'd love to hear if anyone else is experiencing the same. Perhaps this is something to do with the roll out of Google's updates last week?! Thanks, Technical SEO | | A_Q
 Elias0
- 
		
		
		
		
		
		Blocking URL's with specific parameters from Googlebot
 Hi, I've discovered that Googlebot's are voting on products listed on our website and as a result are creating negative ratings by placing votes from 1 to 5 for every product. The voting function is handled using Javascript, as shown below, and the script prevents multiple votes so most products end up with a vote of 1, which translates to "poor". How do I go about using robots.txt to block a URL with specific parameters only? I'm worried that I might end up blocking the whole product listing, which would result in de-listing from Google and the loss of many highly ranked pages. DON'T want to block: http://www.mysite.com/product.php?productid=1234 WANT to block: http://www.mysite.com/product.php?mode=vote&productid=1234&vote=2 Javacript button code: onclick="javascript: document.voteform.submit();" Thanks in advance for any advice given. Regards, Technical SEO | | aethereal
 Asim0
 
			
		 
			
		 
			
		 
			
		 
					
				 
					
				 
					
				 
					
				 
					
				 
					
				