Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
How do I stop my webmail pages from being indexed by Google?
- When I searched Google for site:mywebsite.com to get a list of indexed pages, a result titled "Webmail - Login" came up, to my surprise. Although it is associated with the domain, it sits on a completely different server: it is the Rackspace email server's browser interface. I am sure that nothing on the website links or points to it.
  So why is Google indexing it, and how do I get it out of the index? I tried in Webmaster Tools but could not, as it seems to be a subdomain. Any ideas? Thanks, Naresh Sadasivan
- Hi, so you did add a meta noindex tag? Great. Remember that Google needs to crawl the page to see the meta noindex, so if you block the page in robots.txt, Google never sees the tag and the URL stays indexed, although it has about a 0% chance of showing up in search unless you search for that URL. Also, I wouldn't spend any time worrying about obscure pages that are indexed. They are not going to hurt your rankings.
- "However, I have double-checked that there is no external link to the webmail pages from any other source."
  The link might be so obscure that SEOmoz has not picked it up but Googlebot has. Note that WMT has a "latest back links" report, but it lags by a few weeks, so the link might be recent. I'm 99% sure there has to be a link from somewhere; otherwise there is no way Google could find that page.
  "I have also added noindex to the robots.txt file on my site, but there is no change yet."
  It will not take effect until the next time Google crawls your site. I think it took a week or so before my test site was removed (I used the robots.txt file too).
- Thanks for the prompt response. However, I have double-checked that there is no external link to the webmail pages from any other source. I have also added noindex to the robots.txt file on my site, but there is no change yet. The Webmail - Login access is provided by the hosting provider, and I do not have access to their files other than checking my email using their system. Thanks, Naresh Sadasivan
- There must be a link from somewhere for Google to pick it up. I had a test site on a subdomain that got picked up, to my surprise. The reason: the site had a Pinterest button on the product pages, and the developer had tested it to see if it worked and later removed it, but someone (or something) repinned it, and a backlink to my test site was born. The point is that there are millions of ways to get a backlink accidentally. The best way to keep the page from being listed is either to add it to your robots.txt file or to add noindex to the webmail page.
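For anyone landing on this thread later, here is a minimal Python sketch of how the two options discussed above interact: a crawler can only honour a meta noindex tag on a page that robots.txt allows it to fetch. This is an illustration only, not how Google works internally. The function name `noindex_status`, the host `mail.example.com`, and the sample robots.txt and HTML are all made up for the example; it uses only the standard library and does no network requests.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        if tag == "meta" and (attr.get("name") or "").lower() == "robots":
            for part in (attr.get("content") or "").split(","):
                self.directives.add(part.strip().lower())


def noindex_status(robots_txt, html, url, agent="Googlebot"):
    """Hypothetical helper: can a crawler fetch the page and see a noindex?"""
    rules = RobotFileParser()
    rules.parse(robots_txt.splitlines())
    if not rules.can_fetch(agent, url):
        # The page is never fetched, so a noindex tag on it is never read.
        return "blocked"
    meta = RobotsMetaParser()
    meta.feed(html)
    if "noindex" in meta.directives:
        # The tag is visible and can take effect after the next crawl.
        return "noindex-visible"
    return "indexable"


# Assumed sample inputs, not taken from the original poster's site.
LOGIN_URL = "https://mail.example.com/webmail/login"
BLOCKING_ROBOTS = "User-agent: *\nDisallow: /webmail/"
NOINDEX_HTML = '<html><head><meta name="robots" content="noindex, nofollow"></head></html>'
PLAIN_HTML = "<html><head><title>Webmail - Login</title></head></html>"

print(noindex_status(BLOCKING_ROBOTS, NOINDEX_HTML, LOGIN_URL))
print(noindex_status("", NOINDEX_HTML, LOGIN_URL))
print(noindex_status("", PLAIN_HTML, LOGIN_URL))
```

The first call shows the trap mentioned earlier in the thread: with the Disallow rule in place the tag is never read, so the two measures should not be combined on the same URL. Also note that robots.txt is per host, so for a webmail subdomain the file would have to live on that subdomain, not on the main site.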