Unnecessary pages getting indexed in Google for my blog

rahulchowdhury

I have a blog dapazze.com and I am suffering from a problem for a long time. I found out that Google have indexed hundreds of replytocom links and images attachment pages for my blog.

I had to remove these pages manually using the URL removal tool. I had used "Disallow: ?replytocom" in my robots.txt, but Google disobeyed it. After that, I removed the parameter from my blog completely using the SEO by Yoast plugin.

But now I see that Google has again started indexing these links even after they are not present in my blog (I use #comment). Google have also indexed many of my admin and plugin pages, whereas they are disallowed in my robots.txt file.

Have a look at my robots.txt file here: http://dapazze.com/robots.txt

Please help me out to solve this problem permanently?

Esaky

Me too have the same issue ! but not indexed in the Google ! but URL parameters in Google Webmasters shows there are 5K errors !

Should i use the URL Parameters settings or which one ?

Also make sure replytocom links are not blocked using Robots.txt, as it will stop Google bots from crawling and this your links won’t get deindexed. This is one mistake which I did, and later after removing replytocom parameter from robots.txt file, I was able to get most of my replytocom links deindexed. These are warning by the blogger ! http://www.shoutmeloud.com/how-to-fix-replytocom-links-issue-in-wordpress.html - he showed how to do that ! but my problem is different - It's Good that it's not indexed but i don't want to take any risk ! how to avooid them for future !

Someone else told me here that some plugins are doing/helping for you ! and not seen in your Robot.txt !

Confused confused ! so much confused ! Please help me !

rahulchowdhury

Actually previously I had removed the links manually. But I am seeing them come up again even after removing the parameter completely.

Can you please point our the problem for me?

SoftzSolutions

Please check that the comment pages are blocked by robots.txt file -

https://www.google.co.in/webhp?sourceid=chrome-instant&ion=1&ie=UTF-8#q=inurl:replytocom+site:http://dapazze.com/&hl=en&tbo=d&filter=0&bav=on.2,or.r_gc.r_pw.r_cp.r_qf.&bvm=bv.1355534169,d.bmk&fp=8d5ddd2cfb254bfd&bpcl=40096503&ion=1&biw=1366&bih=677

However, the blocked pages are now getting redirected to the main landing page of the blog posts.

Seems like it will take a while for Google to recrawl these pages and sort the issue.

In the mean time, could you please show some pages that are getting indexed by Google.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

Unnecessary pages getting indexed in Google for my blog

Browse Questions

Explore more categories

Related Questions

Removing a site from Google index with no index met tags

Google is indexing bad URLS

Why is my blog disappearing from Google index?

How to Stop Google from Indexing Old Pages

How to stop my webmail pages not to be indexed on Google ??

How long does it take for Google for deindexing pages?

De-indexing millions of pages - would this work?

Why is a 301 redirected url still getting indexed?

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved