Block session id URLs with robots.txt

Mat_C

Hi,

I would like to block all URLs with the parameter '?filter=' from being crawled by including them in the robots.txt.

Which directive should I use:

User-agent: *
Disallow: ?filter=

or

User-agent: *
Disallow: /?filter=

In other words, is the forward slash in the beginning of the disallow directive necessary?

Thanks!

Mat_C

Hi Martijn,

Thanks for the answer. Regarding the forward slash in the beginning, is it necessary to use this?

In the robots text from Zalando for example, you can see that they don't use it for a lot of filters.

Martijn_Scheijbeler

Uhh, that's not what the requester is looking for and could actually cause tons of problems if you would apply this on a site that you're unaware of. I would always go with the most limiting robots.txt that you can and in this case, I would go with: /?filter=

jasongmcmahon

Hi,

The following should suffice as it will black any URL with a "?" in it

User-agent: *
Disallow: /*?

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

Block session id URLs with robots.txt

Browse Questions

Explore more categories

Related Questions

Will Reduced Bounce Rate, Increased Pages/Session, Increased Session Duration-RESULT IN BETTER RANKING?

If my website do not have a robot.txt file, does it hurt my website ranking?

Help with facet URLs in Magento

Should I include URLs that are 301'd or only include 200 status URLs in my sitemap.xml?

Weird 404 URL Problem - domain name being placed at end of urls

If I own a .com url and also have the same url with .net, .info, .org, will I want to point them to the .com IP address?

Removing dashes in our URLs?

Blocking Dynamic URLs with Robots.txt

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved