Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Does Google Read URL's if they include a # tag? Re: SEO Value of Clean Url's
-
An ECWID rep stated in regards to an inquiry about how the ECWID url's are not customizable, that "an important thing is that it doesn't matter what these URLs look like, because search engines don't read anything after that # in URLs. " Example http://www.runningboards4less.com/general-motors#!/Classic-Pro-Series-Extruded-2/p/28043025/category=6593891
Basically all of this: #!/Classic-Pro-Series-Extruded-2/p/28043025/category=6593891
That is a snippet out of a conversation where ECWID said that dirty urls don't matter beyond a hashtag...
Is that true? I haven't found any rule that Google or other search engines (Google is really the most important) don't index, read, or place value on the part of the url after a # tag.
-
Thanks Sachin
So basically on sites that use ECWID for their ecommerce, only the main pages on the actual website (not the product pages that ECWID generates which is the part from the hashtag on) get indexed?
Essentially Google is NOT indexing any products because ECWID uses an existing page on a website and shows products there.
Is that correct? For example if you look at an XML sitemap for the running boards site that we used as an example you will see there are only 10 pages on it. However there are over a 1000 different types of running boards sold on the site which have their own pages populate after a #tag in the url: http://www.runningboards4less.com/index.php?option=com_xmap&view=xml&tmpl=component&id=1
-
Traditionally, the search engines ignore everything after the hash-tag because it's usually content contained on the same page or URL. Therefore, those additional URLs should not get indexed (only the part before the hashtag should). As per my experience, they completely disregard anything after the # tag in a URL.
However, it is always advisable to have clean urls as both SEs and people prefer them over complicated one. Clean urls deliver enhanced usability to help users remember and share your URLs more easily. Another benefit of a simple URL is that other sites are more likely to link to a simple URL, because it is easier to do so.
-
Anyone? Bueller? Bueller?
Also if anyone knows how to modify Ecwid urls so that they are "clean", please chime in...
-
Thank you for your response. I am not implying that it is indexing a "separate" url. I am referring to the SEO value of a proper "clean" url for the specific page. ECWID doesn't allow for it's users to create custom urls.
If I were creating a url for the page I listed above, I would have it something like **** .com/chevy-van NOT _.com/#!/Classic-Pro-Series-Extruded-2/p/28043025/category=6593891 _
My question regards the low or lack of any value at all using a url like the long one above and if the statement made by the ECWID rep is factual.
-
These URLs are called AJAX URL- a URL containing a hash fragment, e.g.,
www.example.com/index.html#mystate
, where#mystate
is the hash fragment.Reg. the above mentioned URL- This url is using Hash-Bang (#!) not hashtag, which makes Ajax/ javascript pages crawlable. The basic # indicates a location on a page (anchor) so does not get indexed as a separate URL.
You can find detailed information here- https://support.google.com/webmasters/answer/174992?hl=en
https://support.google.com/webmasters/answer/174993
Hope this helps!
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to stop URLs that include query strings from being indexed by Google
Hello Mozzers Would you use rel=canonical, robots.txt, or Google Webmaster Tools to stop the search engines indexing URLs that include query strings/parameters. Or perhaps a combination? I guess it would be a good idea to stop the search engines crawling these URLs because the content they display will tend to be duplicate content and of low value to users. I would be tempted to use a combination of canonicalization and robots.txt for every page I do not want crawled or indexed, yet perhaps Google Webmaster Tools is the best way to go / just as effective??? And I suppose some use meta robots tags too. Does Google take a position on being blocked from web pages. Thanks in advance, Luke
Intermediate & Advanced SEO | | McTaggart0 -
Will disallowing URL's in the robots.txt file stop those URL's being indexed by Google
I found a lot of duplicate title tags showing in Google Webmaster Tools. When I visited the URL's that these duplicates belonged to, I found that they were just images from a gallery that we didn't particularly want Google to index. There is no benefit to the end user in these image pages being indexed in Google. Our developer has told us that these urls are created by a module and are not "real" pages in the CMS. They would like to add the following to our robots.txt file Disallow: /catalog/product/gallery/ QUESTION: If the these pages are already indexed by Google, will this adjustment to the robots.txt file help to remove the pages from the index? We don't want these pages to be found.
Intermediate & Advanced SEO | | andyheath0 -
Partial Match or RegEx in Search Console's URL Parameters Tool?
So I currently have approximately 1000 of these URLs indexed, when I only want roughly 100 of them. Let's say the URL is www.example.com/page.php?par1=ABC123=&par2=DEF456=&par3=GHI789= All the indexed URLs follow that same kinda format, but I only want to index the URLs that have a par1 of ABC (but that could be ABC123 or ABC456 or whatever). Using URL Parameters tool in Search Console, I can ask Googlebot to only crawl URLs with a specific value. But is there any way to get a partial match, using regex maybe? Am I wasting my time with Search Console, and should I just disallow any page.php without par1=ABC in robots.txt?
Intermediate & Advanced SEO | | Ria_0 -
Alt tag for src='blank.gif' on lazy load images
I didn't find an answer on a search on this, so maybe someone here has faced this before. I am loading 20 images that are in the viewport and a bit below. The next 80 images I want to 'lazy-load'. They therefore are seen by the bot as a blank.gif file. However, I would like to get some credit for them by giving a description in the alt tag. Is that a no-no? If not, do they all have to be the same alt description since the src name is the same? I don't want to mess things up with Google by being too aggressive, but at the same time those are valid images once they are lazy loaded, so would like to get some credit for them. Thanks! Ted
Intermediate & Advanced SEO | | friendoffood0 -
Do Q&A 's work for SEO
If I create a good community in my particular field on my SEO site and have a quality Q&A section like this etc (ripping of MOZ's idea here sorry, I hope it's ok) will the long term returns be worth the effort of creating and man ageing this. Is the user created content of as much use as I think it will be?
Intermediate & Advanced SEO | | mark_baird0 -
Do UTM URL parameters hurt SEO backlink value?
Does www.example.com and www.example.com/?utm_source=Google&utm_medium=Press+Release&utm_campaign=Google have the same SEO backlink value? I would assume that Google knows the difference.
Intermediate & Advanced SEO | | mkhGT0 -
Include Cross Domain Canonical URL's in Sitemap - Yes or No?
I have several sites that have cross domain canonical tags setup on similar pages. I am unsure if these pages that are canonicalized to a different domain should be included in the sitemap. My first thought is no, because I should only include pages in the sitemap that I want indexed. On the other hand, if I include ALL pages on my site in the sitemap, once Google gets to a page that has a cross domain canonical tag, I'm assuming it will just note that and determine if the canonicalized page is the better version. I have yet to see any errors in GWT about this. I have seen errors where I included a 301 redirect in my sitemap file. I suspect its ok, but to me, it seems that Google would rather not find these URL's in a sitemap, have to crawl them time and time again to determine if they are the best page, even though I'm indicating that this page has a similar page that I'd rather have indexed.
Intermediate & Advanced SEO | | WEB-IRS0 -
Is 404'ing a page enough to remove it from Google's index?
We set some pages to 404 status about 7 months ago, but they are still showing in Google's index (as 404's). Is there anything else I need to do to remove these?
Intermediate & Advanced SEO | | nicole.healthline0