Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies.
Why are Seemingly Randomly Generated URLs Appearing as Errors in Google Webmaster Tools?
-
I've been confused by some URLs that are showing up as errors in our GWT account. They appear to be randomly generated alphanumeric strings that Google is reporting as 404 errors. The pages do return a 404, because nothing ever existed at those addresses or was linked to. Here are some examples, which sit directly off our root domain:
/JEzjLs2wBR0D6wILPy0RCkM/WFRnUK9JrDyRoVCnR8=
/MevaBpcKoXnbHJpoTI5P42QPmQpjEPBlYffwY8Mc5I=
/YAKM15iU846X/ymikGEPsdq 26PUoIYSwfb8 FBh34=
I haven't been able to track down these character strings in any internet index or anywhere in our source code so I have no idea why Google is reporting them.
We've been pretty vigilant lately about duplicate content and thin content issues, and my concern is that there is an unknown number of URLs like these that Google thinks exist when they don't.
Has anyone else seen GWT reporting errors like this for their site? Does anyone have any clue why Google would report them as errors?
-
Glad to hear you located the root cause of the issue.
-
Update: I think I found the source of the errors. We have a form submission box on every page of our site and there are random alphanumeric strings that are unique to every page that look like these urls that Google is reporting as errors. Maybe Google is trying to use our form submission tool.
It's still really odd that Google would think they are URLs and report them as errors, since they appear in an <input> tag, not an <a> tag:
```
<input id="authenticity_token" name="authenticity_token" type="hidden" value="ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=" />
```
-
The next step would be to check Google WMT > Diagnostics > Crawl errors > Not Found tab
Click the "Linked from Detected" text. That will show the source of the link.
-
That's not the case. We have an in-house content system and no shopping cart. These URLs return a 404 error and always have, because they've never existed.
No crawlers have returned URLs that look like this, and I can't get them to show up in Google's index either.
-
Hi Jimmy,
My bet is you are using a CMS or Shopping Cart on your site. The software is generating these seemingly random URLs, and the Google crawler is seeing them.
Crawlers don't move through your site the way people do. If you take your home page and right-click, then choose View Page Source, you will find many URLs. The crawler will then follow each and every URL throughout your site. The URLs you are seeing could be generated on any given page.
To track the issue, use the SEOmoz crawler on your site, then analyze the results. When you find these types of URLs, take note of the page they are located on.
Alternatively, you can contact your CMS or shopping cart's software developer and inquire about the URLs. If these URLs show a folder path before the random characters, that information can help track down which pages or software are generating them.
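For anyone who would rather script the View Page Source check than read the markup by hand, here is a minimal sketch in Python using only the standard library. It is not a Moz or Google tool, and the names `CandidateCollector`, `find_sources`, `sample_html`, and `reported` are made up for illustration. It assumes you have a page's HTML as a string and a list of paths from the crawl error report, and it lists which tag and attribute each reported path appears in. The sample token is the one quoted in this thread; treating it as a reported 404 path is an assumption for the sake of the demo.

```python
from html.parser import HTMLParser


class CandidateCollector(HTMLParser):
    """Collect attribute values a crawler might plausibly treat as URLs."""

    def __init__(self):
        super().__init__()
        self.candidates = []  # list of (tag, attribute, value)

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if value is None:
                continue
            # Ordinary links, resources, and form targets.
            if name in ("href", "src", "action"):
                self.candidates.append((tag, name, value))
            # Form field values, such as a hidden authenticity token.
            elif tag == "input" and name == "value":
                self.candidates.append((tag, name, value))


def find_sources(html, reported_paths):
    """Map each reported path to the (tag, attribute) pairs whose value matches it."""
    parser = CandidateCollector()
    parser.feed(html)
    matches = {}
    for path in reported_paths:
        needle = path.lstrip("/")
        hits = [(tag, attr) for tag, attr, value in parser.candidates
                if value.lstrip("/") == needle]
        if hits:
            matches[path] = hits
    return matches


# Hypothetical page source and error report, for illustration only.
sample_html = """
<a href="/about">About</a>
<form action="/subscribe" method="post">
  <input id="authenticity_token" name="authenticity_token" type="hidden"
         value="ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=" />
</form>
"""
reported = ["/ElCkBCszxMddaqVyVIx+flkzRlM9xb7mHb3cFISlJvY=", "/no-such-page"]

result = find_sources(sample_html, reported)
print(result)
```

A match on an `input` tag rather than an `a` tag would suggest the reported path comes from a form field value and not from a real link. Tokens like this often change on every page load, so an exact match may only turn up if the HTML was saved at around the time of the crawl.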