Moz Q&A is closed.
After more than 13 years and tens of thousands of questions, Moz Q&A closed on 12th December 2024. While we're not completely removing the content (many posts will still be viewable), we have locked both new posts and new replies.
Getting Pages Requiring Login Indexed
-
Somehow, certain newspapers' webpages show up in the index yet require a login to view. My client has a whole section of the site that requires a login (registration is free), and we'd love to get that content indexed. The developer offered to remove the login requirement for specific user agents (e.g. Googlebot). I'm afraid this might get us penalized.
Any insight?
-
My guess: it's possible, but it would be an uphill battle. The reason being, Google would likely see each page as a duplicate of all the other pages on your site that show a login form. Not only does Google tend to drop duplicate pages from its index (especially if they share a duplicate title tag; more leeway is given the more unique elements you can place on a page), but you'd also face a situation where you have lots of duplicate or "thin" pages, which is juicy meat for a Panda-like penalty. Generally, you want to keep these pages out of the index, so it's a catch-22.
-
That makes sense. I am looking into whether any portion of our content can be made public in a way that would still comply with industry regulations. I am betting against it.
Does anyone know whether a page requiring login like this could feasibly rank with a strong backlink profile or a lot of quality social mentions?
-
Google likes the "First Click Free" method because they want the user to have a good result. They don't want users to click on a search result and then see something else entirely, such as a login form.
So technically, showing one set of pages to Google and another to users is considered cloaking. It's very likely that Google will figure out what's happening - whether through manual review, human search-quality raters, bounce rate, etc. - and take action against your site.
Of course, there's no guarantee this will happen, and you could argue that the cloaking wasn't done to deceive users, but the risk is high enough to warrant major consideration.
Are there any other options for displaying even part of the content, other than First Click Free? For example, can you display a snippet or a few paragraphs of the information, then require a login to see the rest? This would at least give Google something to index.
Unfortunately, most other methods for getting anything indexed without actually showing it to users would likely be considered blackhat.
Cyrus
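To illustrate the snippet-then-login idea above, here is a hypothetical article template (all markup invented for illustration): the opening paragraphs ship in the HTML for every visitor, crawlers included, and the remainder is only rendered server-side for logged-in users - so everyone sees the same page and nothing is cloaked.

```html
<article>
  <h1>Article headline</h1>
  <p>First paragraph - public and indexable…</p>
  <p>Second paragraph - public and indexable…</p>

  <!-- rendered only when the visitor is NOT logged in -->
  <p><a href="/login">Log in (free) to read the rest of this article</a></p>

  <!-- the remaining paragraphs are rendered server-side only for
       logged-in users, so Googlebot and anonymous visitors receive
       exactly the same HTML -->
</article>
```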
-
I should have read the link target:
"Subscription designation, snippets only: If First Click Free isn't a feasible option for you, we will display the "subscription" tag next to the publication name of all sources that greet our users with a subscription or registration form. This signals to our users that they may be required to register or subscribe on your site in order to access the article. This setting will only apply to Google News results.
If you prefer this option, please display a snippet of your article that is at least 80 words long and includes either an excerpt or a summary of the specific article. Since we do not permit "cloaking" -- the practice of showing Googlebot a full version of your article while showing users the subscription or registration version -- we will only crawl and display your content based on the article snippets you provide. If you currently cloak for Googlebot-news but not for Googlebot, you do not need to make any changes; Google News crawls with Googlebot and automatically uses the 80-word snippet.
NOTE: If you cloak for Googlebot, your site may be subject to Google Webmaster penalties. Please review Webmaster Guidelines to learn about best practices."
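As a practical aside, the 80-word snippet requirement quoted above is easy to automate. A hedged Python sketch (the function name and sentence-boundary heuristic are illustrative, not any official tool):

```python
def article_snippet(text, min_words=80):
    """Build an article snippet of at least `min_words` words,
    extending to the end of the current sentence when one is nearby."""
    words = text.split()
    if len(words) <= min_words:
        return text
    snippet = " ".join(words[:min_words])
    # extend to a sentence boundary if one is close after the cutoff
    rest = " ".join(words[min_words:])
    period = rest.find(". ")
    if 0 <= period < 100:
        snippet += " " + rest[: period + 1]
    return snippet
```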
-
"In order to successfully crawl your site, Google needs to be able to crawl your content without filling out a registration form. The easiest way to do this is to configure your webservers not to serve the registration page to our crawlers (when the user-agent is "Googlebot") so that Googlebot can crawl these pages successfully. You can choose to allow Googlebot access to some restricted pages but not others. More information about technical requirements."
- http://support.google.com/webmasters/bin/answer.py?hl=en&answer=74536
Is there any harm in doing this while not implementing the rest of First Click Free?
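For concreteness, the webserver change Google's help page describes might look something like the following hypothetical Apache mod_rewrite sketch - the /members/ path and loggedin cookie name are invented, and whether to do this at all without the rest of First Click Free is exactly the policy question being asked here.

```apache
# Hypothetical: redirect anonymous visitors to registration,
# but skip the redirect when the request comes from Googlebot.
RewriteEngine On
RewriteCond %{HTTP_USER_AGENT} !Googlebot [NC]
RewriteCond %{HTTP_COOKIE} !loggedin=1
RewriteRule ^members/ /register [R=302,L]
```

Note that the User-Agent header is trivially spoofed, so sites doing this usually also verify that a request really comes from Googlebot via a reverse DNS lookup rather than trusting the header alone.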
-
What would you guys think about programming the login requirement behavior in such a way that only Google can't execute it--so Google wouldn't know that it is the only one getting through?
Not sure whether this is technically possible, but if it were, would it be theoretically likely to incur a penalty? Or is it foolish for other reasons?
-
Good idea--I'll have to determine precisely what I can and cannot show publicly and see if there isn't something I can do to leverage that.
I've heard the advice to stay away from agent-specific content, but I wonder what the data show - are there any documented successful attempts?
-
First click free unfortunately won't work for us.
How might I go about determining how adult content sites handle this issue?
-
Have you considered showing only a certain proportion of each page to all visitors, search engines included? That way your pages would have some specific content that can be indexed and help you rank in the SERPs.
I have seen it done where publications behind a paywall allow only the first paragraph or two to show - just enough to get them ranked appropriately, but not enough to stop users from wanting to register to access the full articles when they find them through the SERPs, other sites, or directly.
However, whether this works depends on what the regulations you mention require - would showing a proportion of the content to everyone be OK?
I would definitely stay away from serving different content to different users if I were you, as this is likely to end up causing you trouble with the search engines.
-
I believe newspapers use a feature called "First Click Free" that enables this to work; I don't know whether it will satisfy your industry regulations, however. You may also want to look at how sites with age-restricted content, such as liquor sites, restrict viewing yet still allow indexing.
-
Understood. The login requirement is necessary for compliance with industry regulations. My question is whether I will be penalized for serving agent-specific content, and/or whether there is a better way to get these pages into the index.
-
Search engines aren't good at completing online forms (such as a login), and any content behind them may remain hidden from crawlers, so the developer's option sounds like a good solution.
You may want to read:
http://www.seomoz.org/beginners-guide-to-seo/why-search-engine-marketing-is-necessary
Related Questions
-
URL structure - Page Path vs No Page Path
We are currently rebuilding our URL structure for ecommerce websites. We have seen a lot of sites removing the page path on product pages, e.g. https://www.theiconic.co.nz/liberty-beach-blossom-shirt-680193.html versus what would normally be https://www.theiconic.co.nz/womens-clothing-tops/liberty-beach-blossom-shirt-680193.html. Should we remove the page path on product pages to keep the URL shorter, or should we keep it? I can see that we would lose the hierarchy juice to the product page, but I'm not sure what the right thing to do is.
Intermediate & Advanced SEO | Ashcastle
-
Noindex thousands of thin-content pages?
Hello all! I'm working on a site that features a service, marketed to community leaders, that lets the citizens of a community log 311-type issues such as potholes, broken streetlights, etc. The "marketing" front of the site is 10-12 pages of content to be optimized for the community-leader searchers. However, as you can imagine, there are thousands and thousands of pages of one- or two-line complaints such as "There is a pothole on Main St. and 3rd." These complaint pages are not about the service, and I suspect they're not helpful to my end goal of building awareness of the service through search among community leaders. Community leaders are searching for "311 request service", not "potholes on Main Street". Should all of these "complaint" pages be NOINDEXed? What if there are a number of quality links pointing to the complaint pages? Do I have to worry about losing Domain Authority if I NOINDEX them? Thanks for any input. Ken
Intermediate & Advanced SEO | KenSchaefer
-
Category Page as Shopping Aggregator Page
Hi, I have been reviewing the info from Google on structured data for products (https://developers.google.com/search/docs/data-types/products) and started to ponder. Here is the scenario: you have a category page that lists 8 products, and each product shows an image, a price, and a review rating. As the individual product pages are already marked up, they display rich snippets in the SERPs. I wonder how we can get rich snippets for the category page itself. Google suggests a markup for shopping aggregator pages - pages that list a single product along with information about the different sellers offering it - but nothing for categories. My question is this: can we use the shopping-aggregator markup on category pages to achieve the coveted rich results (from/to price, average reviews)? Keen to hear from anyone who has thoughts on the matter or has already tried this.
Intermediate & Advanced SEO | Alexcox6
-
Should I use noindex or robots.txt to remove pages from the Google index?
I have a Magento site and just realized we have about 800 review pages indexed. The /review directory is disallowed in robots.txt, but the pages are still indexed. From my understanding, robots.txt means the pages will not be crawled, BUT they can still be indexed if they are linked to from somewhere else. I could add the noindex tag to the review pages, but while they're blocked they won't be crawled, so the tag would never be seen. https://www.seroundtable.com/google-do-not-use-noindex-in-robots-txt-20873.html Should I remove the robots.txt disallow and add the noindex? Or just add the noindex to what I already have?
Intermediate & Advanced SEO | Tylerj
-
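To make the trade-off in the question above concrete: while /review is disallowed in robots.txt, Googlebot never fetches those pages, so it can never see a noindex tag on them. A hedged sketch of the usual fix (the /review path comes from the question; the rest is illustrative):

```
User-agent: *
# Disallow: /review/   <- removed, so crawlers can fetch the pages
#                         and see the noindex tag placed on them
```

Then, in the `<head>` of each review page, add `<meta name="robots" content="noindex">`. Once Google has recrawled the pages and dropped them from the index, the disallow could be restored if crawl budget is a concern.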
If robots.txt blocks an image (image URL), but another page that can be indexed uses this image, how is the image treated?
Hi Mozzers, this probably is a dumb question, but I have a case where robots.txt blocks an image URL, and that image is used on a page (let's call it Page A) that can be indexed. If the image on Page A has an alt tag, how is that information digested by crawlers? (A) Would Google totally ignore the image and the alt-tag information, or (B) would Google consider the alt-tag information? I am asking this because all the images on the website are blocked by robots.txt at the moment, but I would really like crawlers to pick up the alt-tag information. Chances are that I will ask the webmaster to allow indexing of images too, but I would like to understand what's happening currently. Looking forward to all your responses 🙂 Malika
Intermediate & Advanced SEO | Malika1
-
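One way to get what the question above is after - pages (and their alt text) crawlable while the image files themselves stay out of Image Search - is to scope the robots.txt block to the image crawler only. A hypothetical sketch (the /media/ path is invented):

```
# Block only Google's image crawler from the image files themselves;
# regular Googlebot can still crawl Page A and read its alt text.
User-agent: Googlebot-Image
Disallow: /media/

User-agent: *
Allow: /
```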
Better to 301 or de-index 403 pages
Google WMT recently found and flagged a large number of old, unpublished pages as access-denied errors. The pages are tagged "noindex, follow," yet they are still in Google's index. At this point, would it be better to 301 all these pages, to submit an index-removal request, or something else? Thanks... Darcy
Intermediate & Advanced SEO | 94501
-
Is it a problem to use a 301 redirect to a 404 error page, instead of serving a 404 page directly?
We are building URLs dynamically with Apache rewrite. When we detect that a URL matches certain valid patterns, we serve a script, which may then detect that the combination of parameters in the URL does not exist. When this happens, we produce a 301 redirect to another URL, which serves a 404 error page. So my doubt is the following: do I have to worry about not serving a 404 directly, but instead redirecting (301) to a 404 page? Will this cause the erroneous original URL to stay in the Google index longer than if I served a 404 directly? Some context: the site has about 200,000 web pages, and we currently have 90,000 404 errors reported in Webmaster Tools (even though only 600 were detected last month).
Intermediate & Advanced SEO | lcourse
-
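As a rough sketch of the distinction the 301-to-404 question above turns on: a 301 tells Google the original URL has moved, while a direct 404 on the requested URL tells it the URL is gone, which is usually what gets bad URLs out of the index fastest. A hypothetical routing function (not the site's actual code):

```python
def handle_request(path, valid_paths):
    """Return (status, body) for a request: serve the 404 on the
    requested URL itself rather than 301-redirecting to an error page."""
    if path in valid_paths:
        return 200, "OK"
    # Anti-pattern from the question: return 301 pointing at an error
    # URL, which then serves the 404 - the original URL lingers as a
    # redirect in the index instead of being dropped.
    return 404, "Not Found"
```

With Apache mod_rewrite, the equivalent is typically a rule with the `[R=404]` flag, which serves the error status directly on the requested URL instead of redirecting.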
Can too many "noindex" pages compared to "index" pages be a problem?
Hello, I have a question for you: our website virtualsheetmusic.com includes thousands of product pages, and due to Panda penalties in the past, we have noindexed most of them, hoping for a sort of recovery (not yet seen, though!). So currently we have about 4,000 "index" pages compared to about 80,000 "noindex" pages. Now we plan to add an additional 100,000 new product pages from a new publisher to offer our customers more music choice, and these new pages will also be marked "noindex, follow". At the end of the integration process, we will end up with something like 180,000 "noindex, follow" pages compared to about 4,000 "index, follow" pages. Here is my question: can this huge discrepancy between 180,000 "noindex" pages and 4,000 "index" pages be a problem? Can this kind of scenario cause any negative effect on our organic search profile, or is it something that doesn't actually matter? Any thoughts on this issue are very welcome. Thank you! Fabrizio
Intermediate & Advanced SEO | fablau