View unanswered posts | View active topics It is currently Sat Jul 31, 2010 2:58 am



This topic is locked, you cannot edit posts or make further replies.  [ 59 posts ]  Go to page 1, 2  Next
 Google Problem with supplemental pages 
Author Message
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post Google Problem with supplemental pages
It seems that over the past few days many webmasters have noticed a significant proportion of their pages going into the supplementa index for no apparent reason.

Googleguy has acknowledged that there is a problem, and expects it to be cleaned up within a week or so.

It might be a good idea to check your pages with a site: search to see if this is happening to you.

<added>

While this may or may not be related (it is in a the same timeframe) SEOchat forums appear to have been dumped from the google index.

_________________
Expert SEO services

Buy Cheap Used Cars

Kids bedding & Baby bedding sets


Last edited by Mel on Wed Mar 08, 2006 1:34 pm, edited 1 time in total.



Wed Mar 08, 2006 1:12 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
What I'm seeing is the majority of my pages (75%) go to a URL only partially indexed page.

Dave


Wed Mar 08, 2006 1:34 pm
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
I am seeing about 25% of one sites pages go supplemental - these are pages that have been up for years in some cases, used to rank OK and in most cases have been updated in the past few months.


Wed Mar 08, 2006 1:59 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
Mel wrote:
I am seeing about 25% of one sites pages go supplemental - these are pages that have been up for years in some cases, used to rank OK and in most cases have been updated in the past few months.


The pages that went partial have been up and indexed for for almost 2 years as well. Some with #1 rankings that are now gone.

Dave


Wed Mar 08, 2006 2:28 pm
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
Lets hope that Googleguys "fix" includes this sort of stuff, as I can see no reason that any of those pages should go supplemental - except that there have been multiple copies of at least some of these pages which have been purloined and floating around on other sites.

Could it be that these pages are being hit as duplicate pages (even though they are original and the first published) due to copies on other sites??? :?:


Wed Mar 08, 2006 2:48 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
Mel wrote:
Lets hope that Googleguys "fix" includes this sort of stuff, as I can see no reason that any of those pages should go supplemental - except that there have been multiple copies of at least some of these pages which have been purloined and floating around on other sites.

Could it be that these pages are being hit as duplicate pages (even though they are original and the first published) due to copies on other sites??? :?:


Interesting thought. I believe that a filter for duplicate content is applied at intervals and not continuously. Perhaps this is time for that cycle and the results aren't quite what was expected.

Mel, are you aware of any duplicated content that has not slipped to supplemental whereas the original has?


Yes, lets hope they "fix" this soon. Given the blunders this past week by the CFO, one has to wonder if Google knows how Google is supposed to work.

Dave


Wed Mar 08, 2006 3:19 pm
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
I am going to carve out some time to compare the pages that have slipped vs. those that haven't in the next day or so.


Wed Mar 08, 2006 3:41 pm
Profile WWW
Site Admin
User avatar
Site Admin

Joined: Sun Dec 07, 2003 3:10 am
Posts: 8429
Location: On a Mountain
Post 
One thing being seen from the BigDaddy changes is that many pages that were 301'd are ending up in the supplemental index en masse.

_________________
SEO Proposals and the Link Hound
Agrarian Ruminations & Recollections


Wed Mar 08, 2006 6:22 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
I just checked 50+ IP's/DC's and found that 19 of them are having "issues" when it comes to my site.

In those 19 IP's/DC's only 91-96 out of 643 pages are fully indexed and cached with the rest of the pages being URL only partially indexed.

These 19 IP's/DC's are...

64.233.189.104
216.239.53.99
216.239.57.99
216.239.59.99
66.102.11.99
66.102.9.99
66.102.7.99
216.239.59.104
216.239.53.104
66.102.11.104
66.102.9.104
66.102.7.104
216.239.57.98
216.239.57.105
216.239.59.105
66.102.7.105
66.102.7.147
216.239.57.147
216.239.57.98

On the surface, it appears if they're trying to rebuild their index from scratch.

Dave


Wed Mar 08, 2006 7:15 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
I went through and checked several of the top ranked sites for KP's I follow on some of the IP's/DC's I listed above. All of the sites had scores of pages placed in the supplemental index.

<edit to add>

This appears to be a far more reaching issue than possible problems with duplicate content filtering or 301 pages. Far too many pages being placed in the supplemental index from what I see.

</edit>

Dave


Wed Mar 08, 2006 10:45 pm
Profile WWW
Member

Joined: Fri Feb 24, 2006 10:12 pm
Posts: 39
Post 
This might be a newbie question, but what does it mean for the site to go into supplemental index?


Thu Mar 09, 2006 12:19 am
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
The supplemental index is the very last place Google looks for pages to satisfy a query. First they go to the short barrels, if they find enough ranking pool candidates there (originally they were looking for 40,000 pages but it may well be more now) then they look no further and proceed to rank those pages for the query.

If they don't find enough there, then they go to the long barrels for ranking pool candidates.

Only if they don't find enough pages from the combination of the short and long barrels do they try the supplemental index.

In a nutshell the only thing any supplemental pages ever rank for are strage and usual queries.


Thu Mar 09, 2006 1:11 am
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
OK I have had a bit of time to look into this problem on one site and find in this case that all the pages which are in the supplemental index are old pages that do not actually exist any longer (due to a site reorganization eight months ago) and which were 301'd to a new location with new content in most cases.

This confirms WilliamC comments about Big Daddy putting 301'd pages into the supplemental index.

I have verified that clicking on the supplemental listed link does indeed take you to the new page and that it does return a proper 301 response, so it looks like this is a case of Google not handling 301's correctly.

Quote:
#1 Server Response: http://www.xxxxxx.xxx
HTTP Status Code: HTTP/1.1 301 Moved Permanently
Date: Thu, 09 Mar 2006 06:14:10 GMT
Server: Apache/1.3.33 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 PHP/4.3.10 FrontPage/5.0.2.2635 mod_ssl/2.8.22 OpenSSL/0.9.6b
X-Powered-By: PHP/4.3.10
Location: http://xxxxx.xxx
Connection: close
Content-Type: text/html
Redirect Target: http://xxxxx.xxx

#2 Server Response: http://xxxxx.xxx
HTTP Status Code: HTTP/1.1 200 OK
Date: Thu, 09 Mar 2006 06:14:11 GMT
Server: Apache/1.3.33 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 PHP/4.3.10 FrontPage/5.0.2.2635 mod_ssl/2.8.22 OpenSSL/0.9.6b
X-Powered-By: PHP/4.3.10
Connection: close
Content-Type: text/html



Thu Mar 09, 2006 6:21 am
Profile WWW
Full member

Joined: Sat Aug 07, 2004 3:06 am
Posts: 432
Location: Australia
Post 
Post by Googleguy on Webmaster World:
Quote:
Hey, I wanted to stop by and give an update. I think we found what the immediate issue was. Future indexing should start picking up most affected people's pages again, though it may take a few more days for it to be visible. I'll check in again after the weekend is over to see if most people are seeing pages returning.

_________________
search engine marketing | online coupons


Thu Mar 09, 2006 6:45 am
Profile
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
Thats good news Timbo, but I guess the big question is why the hell did it happen in the first place.

Don't they test out these changes before they go live with them???


Thu Mar 09, 2006 6:54 am
Profile WWW
Full member

Joined: Sat Aug 07, 2004 3:06 am
Posts: 432
Location: Australia
Post 
Mel wrote:
Don't they test out these changes before they go live with them???

Probably the work experience guy trying to be proactive :D


Thu Mar 09, 2006 7:02 am
Profile
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
Mel wrote:
OK I have had a bit of time to look into this problem on one site and find in this case that all the pages which are in the supplemental index are old pages that do not actually exist any longer (due to a site reorganization eight months ago) and which were 301'd to a new location with new content in most cases.

This confirms WilliamC comments about Big Daddy putting 301'd pages into the supplemental index.

I have verified that clicking on the supplemental listed link does indeed take you to the new page and that it does return a proper 301 response, so it looks like this is a case of Google not handling 301's correctly.


This appears to only be part of the problem though. I'm seeing hundreds of pages never 301'd that are returning the supplemental or URL only results.

Could this be a case of the primary pages in the short barrel being dropped or lost and thus the supplemental pages are what are being used to fill in the results?

Dave


Thu Mar 09, 2006 1:48 pm
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
I personally think that this looks like the Big Daddy update ran into some indexing problems and they are in the process of rebuilding the index, but then why not just revert back to an old index temporarily?

I should clarify that on the site the I investigated the old pages which should have been dropped after having been 301'd for so long were instead put into the supplemental index, but the new pages are fine. This make my suspicions of indexing problems stronger.


Thu Mar 09, 2006 2:38 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
Mel wrote:
I personally think that this looks like the Big Daddy update ran into some indexing problems and they are in the process of rebuilding the index, but then why not just revert back to an old index temporarily?


Good question. Since the problem is only on select DC's, one would think they'd only display the results of the ones that are unaffected. It appears they may be trying to do this to a point.

Second, I suspect, for whatever reason, they don't have the old index to revert back to, or are unable to do so in the affected DC's and the only answer is to rebuild the indicies on those DC's from scratch.

Dave


Thu Mar 09, 2006 2:49 pm
Profile WWW
Site Admin
User avatar
Site Admin

Joined: Sun Dec 07, 2003 3:10 am
Posts: 8429
Location: On a Mountain
Post 
CrankyDave wrote:
Second, I suspect, for whatever reason, they don't have the old index to revert back to, or are unable to do so in the affected DC's and the only answer is to rebuild the indicies on those DC's from scratch.


Thanks for confirming the 301 thing Mel.

I will throw another supposition into the mix. Maybe BigDaddy needed more data from the pages than old indices stored. Or maybe they didn't store them in a way that can be fully used by the BD changes. That would require an actual full rebuild of all internal data if BD handles things using different/more data than before.


Thu Mar 09, 2006 3:10 pm
Profile WWW
Professional / Mod
Professional / Mod

Joined: Wed Sep 03, 2003 7:18 am
Posts: 8366
Location: Malaysia
Post 
Yes but I think they did a total rebuild with a different addressing scheme (64bits) some time back and that went smooth as silk.


Thu Mar 09, 2006 3:12 pm
Profile WWW
Full member

Joined: Mon Dec 05, 2005 9:30 pm
Posts: 231
Location: Orlando
Post 
I think WilliamC might make a good point. Especially with trying to fix the canonical issues and maybe even pulling in more information about links and relavency of those links. who knows. let's just hope it gets fixed soon so we can all see where we are going to stand.

_________________
Prefabricated Home Manufacturer ..... Steel-toe shoes ..... deluge shower - emergency shower


Thu Mar 09, 2006 4:29 pm
Profile WWW
Full member

Joined: Sat Aug 07, 2004 3:06 am
Posts: 432
Location: Australia
Post 
I have used the WebBug application to get server headers from the sites of mine which have been hit with a supp index and/or had their cache date set to Jul 2005.

WebBug returned this:

HTTP/1.0 200 OK
Age: 14901
Accept-Ranges: bytes
Date: Thu, 09 Mar 2006 19:18:27 GMT
Content-Length: 639
Content-Type: text/html
Server: Apache
Last-Modified: Tue, 26 Jul 2005 05:30:16 GMT

26th July is the cache date on the sites. The servers don't currently return the Last-Modified date but it seems that it is returning this Jul 05 date by default.

A post on Webmaster World seemed to indicate that it was an issue to do with the HTTP protocol retrurned. Webbug was using HTTP 1.0.


Fri Mar 10, 2006 1:03 am
Profile
Full member

Joined: Thu Apr 08, 2004 11:11 am
Posts: 223
Location: Mid Sussex, UK
Post 
theres a interesting article by Aaron wall on the topic

http://www.seobook.com/archives/001545.shtml

_________________
Advent Support


Fri Mar 10, 2006 1:07 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
From my little corner of the universe, it appears that Google is trying to keep the afflicted DC's out of the rotation as the default (.com) DC. If this is true, they're at least taking a positive step while they sort this out.

If you do a search for St. Roch Medals, my sites product page will show up as #1 on the DC's that are not affected and not at all on the ones that are.

<edit to add>
I'd be curious to know where the site shows up for the above term on your computer to see if indeed Google is keeping the "bad" DC's from going live.
</edit>

Dave


Fri Mar 10, 2006 1:42 pm
Profile WWW
Site Admin
User avatar
Site Admin

Joined: Sun Dec 07, 2003 3:10 am
Posts: 8429
Location: On a Mountain
Post 
Cheester wrote:
theres a interesting article by Aaron wall on the topic

http://www.seobook.com/archives/001545.shtml


I see he completely missed the 301 issue


Fri Mar 10, 2006 5:14 pm
Profile WWW
Moderator
User avatar
Moderator

Joined: Mon Dec 12, 2005 6:40 pm
Posts: 2873
Location: Playing with fire!
Post 
I had thought (hoped) that Google may have been trying to keep the affeced DC's out of the rotation from going live. This is not the case.

FTR, none of my pages have ever been 301'd and it is affecting new pages as well as old ones. The one difference in my case is that pages are not showing as supplemental but as partially indexed URL only.

I suspect that the reason for this is that that is what Google has in their supplemental index for my site. The URL only, therefore nothing to display as supplemental.

Any thought as to this affecting dynamic pages rather than static?

Dave


Sat Mar 11, 2006 2:10 pm
Profile WWW
Founder
Founder

Joined: Thu Nov 21, 2002 1:22 am
Posts: 11147
Post 
Unfortunately, this site has succumbed to the Supplemental syndrome. All of the pages, except the home page, are now Supplemental.

There are no 301s or 302s anywhere in the site, and, although some articles have been reproduced in may places, most of the site's pages could not fall foul of a duplication issue. The main site's pages could fall foul of a duplication issue, due to the Printable versions, but the forum's pages couldn't be seen as duplicates. So, if the problem has a single cause, then I can't see it being 301s, 302s, or duplication.

_________________
PhilC
Hidden Text
Search Engine Optimization articles and tools :: PageRank explained


Sat Mar 11, 2006 6:13 pm
Profile WWW
Founder
Founder

Joined: Thu Nov 21, 2002 1:22 am
Posts: 11147
Post 
Hi Grinler - nice to see you here :)

I understand what you mean about pages already being Supplemental, but didn't show until there were very few non-supplemental pages. I've seen that happen before. But I don't think it's the case with this site, so it probably isn't what's happening.


Sat Mar 11, 2006 6:29 pm
Profile WWW
Site Admin
User avatar
Site Admin

Joined: Sun Dec 07, 2003 3:10 am
Posts: 8429
Location: On a Mountain
Post 
I am seeing a lot of forums affected, so possibly another facet may be the way forums do a wee bit of cloaking, or the possibility for duped content via diff url structures.


Sat Mar 11, 2006 7:53 pm
Profile WWW
Display posts from previous:  Sort by  
This topic is locked, you cannot edit posts or make further replies.   [ 59 posts ]  Go to page 1, 2  Next

Who is online

Users browsing this forum: No registered users and 0 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
Latest Blogs
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group.
Designed by STSoftware for PTF

phpBB SEO