| Author |
Message |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
 Google Problem with supplemental pages
It seems that over the past few days many webmasters have noticed a significant proportion of their pages going into the supplementa index for no apparent reason.
Googleguy has acknowledged that there is a problem, and expects it to be cleaned up within a week or so.
It might be a good idea to check your pages with a site: search to see if this is happening to you.
<added>
While this may or may not be related (it is in a the same timeframe) SEOchat forums appear to have been dumped from the google index.
_________________ Expert SEO services
Buy Cheap Used Cars
Kids bedding & Baby bedding sets
Last edited by Mel on Wed Mar 08, 2006 1:34 pm, edited 1 time in total.
|
| Wed Mar 08, 2006 1:12 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
What I'm seeing is the majority of my pages (75%) go to a URL only partially indexed page.
Dave
|
| Wed Mar 08, 2006 1:34 pm |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
I am seeing about 25% of one sites pages go supplemental - these are pages that have been up for years in some cases, used to rank OK and in most cases have been updated in the past few months.
|
| Wed Mar 08, 2006 1:59 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
Mel wrote: I am seeing about 25% of one sites pages go supplemental - these are pages that have been up for years in some cases, used to rank OK and in most cases have been updated in the past few months.
The pages that went partial have been up and indexed for for almost 2 years as well. Some with #1 rankings that are now gone.
Dave
|
| Wed Mar 08, 2006 2:28 pm |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
Lets hope that Googleguys "fix" includes this sort of stuff, as I can see no reason that any of those pages should go supplemental - except that there have been multiple copies of at least some of these pages which have been purloined and floating around on other sites.
Could it be that these pages are being hit as duplicate pages (even though they are original and the first published) due to copies on other sites??? 
|
| Wed Mar 08, 2006 2:48 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
Mel wrote: Lets hope that Googleguys "fix" includes this sort of stuff, as I can see no reason that any of those pages should go supplemental - except that there have been multiple copies of at least some of these pages which have been purloined and floating around on other sites. Could it be that these pages are being hit as duplicate pages (even though they are original and the first published) due to copies on other sites??? 
Interesting thought. I believe that a filter for duplicate content is applied at intervals and not continuously. Perhaps this is time for that cycle and the results aren't quite what was expected.
Mel, are you aware of any duplicated content that has not slipped to supplemental whereas the original has?
Yes, lets hope they "fix" this soon. Given the blunders this past week by the CFO, one has to wonder if Google knows how Google is supposed to work.
Dave
|
| Wed Mar 08, 2006 3:19 pm |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
I am going to carve out some time to compare the pages that have slipped vs. those that haven't in the next day or so.
|
| Wed Mar 08, 2006 3:41 pm |
|
 |
|
WilliamC
Site Admin
Joined: Sun Dec 07, 2003 3:10 am Posts: 8429 Location: On a Mountain
|
One thing being seen from the BigDaddy changes is that many pages that were 301'd are ending up in the supplemental index en masse.
_________________ SEO Proposals and the Link Hound Agrarian Ruminations & Recollections
|
| Wed Mar 08, 2006 6:22 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
I just checked 50+ IP's/DC's and found that 19 of them are having "issues" when it comes to my site.
In those 19 IP's/DC's only 91-96 out of 643 pages are fully indexed and cached with the rest of the pages being URL only partially indexed.
These 19 IP's/DC's are...
64.233.189.104
216.239.53.99
216.239.57.99
216.239.59.99
66.102.11.99
66.102.9.99
66.102.7.99
216.239.59.104
216.239.53.104
66.102.11.104
66.102.9.104
66.102.7.104
216.239.57.98
216.239.57.105
216.239.59.105
66.102.7.105
66.102.7.147
216.239.57.147
216.239.57.98
On the surface, it appears if they're trying to rebuild their index from scratch.
Dave
|
| Wed Mar 08, 2006 7:15 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
I went through and checked several of the top ranked sites for KP's I follow on some of the IP's/DC's I listed above. All of the sites had scores of pages placed in the supplemental index.
<edit to add>
This appears to be a far more reaching issue than possible problems with duplicate content filtering or 301 pages. Far too many pages being placed in the supplemental index from what I see.
</edit>
Dave
|
| Wed Mar 08, 2006 10:45 pm |
|
 |
|
Tsyle
Member
Joined: Fri Feb 24, 2006 10:12 pm Posts: 39
|
This might be a newbie question, but what does it mean for the site to go into supplemental index?
|
| Thu Mar 09, 2006 12:19 am |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
The supplemental index is the very last place Google looks for pages to satisfy a query. First they go to the short barrels, if they find enough ranking pool candidates there (originally they were looking for 40,000 pages but it may well be more now) then they look no further and proceed to rank those pages for the query.
If they don't find enough there, then they go to the long barrels for ranking pool candidates.
Only if they don't find enough pages from the combination of the short and long barrels do they try the supplemental index.
In a nutshell the only thing any supplemental pages ever rank for are strage and usual queries.
|
| Thu Mar 09, 2006 1:11 am |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
OK I have had a bit of time to look into this problem on one site and find in this case that all the pages which are in the supplemental index are old pages that do not actually exist any longer (due to a site reorganization eight months ago) and which were 301'd to a new location with new content in most cases.
This confirms WilliamC comments about Big Daddy putting 301'd pages into the supplemental index.
I have verified that clicking on the supplemental listed link does indeed take you to the new page and that it does return a proper 301 response, so it looks like this is a case of Google not handling 301's correctly.
Quote: #1 Server Response: http://www.xxxxxx.xxxHTTP Status Code: HTTP/1.1 301 Moved Permanently Date: Thu, 09 Mar 2006 06:14:10 GMT Server: Apache/1.3.33 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 PHP/4.3.10 FrontPage/5.0.2.2635 mod_ssl/2.8.22 OpenSSL/0.9.6b X-Powered-By: PHP/4.3.10 Location: http://xxxxx.xxxConnection: close Content-Type: text/html Redirect Target: http://xxxxx.xxx#2 Server Response: http://xxxxx.xxxHTTP Status Code: HTTP/1.1 200 OK Date: Thu, 09 Mar 2006 06:14:11 GMT Server: Apache/1.3.33 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 PHP/4.3.10 FrontPage/5.0.2.2635 mod_ssl/2.8.22 OpenSSL/0.9.6b X-Powered-By: PHP/4.3.10 Connection: close Content-Type: text/html
|
| Thu Mar 09, 2006 6:21 am |
|
 |
|
timbo
Full member
Joined: Sat Aug 07, 2004 3:06 am Posts: 432 Location: Australia
|
Post by Googleguy on Webmaster World:
Quote: Hey, I wanted to stop by and give an update. I think we found what the immediate issue was. Future indexing should start picking up most affected people's pages again, though it may take a few more days for it to be visible. I'll check in again after the weekend is over to see if most people are seeing pages returning.
_________________ search engine marketing | online coupons
|
| Thu Mar 09, 2006 6:45 am |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
Thats good news Timbo, but I guess the big question is why the hell did it happen in the first place.
Don't they test out these changes before they go live with them???
|
| Thu Mar 09, 2006 6:54 am |
|
 |
|
timbo
Full member
Joined: Sat Aug 07, 2004 3:06 am Posts: 432 Location: Australia
|
Mel wrote: Don't they test out these changes before they go live with them???
Probably the work experience guy trying to be proactive 
|
| Thu Mar 09, 2006 7:02 am |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
Mel wrote: OK I have had a bit of time to look into this problem on one site and find in this case that all the pages which are in the supplemental index are old pages that do not actually exist any longer (due to a site reorganization eight months ago) and which were 301'd to a new location with new content in most cases.
This confirms WilliamC comments about Big Daddy putting 301'd pages into the supplemental index.
I have verified that clicking on the supplemental listed link does indeed take you to the new page and that it does return a proper 301 response, so it looks like this is a case of Google not handling 301's correctly.
This appears to only be part of the problem though. I'm seeing hundreds of pages never 301'd that are returning the supplemental or URL only results.
Could this be a case of the primary pages in the short barrel being dropped or lost and thus the supplemental pages are what are being used to fill in the results?
Dave
|
| Thu Mar 09, 2006 1:48 pm |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
I personally think that this looks like the Big Daddy update ran into some indexing problems and they are in the process of rebuilding the index, but then why not just revert back to an old index temporarily?
I should clarify that on the site the I investigated the old pages which should have been dropped after having been 301'd for so long were instead put into the supplemental index, but the new pages are fine. This make my suspicions of indexing problems stronger.
|
| Thu Mar 09, 2006 2:38 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
Mel wrote: I personally think that this looks like the Big Daddy update ran into some indexing problems and they are in the process of rebuilding the index, but then why not just revert back to an old index temporarily?
Good question. Since the problem is only on select DC's, one would think they'd only display the results of the ones that are unaffected. It appears they may be trying to do this to a point.
Second, I suspect, for whatever reason, they don't have the old index to revert back to, or are unable to do so in the affected DC's and the only answer is to rebuild the indicies on those DC's from scratch.
Dave
|
| Thu Mar 09, 2006 2:49 pm |
|
 |
|
WilliamC
Site Admin
Joined: Sun Dec 07, 2003 3:10 am Posts: 8429 Location: On a Mountain
|
CrankyDave wrote: Second, I suspect, for whatever reason, they don't have the old index to revert back to, or are unable to do so in the affected DC's and the only answer is to rebuild the indicies on those DC's from scratch.
Thanks for confirming the 301 thing Mel.
I will throw another supposition into the mix. Maybe BigDaddy needed more data from the pages than old indices stored. Or maybe they didn't store them in a way that can be fully used by the BD changes. That would require an actual full rebuild of all internal data if BD handles things using different/more data than before.
|
| Thu Mar 09, 2006 3:10 pm |
|
 |
|
Mel
Professional / Mod
Joined: Wed Sep 03, 2003 7:18 am Posts: 8366 Location: Malaysia
|
Yes but I think they did a total rebuild with a different addressing scheme (64bits) some time back and that went smooth as silk.
|
| Thu Mar 09, 2006 3:12 pm |
|
 |
|
dmbjeep
Full member
Joined: Mon Dec 05, 2005 9:30 pm Posts: 231 Location: Orlando
|
I think WilliamC might make a good point. Especially with trying to fix the canonical issues and maybe even pulling in more information about links and relavency of those links. who knows. let's just hope it gets fixed soon so we can all see where we are going to stand.
_________________ Prefabricated Home Manufacturer ..... Steel-toe shoes ..... deluge shower - emergency shower
|
| Thu Mar 09, 2006 4:29 pm |
|
 |
|
timbo
Full member
Joined: Sat Aug 07, 2004 3:06 am Posts: 432 Location: Australia
|
I have used the WebBug application to get server headers from the sites of mine which have been hit with a supp index and/or had their cache date set to Jul 2005.
WebBug returned this:
HTTP/1.0 200 OK
Age: 14901
Accept-Ranges: bytes
Date: Thu, 09 Mar 2006 19:18:27 GMT
Content-Length: 639
Content-Type: text/html
Server: Apache
Last-Modified: Tue, 26 Jul 2005 05:30:16 GMT
26th July is the cache date on the sites. The servers don't currently return the Last-Modified date but it seems that it is returning this Jul 05 date by default.
A post on Webmaster World seemed to indicate that it was an issue to do with the HTTP protocol retrurned. Webbug was using HTTP 1.0.
|
| Fri Mar 10, 2006 1:03 am |
|
 |
|
Cheester
Full member
Joined: Thu Apr 08, 2004 11:11 am Posts: 223 Location: Mid Sussex, UK
|
theres a interesting article by Aaron wall on the topic
http://www.seobook.com/archives/001545.shtml
_________________ Advent Support
|
| Fri Mar 10, 2006 1:07 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
From my little corner of the universe, it appears that Google is trying to keep the afflicted DC's out of the rotation as the default (.com) DC. If this is true, they're at least taking a positive step while they sort this out.
If you do a search for St. Roch Medals, my sites product page will show up as #1 on the DC's that are not affected and not at all on the ones that are.
<edit to add>
I'd be curious to know where the site shows up for the above term on your computer to see if indeed Google is keeping the "bad" DC's from going live.
</edit>
Dave
|
| Fri Mar 10, 2006 1:42 pm |
|
 |
|
WilliamC
Site Admin
Joined: Sun Dec 07, 2003 3:10 am Posts: 8429 Location: On a Mountain
|
I see he completely missed the 301 issue
|
| Fri Mar 10, 2006 5:14 pm |
|
 |
|
CrankyDave
Moderator
Joined: Mon Dec 12, 2005 6:40 pm Posts: 2873 Location: Playing with fire!
|
I had thought (hoped) that Google may have been trying to keep the affeced DC's out of the rotation from going live. This is not the case.
FTR, none of my pages have ever been 301'd and it is affecting new pages as well as old ones. The one difference in my case is that pages are not showing as supplemental but as partially indexed URL only.
I suspect that the reason for this is that that is what Google has in their supplemental index for my site. The URL only, therefore nothing to display as supplemental.
Any thought as to this affecting dynamic pages rather than static?
Dave
|
| Sat Mar 11, 2006 2:10 pm |
|
 |
|
PhilC
Founder
Joined: Thu Nov 21, 2002 1:22 am Posts: 11147
|
Unfortunately, this site has succumbed to the Supplemental syndrome. All of the pages, except the home page, are now Supplemental.
There are no 301s or 302s anywhere in the site, and, although some articles have been reproduced in may places, most of the site's pages could not fall foul of a duplication issue. The main site's pages could fall foul of a duplication issue, due to the Printable versions, but the forum's pages couldn't be seen as duplicates. So, if the problem has a single cause, then I can't see it being 301s, 302s, or duplication.
_________________ PhilC
Hidden Text
Search Engine Optimization articles and tools :: PageRank explained
|
| Sat Mar 11, 2006 6:13 pm |
|
 |
|
PhilC
Founder
Joined: Thu Nov 21, 2002 1:22 am Posts: 11147
|
Hi Grinler - nice to see you here
I understand what you mean about pages already being Supplemental, but didn't show until there were very few non-supplemental pages. I've seen that happen before. But I don't think it's the case with this site, so it probably isn't what's happening.
|
| Sat Mar 11, 2006 6:29 pm |
|
 |
|
WilliamC
Site Admin
Joined: Sun Dec 07, 2003 3:10 am Posts: 8429 Location: On a Mountain
|
I am seeing a lot of forums affected, so possibly another facet may be the way forums do a wee bit of cloaking, or the possibility for duped content via diff url structures.
|
| Sat Mar 11, 2006 7:53 pm |
|
 |
|