Friday, September 17, 2004

Yahoo Slurp Garbling Mac Encoded Pages

In a recent WebProWorld thread, a member was wondering why his site was showing weird hieroglyphic (mostly Chinese) characters in its result abstracts. Searching for his site in Yahoo showed some of the strangest crap that I have ever seen. Another site of his was showing the same strange characters.



The one common denominator was the character encoding of the pages: they are all served as text/html;charset=macintosh. This encoding makes the Yahoo index choke. It chokes so badly that when you click on the cache link for any of these pages, Yahoo returns an error. The error simply states, "We're sorry, but we could not process your request for the cache of http://www.xxxxx.com/. Please click here to check the current page."
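For anyone curious what handling that header correctly might look like, here is a minimal Python sketch. It is only an illustration and says nothing about how Yahoo's crawler is actually built. The helper names charset_from_content_type and decode_page are made up for this example, and the fallback to latin-1 for unknown labels is my own assumption.

```python
import codecs
from email.message import Message


def charset_from_content_type(content_type, default="utf-8"):
    """Pull the charset parameter out of a Content-Type value."""
    msg = Message()
    msg["Content-Type"] = content_type
    # get_content_charset() returns the charset lowercased, or None if absent.
    return msg.get_content_charset() or default


def decode_page(raw, content_type):
    """Decode raw page bytes using the charset the page declares."""
    charset = charset_from_content_type(content_type)
    try:
        codecs.lookup(charset)  # Python knows "macintosh" as an alias of mac_roman
    except LookupError:
        charset = "latin-1"     # assumed fallback: decode byte-for-byte
    return raw.decode(charset, errors="replace")


# 0x8E is "e with acute accent" in the Mac encoding.
page_text = decode_page(b"caf\x8e", "text/html;charset=macintosh")
```

The point is simply that the declared charset has to be looked up and honored before the bytes are turned into text; skip that step and everything above plain ASCII comes out wrong.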



My browser was showing the encoding of these pages as Western European (Mac), which is not a problem for the browser (I am running IE 6). It was not a problem for the Lynx text browser either. It is, however, a problem for Yahoo Search, and what Yahoo is showing is not a Western European character set that I am familiar with.
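I do not know what Yahoo is really doing with these bytes, but the general failure is easy to reproduce. The short Python sketch below shows two ways text turns to garbage when bytes are decoded with the wrong encoding. The sample strings and the choice of wrong encodings (cp1252 and UTF-16) are my own guesses for illustration, not a diagnosis of the Yahoo index.

```python
# Text as a Mac-encoded page would store it.
text = "café résumé"
raw = text.encode("mac_roman")

# Decoded with the declared encoding, the text survives intact.
correct = raw.decode("mac_roman")

# Decoded as Windows Western European instead, every accented letter is wrong.
wrong_cp1252 = raw.decode("cp1252", errors="replace")

# Plain ASCII bytes read as 16-bit characters land in the CJK range,
# which is one way Chinese-looking characters can show up.
wrong_utf16 = b"Yahoo!".decode("utf-16-be")

print(correct)
print(wrong_cp1252)
print(wrong_utf16)
```

Run on other inputs, the second case turns every pair of ordinary letters into a single character that looks Chinese, which at least resembles what the result abstracts were showing.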
