You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2004/03/05 05:36:35 UTC

DO NOT REPLY [Bug 27394] - If the special characters (①, ②, etc.) of WINDOWS are taken, a space may enter behind a character.

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=27394>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=27394

If the special characters (①, ②, etc.) of WINDOWS are taken, a space may enter behind a character.





------- Additional Comments From zalewski@optonline.net  2004-03-05 04:36 -------
I am not sure which characters are being reported as broken. But I have noticed 
a similar problem:

1.  Using Excel, set a sheet name with one of the non-ISO 8859-1 characters in 
Windows ANSII Code Page 1252. For example, this includes s caron and z caron 
(but not a caron). Caron is the little cap diacritic drawn above the character. 
Don't use any non CP 1252 characters.

2. Try to read the sheet name from POI. The character will show as a little 
square (because it is an invalid code point).

3. If you use Biff Viewer, you will see that the sheet name has been stored by 
Excel as a compressed 8 bit string.

4. If you look at the string that HSSF returns for the sheet name, you will see 
that, for example, lower case z caron is converted by HSSF into \u009e, which 
is actually not a valid code point. The proper representation of z caron is 
\u0173.

When I read the original report, I cannot tell which 'special characters' are 
being used. (They appear as little boxes in my browser). I suspect that 
the 'special characters' may be those in ASCII range 0x80 - 0x9f, which cannot 
be translated to Unicode simply by assuming that the high order byte is zero.

Also a question for sasaki: He should report what his windows code page is (if 
it is not 1252). Attaching a sample spreadsheet would probably help.

Did I even get this right? The problem is that a special character is entered 
inside a cell, with Excel. When HSSF reads the value of that cell, it displays 
the character as a little box, or possibly a space.

---------------------------------------------------------------------
To unsubscribe, e-mail: poi-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: poi-dev-help@jakarta.apache.org