You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@poi.apache.org by bu...@apache.org on 2009/01/27 22:13:56 UTC

DO NOT REPLY [Bug 46610] [PATCH] Problems accessing documents containing unicode

https://issues.apache.org/bugzilla/show_bug.cgi?id=46610


Benjamin Engele <be...@polarion.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
            Summary|java.lang.IllegalArgumentExc|[PATCH] Problems  accessing
                   |eption: The end (272) must  |documents containing unicode
                   |not be before the start     |
                   |(300)                       |




--- Comment #6 from Benjamin Engele <be...@polarion.com>  2009-01-27 13:13:54 PST ---
Patch for Exception triggered by utf2.doc doesn't resolve all problems with
utf2.doc: The last paragraph is misplaced. This is happens because of another
error in translating byte positions from FormatedDiskPage to char positions in
the TextPiece.

Some more notes:
Writing wasn't tested and not changed. It is probably now more broken then it
was before. BytePropertyNode.getStartBytes() and getEndBytes() definitely needs
to be fixed, they still uses the wrong approach to calculate the byte index
from the char index.

IMHO BytePropertyNode.isUnicode() should be removed as soon as
get[Start/End]Bytes() has been fixed. Don't think the information that the tart
of the node is in a unicode text piece is useful.


-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org