Posted to dev@poi.apache.org by bu...@apache.org on 2010/06/15 10:02:36 UTC
DO NOT REPLY [Bug 49441] New: Wrong CharSet
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
Summary: Wrong CharSet
Product: POI
Version: 3.6
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P2
Component: HSMF
AssignedTo: dev@poi.apache.org
ReportedBy: aristar@mail.ru
If a message uses an encoding other than Cp1252, StringChunk returns an
incorrect value. To solve this issue we added a setCharset method to
MAPIMessage, Chunks and StringChunk.
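
To illustrate the symptom, here is a minimal, self-contained sketch that does
not use POI at all. It assumes the affected text was stored as windows-1251
(Cyrillic) bytes; the class and method names are hypothetical and exist only
for this example.

```java
import java.nio.charset.Charset;

class CharsetMismatchDemo {
    // Decode raw 8-bit string bytes using the named charset.
    static String decode(byte[] raw, String charsetName) {
        return new String(raw, Charset.forName(charsetName));
    }

    public static void main(String[] args) {
        // Six bytes spelling the Russian word for "hello" in windows-1251
        // (assumed sample data, not taken from the attached message).
        byte[] raw = {
            (byte) 0xCF, (byte) 0xF0, (byte) 0xE8,
            (byte) 0xE2, (byte) 0xE5, (byte) 0xF2
        };

        // Decoding with a hardcoded Cp1252 yields accented Latin letters.
        System.out.println(decode(raw, "windows-1252"));
        // Decoding with the charset the bytes were written in yields Cyrillic.
        System.out.println(decode(raw, "windows-1251"));
    }
}
```

The bytes are identical in both calls; only the charset used to interpret them
changes, which is why a settable charset is enough to fix the output.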
--
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
--- Comment #5 from Dmitry <ar...@mail.ru> 2010-07-06 03:57:19 EDT ---
I tried on a system with an English localization. If the Language for
Non-Unicode Programs is not set to Russian, the message opens with wrong
characters.
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
--- Comment #4 from Nick Burch <ni...@alfresco.com> 2010-06-29 09:30:33 EDT ---
Unfortunately I can't seem to spot anything in the file which indicates the
encoding.
If you open the file on a different machine which has a different system
language set, does it look correct or do you get the wrong characters showing
up?
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
--- Comment #3 from Dmitry <ar...@mail.ru> 2010-06-16 01:22:18 EDT ---
I haven't tried versions other than 3.6, but in the head revision (from SVN)
StringChunk has Cp1252 hardcoded. (1251 is Cyrillic.)
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
--- Comment #2 from Dmitry <ar...@mail.ru> 2010-06-16 01:21:18 EDT ---
Created an attachment (id=25594)
--> (https://issues.apache.org/bugzilla/attachment.cgi?id=25594)
Message with cyrillic fields
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
Nick Burch <ni...@alfresco.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |NEEDINFO
--- Comment #1 from Nick Burch <ni...@alfresco.com> 2010-06-15 09:48:13 EDT ---
Have you tried with a recent nightly build? HSMF has undergone a lot of changes
since 3.6.
For String chunks which aren't stored as Unicode, we assume they're CP1252,
based on all the files we've seen. Outlook should generally store them as one
of those two. If you have found a file that differs, please do upload it, and
also please help us track down where in the file that charset is stored!
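
For readers following along, the assumption described above can be sketched
roughly as follows. This is not POI's StringChunk; it is a hypothetical
stand-in that only shows the two cases (a Unicode string decoded as UTF-16LE,
and an 8-bit string decoded with a CP1252 default that a caller may override).
The setCharset name is borrowed from the reporter's proposal, and the two type
constants are the MAPI property types for 8-bit and Unicode strings.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class StringChunkSketch {
    // MAPI property types used for string data.
    static final int TYPE_STRING8 = 0x001E; // 8-bit string, code page not stored with it
    static final int TYPE_UNICODE = 0x001F; // UTF-16LE string

    private final int type;
    private final byte[] raw;
    // Default mirrors the CP1252 assumption; it only applies to 8-bit strings.
    private Charset charset8Bit = Charset.forName("windows-1252");

    StringChunkSketch(int type, byte[] raw) {
        this.type = type;
        this.raw = raw.clone();
    }

    // Hypothetical setter: lets a caller supply the real code page when known.
    void setCharset(Charset charset) {
        this.charset8Bit = charset;
    }

    // Decode lazily so that a later setCharset call changes the result.
    String getValue() {
        if (type == TYPE_UNICODE) {
            return new String(raw, StandardCharsets.UTF_16LE);
        }
        return new String(raw, charset8Bit);
    }
}
```

Decoding on each getValue() call, rather than once in the constructor, is a
deliberate choice in this sketch: the charset is often only known after the
chunks have already been read.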
DO NOT REPLY [Bug 49441] Wrong CharSet
Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441
Nick Burch <ni...@alfresco.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEEDINFO |RESOLVED
Resolution| |FIXED
--- Comment #6 from Nick Burch <ni...@alfresco.com> 2010-08-03 12:07:45 EDT ---
Fixed in r981947.
There is now a guess7BitEncoding() method on MAPIMessage, which looks in the
headers to guess the encoding, then calls the new set-encoding method on the
string chunks.
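
As a rough illustration of the idea only, guessing a charset from message
headers might look like the sketch below. It is not the code from r981947:
the class name, the choice of the Content-Type header, and the regular
expression are all assumptions made for this example, and the real
guess7BitEncoding() may consult different headers or apply different rules.

```java
import java.nio.charset.Charset;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class HeaderCharsetGuess {
    // Matches e.g.  Content-Type: text/plain; charset="windows-1251"
    private static final Pattern CHARSET = Pattern.compile(
        "^content-type:[^\\r\\n]*?charset=\"?([A-Za-z0-9._:-]+)",
        Pattern.CASE_INSENSITIVE | Pattern.MULTILINE);

    // Returns the first charset named in a Content-Type header that this JVM
    // supports, or empty if none is found.
    static Optional<Charset> guess(String headers) {
        if (headers == null) {
            return Optional.empty();
        }
        // Unfold continuation lines (a line break followed by space or tab).
        String unfolded = headers.replaceAll("\\r?\\n[ \\t]+", " ");
        Matcher m = CHARSET.matcher(unfolded);
        while (m.find()) {
            String name = m.group(1);
            try {
                if (Charset.isSupported(name)) {
                    return Optional.of(Charset.forName(name));
                }
            } catch (IllegalArgumentException e) {
                // Syntactically illegal charset name; keep looking.
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        String headers = "Subject: test\r\n"
            + "Content-Type: text/plain;\r\n\tcharset=\"windows-1251\"\r\n";
        // Fall back to CP1252 when the headers give no usable hint.
        Charset cs = guess(headers).orElse(Charset.forName("windows-1252"));
        System.out.println(cs.name());
    }
}
```

When no charset can be found, falling back to CP1252 keeps the previous
behaviour for messages that were already decoding correctly.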