You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@poi.apache.org by bu...@apache.org on 2010/06/15 10:02:36 UTC

DO NOT REPLY [Bug 49441] New: Wrong CharSet

https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

           Summary: Wrong CharSet
           Product: POI
           Version: 3.6
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: normal
          Priority: P2
         Component: HSMF
        AssignedTo: dev@poi.apache.org
        ReportedBy: aristar@mail.ru


If used an encoding other than Cp1252 StringChunk returns incorrect value. To
solve this issue whe added setCharset method to MAPIMessage, Chunks and
StringChunk.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

--- Comment #5 from Dmitry <ar...@mail.ru> 2010-07-06 03:57:19 EDT ---
I tried on a system with an English localization. If not set the Language for
Non-Unicode Programs to Russian then message open with wrong characters.

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

--- Comment #4 from Nick Burch <ni...@alfresco.com> 2010-06-29 09:30:33 EDT ---
Unfortunately I can't seem to spot anything in the file which indicates the
encoding

If you open the file on a different machine which has a different system
language set, does it look correct or do you get the wrong characters showing
up?

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

--- Comment #3 from Dmitry <ar...@mail.ru> 2010-06-16 01:22:18 EDT ---
I dont tried other versions than 3.6, but in the head revision StringChunk
(from SVN) hardcoded Cp1252. (1251 - Cyrillic)

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

--- Comment #2 from Dmitry <ar...@mail.ru> 2010-06-16 01:21:18 EDT ---
Created an attachment (id=25594)
 --> (https://issues.apache.org/bugzilla/attachment.cgi?id=25594)
Message with cyrillic fields

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

Nick Burch <ni...@alfresco.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |NEEDINFO

--- Comment #1 from Nick Burch <ni...@alfresco.com> 2010-06-15 09:48:13 EDT ---
Have you tried with a recent nightly build? HSMF has undergone a lot of changes
since 3.6

For String chunks which aren't stored as unicode, we assume they're CP1252
based on all the files we've seen - outlook should generally store them as one
of those two. If you have found a file that differs, please do upload it, and
also please help us track down where in the file that charset is stored!

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org


DO NOT REPLY [Bug 49441] Wrong CharSet

Posted by bu...@apache.org.
https://issues.apache.org/bugzilla/show_bug.cgi?id=49441

Nick Burch <ni...@alfresco.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEEDINFO                    |RESOLVED
         Resolution|                            |FIXED

--- Comment #6 from Nick Burch <ni...@alfresco.com> 2010-08-03 12:07:45 EDT ---
Fixed in r981947.

There is now a guess7BitEncoding() method on MAPIMessage, which looks in the
headers to guess the encoding, then calls the new set encoding method on the
string chunks

-- 
Configure bugmail: https://issues.apache.org/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@poi.apache.org
For additional commands, e-mail: dev-help@poi.apache.org