Posted to dev@commons.apache.org by bu...@apache.org on 2005/07/21 16:16:42 UTC

DO NOT REPLY [Bug 35814] New: - [configuration] Support Byte Order Marks

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG-RELATED
COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://issues.apache.org/bugzilla/show_bug.cgi?id=35814>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.

http://issues.apache.org/bugzilla/show_bug.cgi?id=35814

           Summary: [configuration] Support Byte Order Marks
           Product: Commons
           Version: Nightly Builds
          Platform: All
        OS/Version: All
            Status: NEW
          Severity: enhancement
          Priority: P2
         Component: Configuration
        AssignedTo: commons-dev@jakarta.apache.org
        ReportedBy: ebourg@apache.org


We should support byte order marks (BOMs) so that Unicode files are detected
automatically and parsed with the appropriate encoding. Here is a description
of BOMs:

http://en.wikipedia.org/wiki/Byte_Order_Mark

I suggest the following changes:
- add a setUseByteOrderMark() method to FileConfiguration to specify whether a
BOM should be written when the file is saved
- ignore the useByteOrderMark flag if the encoding has no corresponding BOM
- on loading the file, look for a BOM at the beginning of the file; if one is
found, set the useByteOrderMark flag to true and switch to the corresponding
encoding. The flag is false by default
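
To make the proposal more concrete, here is a rough sketch of how the
detection on load and the lookup on save might work. It is only an
illustration: the BomSniffer class and its detect() and bomFor() methods are
hypothetical names and are not part of Commons Configuration. It assumes the
stream is wrapped in a PushbackInputStream with a pushback buffer of at least
4 bytes.

```java
import java.io.IOException;
import java.io.PushbackInputStream;

/** Hypothetical helper, not an existing Commons Configuration class. */
final class BomSniffer {
    // Longer marks come first so that UTF-32LE (FF FE 00 00) is tried
    // before UTF-16LE (FF FE), which is a prefix of it.
    private static final String[] NAMES =
        {"UTF-32BE", "UTF-32LE", "UTF-8", "UTF-16BE", "UTF-16LE"};
    private static final int[][] MARKS = {
        {0x00, 0x00, 0xFE, 0xFF},
        {0xFF, 0xFE, 0x00, 0x00},
        {0xEF, 0xBB, 0xBF},
        {0xFE, 0xFF},
        {0xFF, 0xFE},
    };

    /**
     * Returns the encoding named by a leading BOM and consumes the BOM,
     * or returns null and leaves the stream untouched if there is none.
     * Note: FF FE 00 00 could also be UTF-16LE text starting with U+0000;
     * this sketch simply prefers UTF-32LE in that case.
     */
    static String detect(PushbackInputStream in) throws IOException {
        byte[] head = new byte[4];
        int n = 0;
        while (n < head.length) {
            int r = in.read(head, n, head.length - n);
            if (r < 0) {
                break; // end of stream before 4 bytes were available
            }
            n += r;
        }
        for (int i = 0; i < MARKS.length; i++) {
            int[] mark = MARKS[i];
            if (n >= mark.length && startsWith(head, mark)) {
                // Push back whatever was read past the BOM.
                in.unread(head, mark.length, n - mark.length);
                return NAMES[i];
            }
        }
        in.unread(head, 0, n); // no BOM: push everything back
        return null;
    }

    /** Returns the BOM for an encoding, or an empty array if it has none. */
    static byte[] bomFor(String encoding) {
        for (int i = 0; i < NAMES.length; i++) {
            if (NAMES[i].equalsIgnoreCase(encoding)) {
                byte[] bom = new byte[MARKS[i].length];
                for (int j = 0; j < bom.length; j++) {
                    bom[j] = (byte) MARKS[i][j];
                }
                return bom;
            }
        }
        return new byte[0];
    }

    private static boolean startsWith(byte[] data, int[] mark) {
        for (int j = 0; j < mark.length; j++) {
            if ((data[j] & 0xFF) != mark[j]) {
                return false;
            }
        }
        return true;
    }
}
```

With something like this, load() could call detect() before creating the
Reader: a non-null result would set useByteOrderMark to true and override the
configured encoding. save() could write bomFor(encoding) first when the flag
is set; the empty array returned for encodings without a BOM covers the
"flag is ignored" case.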
