You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Jim Scott (JIRA)" <ji...@apache.org> on 2014/09/22 18:14:35 UTC

[jira] [Created] (DRILL-1439) Allow file / field text encoding type to be specifed so that UTF-8 default can be overriden

Jim Scott created DRILL-1439:
--------------------------------

             Summary: Allow file / field text encoding type to be specifed so that UTF-8 default can be overriden 
                 Key: DRILL-1439
                 URL: https://issues.apache.org/jira/browse/DRILL-1439
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Text & CSV
    Affects Versions: 0.5.0
            Reporter: Jim Scott


Drill parsed a field and found that the byte value 0xb5 was invalid. That byte is from ISO-8859-1 and not UTF-8. The equivalent character in UTF-8 would be 0xc2b5. If there was a way to tell the function or the file is a different encoding this problem could be prevented.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)