You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nifi.apache.org by "Joseph Percivall (JIRA)" <ji...@apache.org> on 2015/11/24 20:06:11 UTC

[jira] [Created] (NIFI-1217) New processor to determine flowfile text content's encoding

Joseph Percivall created NIFI-1217:
--------------------------------------

             Summary: New processor to determine flowfile text content's encoding
                 Key: NIFI-1217
                 URL: https://issues.apache.org/jira/browse/NIFI-1217
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Joseph Percivall


A file can enter the through many different means. Most of which make it almost impossible to find the text encoding of the file without relying on OS specific commands.

There is a need for a processor that can analyze the contents of a flowfile to determine the text encoding. As a start this library may be of help [1]. It uses a Mozilla 1.1 license which needs special treatment [2].

Here is the email thread discussing this [3].

[1] http://jchardet.sourceforge.net/
[2] http://www.apache.org/legal/resolved.html#category-b
[3] http://mail-archives.apache.org/mod_mbox/nifi-users/201511.mbox/%3C1093351346.8086538.1448382574704.JavaMail.yahoo%40mail.yahoo.com%3E



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)