You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nifi.apache.org by "Joseph Percivall (JIRA)" <ji...@apache.org> on 2015/11/24 20:06:11 UTC
[jira] [Created] (NIFI-1217) New processor to determine flowfile
text content's encoding
Joseph Percivall created NIFI-1217:
--------------------------------------
Summary: New processor to determine flowfile text content's encoding
Key: NIFI-1217
URL: https://issues.apache.org/jira/browse/NIFI-1217
Project: Apache NiFi
Issue Type: Improvement
Reporter: Joseph Percivall
A file can enter the through many different means. Most of which make it almost impossible to find the text encoding of the file without relying on OS specific commands.
There is a need for a processor that can analyze the contents of a flowfile to determine the text encoding. As a start this library may be of help [1]. It uses a Mozilla 1.1 license which needs special treatment [2].
Here is the email thread discussing this [3].
[1] http://jchardet.sourceforge.net/
[2] http://www.apache.org/legal/resolved.html#category-b
[3] http://mail-archives.apache.org/mod_mbox/nifi-users/201511.mbox/%3C1093351346.8086538.1448382574704.JavaMail.yahoo%40mail.yahoo.com%3E
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)