You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nifi.apache.org by "Ian Hellstrom (JIRA)" <ji...@apache.org> on 2016/02/16 08:26:18 UTC

[jira] [Created] (NIFI-1517) Allow SplitContent to be split on a regular expression

Ian Hellstrom created NIFI-1517:
-----------------------------------

             Summary: Allow SplitContent to be split on a regular expression
                 Key: NIFI-1517
                 URL: https://issues.apache.org/jira/browse/NIFI-1517
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Extensions
            Reporter: Ian Hellstrom
            Priority: Minor


Currently SplitContent allows HEX and text sequences to be added. However, it is sometimes necessary to split on alternatives or based on different sections of a log file (sometimes indicated by "[SOME_TEXT]"), where the section name SOME_TEXT can obviously vary. Hence, regular expressions (or EL) should be allowed in the SplitContent processor when using the "text" option.

It would also be great if it's possible to immediately extract relevant information from the split. For instance, create a RegEx that back-references SOME_TEXT in the aforementioned example. That way you could split files based on section markers yet immediately get rid of these markers. This additional request is a nice-to-have feature.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)