You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nifi.apache.org by "Ian Hellstrom (JIRA)" <ji...@apache.org> on 2016/02/16 08:26:18 UTC
[jira] [Created] (NIFI-1517) Allow SplitContent to be split on a
regular expression
Ian Hellstrom created NIFI-1517:
-----------------------------------
Summary: Allow SplitContent to be split on a regular expression
Key: NIFI-1517
URL: https://issues.apache.org/jira/browse/NIFI-1517
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Ian Hellstrom
Priority: Minor
Currently SplitContent allows HEX and text sequences to be added. However, it is sometimes necessary to split on alternatives or based on different sections of a log file (sometimes indicated by "[SOME_TEXT]"), where the section name SOME_TEXT can obviously vary. Hence, regular expressions (or EL) should be allowed in the SplitContent processor when using the "text" option.
It would also be great if it's possible to immediately extract relevant information from the split. For instance, create a RegEx that back-references SOME_TEXT in the aforementioned example. That way you could split files based on section markers yet immediately get rid of these markers. This additional request is a nice-to-have feature.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)