You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Phil Hawkins (Jira)" <ji...@apache.org> on 2022/04/27 08:11:00 UTC

[jira] [Commented] (NIFI-1517) Allow SplitContent to be split on a regular expression

    [ https://issues.apache.org/jira/browse/NIFI-1517?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17528624#comment-17528624 ] 

Phil Hawkins commented on NIFI-1517:
------------------------------------

I have created a fix for this issue [https://github.com/gippyphil/nifi/pull/1]

 

> Allow SplitContent to be split on a regular expression
> ------------------------------------------------------
>
>                 Key: NIFI-1517
>                 URL: https://issues.apache.org/jira/browse/NIFI-1517
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Ian Hellstrom
>            Priority: Minor
>
> Currently SplitContent allows HEX and text sequences to be added. However, it is sometimes necessary to split on alternatives or based on different sections of a log file (sometimes indicated by "[SOME_TEXT]"), where the section name SOME_TEXT can obviously vary. Hence, regular expressions (or EL) should be allowed in the SplitContent processor when using the "text" option.
> It would also be great if it's possible to immediately extract relevant information from the split. For instance, create a RegEx that back-references SOME_TEXT in the aforementioned example. That way you could split files based on section markers yet immediately get rid of these markers. This additional request is a nice-to-have feature.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)