You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Luigi De Giovanni (Jira)" <ji...@apache.org> on 2022/11/23 13:56:00 UTC

[jira] [Updated] (NIFI-10869) ExtractText processor - RegEx captured twice

     [ https://issues.apache.org/jira/browse/NIFI-10869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Luigi De Giovanni updated NIFI-10869:
-------------------------------------
    Priority: Minor  (was: Major)

> ExtractText processor - RegEx captured twice
> --------------------------------------------
>
>                 Key: NIFI-10869
>                 URL: https://issues.apache.org/jira/browse/NIFI-10869
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.18.0
>            Reporter: Luigi De Giovanni
>            Priority: Minor
>         Attachments: image (1).png, image (2).png, image.png
>
>
> Hi,
> There is an issue with the ExtractText processor, in the attempt of creating FlowFile attributes from FlowFile contents.
> When creating a custom property with a regex value that does not contain named groups, the value captured in the group is added as 2 different attributes.
> E.G.
> ||Property Name||Property Value||Expected FlowFile Attribute||Actual outcome||
> |request.param.alpha|<regex for alpha>|request.param.alpha=alpha|request.param.alpha=alpha
> request.param.alpha.1=alpha|
> Reading the documentation, this might even be an expected behaviour, but if so, it is preferable to have the captured value only as a single attribute, without duplication.
> Please see the attachment for a real example
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)