You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Luigi De Giovanni (Jira)" <ji...@apache.org> on 2022/11/23 13:56:00 UTC
[jira] [Updated] (NIFI-10869) ExtractText processor - RegEx captured twice
[ https://issues.apache.org/jira/browse/NIFI-10869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Luigi De Giovanni updated NIFI-10869:
-------------------------------------
Priority: Minor (was: Major)
> ExtractText processor - RegEx captured twice
> --------------------------------------------
>
> Key: NIFI-10869
> URL: https://issues.apache.org/jira/browse/NIFI-10869
> Project: Apache NiFi
> Issue Type: Bug
> Affects Versions: 1.18.0
> Reporter: Luigi De Giovanni
> Priority: Minor
> Attachments: image (1).png, image (2).png, image.png
>
>
> Hi,
> There is an issue with the ExtractText processor, in the attempt of creating FlowFile attributes from FlowFile contents.
> When creating a custom property with a regex value that does not contain named groups, the value captured in the group is added as 2 different attributes.
> E.G.
> ||Property Name||Property Value||Expected FlowFile Attribute||Actual outcome||
> |request.param.alpha|<regex for alpha>|request.param.alpha=alpha|request.param.alpha=alpha
> request.param.alpha.1=alpha|
> Reading the documentation, this might even be an expected behaviour, but if so, it is preferable to have the captured value only as a single attribute, without duplication.
> Please see the attachment for a real example
>
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)