You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "John Michaels (Jira)" <ji...@apache.org> on 2020/06/02 01:00:58 UTC

[jira] [Assigned] (NIFI-4356) GetHTMLElement removes content from resulting flowfiles even when destination is attribute

     [ https://issues.apache.org/jira/browse/NIFI-4356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Michaels reassigned NIFI-4356:
-----------------------------------

    Assignee: John Michaels

> GetHTMLElement removes content from resulting flowfiles even when destination is attribute
> ------------------------------------------------------------------------------------------
>
>                 Key: NIFI-4356
>                 URL: https://issues.apache.org/jira/browse/NIFI-4356
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Extensions
>    Affects Versions: 1.3.0
>            Reporter: Andy LoPresto
>            Assignee: John Michaels
>            Priority: Major
>              Labels: html
>
> As noted in [StackOverflow|https://stackoverflow.com/questions/46070288/multiple-gethtml-attributes-in-apache-nifi], {{GetHTMLElement}} processors cannot be chained because the success relationship clears the flowfile content even if the destination is an attribute. I propose the logic be changed to the following:
> * Destination = content
>   * Replace the flowfile content for the *success* relationship
>   * Maintain the content for *original*
> * Destination = attribute
>   * Maintain the content for *success* and *original*
>   * Add a property descriptor for the attribute name
>   * Name attributes {{Attribute.1}} as in {{Extract*}} in the event that multiple elements are matched



--
This message was sent by Atlassian Jira
(v8.3.4#803005)