You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nifi.apache.org by "Bryan Bende (JIRA)" <ji...@apache.org> on 2015/11/18 23:36:10 UTC

[jira] [Updated] (NIFI-1174) Create a Put HBase processor that can put multiple cells

     [ https://issues.apache.org/jira/browse/NIFI-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bryan Bende updated NIFI-1174:
------------------------------
    Attachment: NIFI-1174.patch

Attaching a patch which adds a PutHBaseJSON that can add a row to HBase using the elements of the JSON as the column qualifiers and values. For right now it only supports input of a single JSON document (the processor does operate on a configurable batch of FlowFiles though), and only flat elements. If a value of an element is an array it will be skipped over and not sent in the row.

Probably pushing my luck, but would be nice to get this in 0.4.0 for two reasons... 
* It refactors the HBaseClientService API a little bit to better support adding a whole row, and it will be harder to change the API after we release. We can easily add other variations after this such as CSV, or anything else.
* We can support a lot more HBase use cases by being able to add a whole row at a time

> Create a Put HBase processor that can put multiple cells
> --------------------------------------------------------
>
>                 Key: NIFI-1174
>                 URL: https://issues.apache.org/jira/browse/NIFI-1174
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Bryan Bende
>            Assignee: Bryan Bende
>            Priority: Minor
>         Attachments: NIFI-1174.patch
>
>
> We recently added a PutHBaseCell processor which works great for writing one individual cell at a time, but it can require a significant amount of work in a flow to create a row with multiple cells. 
> We should support a variation of this processor that can accept a flow file with key/value pairs in the content of the flow file (possibly json). The key/value pairs then turned into the cells for the given row and get added in one put operation. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)