You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Chris Sampson (Jira)" <ji...@apache.org> on 2020/11/12 09:15:00 UTC

[jira] [Updated] (NIFI-7990) PutElasticsearch/RecordHttp processors should support Elasticsearch Data Streams

     [ https://issues.apache.org/jira/browse/NIFI-7990?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Chris Sampson updated NIFI-7990:
--------------------------------
    Description: 
PutElasticsearchHttp and PutElasticsearchRecordHttp (and possibly other ES related processors) should support the new [Elasticsearch Data Streams|https://www.elastic.co/guide/en/elasticsearch/reference/current/use-a-data-stream.html#add-documents-to-a-data-stream].

As these processors use the {{_bulk}} endpoint to PUT one or more documents in one request, the processors need to be updated to support the "create" operation type. This change is likely related to: NIFI-7474.

Also, Data Streams require an {{@timestamp}} field to be provided in each document, however such a field name is illegal in [Avro schemas|http://avro.apache.org/docs/1.8.2/spec.html#names] due to the leading {{@}}. The processors should therefore allow for the injection of this field into the JSON being sent to Elasticsearch - this could be based upon an existing field within the FlowFile and be identified by a property on the processor (e.g. like the {{_id}} field can be specified using Record Path). Optionally, the processor allow for the field used as the {{@timestamp}} field to be removed from the data being sent to Elasticsearch (i.e. rename the existing field *or* duplicate it depending upon property settings). Such field transformation should also take the timestamp format settings into account(e.g. if a {{Long}} epoch millisecond value is to be converted to a formatted date/time {{String}}).

  was:
PutElasticsearchHttp and PutElasticsearchRecordHttp (and possibly other ES related processors) should support the new [Elasticsearch Data Streams|https://www.elastic.co/guide/en/elasticsearch/reference/current/use-a-data-stream.html#add-documents-to-a-data-stream].

As these processors use the _bulk endpoint to PUT one or more documents in one request, the processors need to be updated to support the "create" operation type.

Likely related to: NIFI-7474


> PutElasticsearch/RecordHttp processors should support Elasticsearch Data Streams
> --------------------------------------------------------------------------------
>
>                 Key: NIFI-7990
>                 URL: https://issues.apache.org/jira/browse/NIFI-7990
>             Project: Apache NiFi
>          Issue Type: Improvement
>    Affects Versions: 1.11.4, 1.12.1
>            Reporter: Chris Sampson
>            Priority: Minor
>
> PutElasticsearchHttp and PutElasticsearchRecordHttp (and possibly other ES related processors) should support the new [Elasticsearch Data Streams|https://www.elastic.co/guide/en/elasticsearch/reference/current/use-a-data-stream.html#add-documents-to-a-data-stream].
> As these processors use the {{_bulk}} endpoint to PUT one or more documents in one request, the processors need to be updated to support the "create" operation type. This change is likely related to: NIFI-7474.
> Also, Data Streams require an {{@timestamp}} field to be provided in each document, however such a field name is illegal in [Avro schemas|http://avro.apache.org/docs/1.8.2/spec.html#names] due to the leading {{@}}. The processors should therefore allow for the injection of this field into the JSON being sent to Elasticsearch - this could be based upon an existing field within the FlowFile and be identified by a property on the processor (e.g. like the {{_id}} field can be specified using Record Path). Optionally, the processor allow for the field used as the {{@timestamp}} field to be removed from the data being sent to Elasticsearch (i.e. rename the existing field *or* duplicate it depending upon property settings). Such field transformation should also take the timestamp format settings into account(e.g. if a {{Long}} epoch millisecond value is to be converted to a formatted date/time {{String}}).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)