You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "dingli123 (Jira)" <ji...@apache.org> on 2020/05/22 10:55:00 UTC

[jira] [Commented] (NIFI-7478) support bloom filter in orc format output

    [ https://issues.apache.org/jira/browse/NIFI-7478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17113939#comment-17113939 ] 

dingli123 commented on NIFI-7478:
---------------------------------

in {color:#6f42c1}NiFiOrcUtils.java:{color}
{code:java}
 return new OrcFlowFileWriter(flowFileOutputStream,
                path,
                conf,
                inspector,
                stripeSize,
                compress,
                bufferSize,
                rowIndexStride,
                getMemoryManager(conf),
                addBlockPadding,
                versionValue,
                null, // no callback
                encodingStrategy,
                compressionStrategy,
                paddingTolerance,
                blockSizeValue,
                null, // no Bloom Filter column names
                bloomFilterFpp);
{code}
{color:#6f42c1}it seems ignore the bloom filter column config{color}

 

> support bloom filter in orc format output
> -----------------------------------------
>
>                 Key: NIFI-7478
>                 URL: https://issues.apache.org/jira/browse/NIFI-7478
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: dingli123
>            Priority: Major
>
> current orc output don't support create bloom filter for column
> in Hive, create orc table can set bloom filter config
> but if hive create external table with orc file created by NiFi ,
> there are no bloom filter can be used for speed query
> Please add bloom filter support in orc file output



--
This message was sent by Atlassian Jira
(v8.3.4#803005)