You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2016/07/27 03:30:20 UTC

[jira] [Commented] (NIFI-1663) Add support for ORC format

    [ https://issues.apache.org/jira/browse/NIFI-1663?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15394999#comment-15394999 ] 

ASF GitHub Bot commented on NIFI-1663:
--------------------------------------

GitHub user mattyb149 opened a pull request:

    https://github.com/apache/nifi/pull/727

    NIFI-1663: Add ConvertAvroToORC processor

    This PR is based on #706 which removed the ConvertAvroToORC processor using Hive 2.x and Apache ORC 1.x. This PR replaces that processor with one that uses Hive 1.2.1 (which includes hive-orc before it was split into its own Apache project).

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/mattyb149/nifi old_orc

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/727.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #727
    
----
commit fa04188470e96d979819b7bf71723cad25c25105
Author: Matt Burgess <ma...@apache.org>
Date:   2016-07-21T15:59:41Z

    NIFI-1868: Add PutHiveStreaming processor

commit 108fb52113e57c424516fbfdcfc6b15edc34f12b
Author: Matt Burgess <ma...@apache.org>
Date:   2016-07-22T15:14:16Z

    NIFI-1868: Downgrade to Hive 1.2.1 and remove ConvertAvroToORC

commit 91de9edcbc85f90f104c24f1f18caf4f4f0283d9
Author: Matt Burgess <ma...@apache.org>
Date:   2016-07-27T03:25:11Z

    NIFI-1663: Add ConvertAvroToORC processor

----


> Add support for ORC format
> --------------------------
>
>                 Key: NIFI-1663
>                 URL: https://issues.apache.org/jira/browse/NIFI-1663
>             Project: Apache NiFi
>          Issue Type: New Feature
>            Reporter: Matt Burgess
>            Assignee: Matt Burgess
>             Fix For: 1.0.0
>
>
> From the Hive/ORC wiki (https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC): 
> The Optimized Row Columnar (ORC) file format provides a highly efficient way to store Hive data ... Using ORC files improves performance when Hive is reading, writing, and processing data.
> As users are interested in NiFi integrations with Hive (NIFI-981, NIFI-1193, etc.), NiFi should be able to support ORC file format to enable users to efficiently store flow files for use by Hive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)