Posted to dev@arrow.apache.org by "HAOFENG DENG (Jira)" <ji...@apache.org> on 2020/02/21 15:19:00 UTC

[jira] [Created] (ARROW-7906) Full functionality for ORC format

HAOFENG DENG created ARROW-7906:
-----------------------------------

             Summary: Full functionality for ORC format
                 Key: ARROW-7906
                 URL: https://issues.apache.org/jira/browse/ARROW-7906
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++, Python
            Reporter: HAOFENG DENG


Like the Parquet format, ORC has a large following in the big data area, and it performs better than Parquet in some use cases.

However, there is a problem on the Python side: it does not have a standard ORC write function.

 

It seems the ORC team itself maintains the standard C++ code (ORC-C++: https://github.com/apache/orc/tree/master/c%2B%2B), so I think it will not take too much effort to integrate it into Arrow (C++) and build the hooks for Python.

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)