Posted to issues@orc.apache.org by "Gopal V (JIRA)" <ji...@apache.org> on 2015/06/25 00:24:04 UTC

[jira] [Commented] (ORC-17) Support HDFS as a plug-in module

    [ https://issues.apache.org/jira/browse/ORC-17?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600275#comment-14600275 ] 

Gopal V commented on ORC-17:
----------------------------

HTTP range requests would be a huge win when dealing with S3: an ORC columnar read plan could be converted into a single HTTP fetch spanning multiple seeks.
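
A minimal sketch of that idea, written in Python for brevity rather than in the C++ of the ORC reader. It takes a read plan as a list of (offset, length) pairs, merges reads that touch or sit close together, and formats the result as one HTTP Range header value. The names `coalesce`, `range_header` and `max_gap` are hypothetical and not part of ORC. Whether a given object store honors a multi-range request is an assumption to verify; if it does not, the fallback is one request per coalesced range, or a larger `max_gap` so the plan collapses into a single range.

```python
from typing import List, Tuple

Range = Tuple[int, int]  # (offset, length), both in bytes


def coalesce(ranges: List[Range], max_gap: int = 0) -> List[Range]:
    """Merge reads that overlap, touch, or lie within max_gap bytes of each other."""
    merged: List[Range] = []
    for offset, length in sorted(ranges):
        if length <= 0:
            continue  # nothing to read for this entry
        if merged:
            prev_offset, prev_length = merged[-1]
            prev_end = prev_offset + prev_length
            if offset <= prev_end + max_gap:
                # Extend the previous read instead of issuing a new seek.
                new_end = max(prev_end, offset + length)
                merged[-1] = (prev_offset, new_end - prev_offset)
                continue
        merged.append((offset, length))
    return merged


def range_header(ranges: List[Range]) -> str:
    """Format reads as an HTTP Range header value (byte positions are inclusive)."""
    if not ranges:
        raise ValueError("at least one range is required")
    parts = [f"{off}-{off + length - 1}" for off, length in ranges]
    return "bytes=" + ",".join(parts)


# Hypothetical read plan: four column-stream reads, given out of order.
plan = [(4096, 1024), (0, 512), (5120, 256), (100000, 2048)]
header = range_header(coalesce(plan))
# header == "bytes=0-511,4096-5375,100000-102047"
```

Raising `max_gap` trades extra bytes transferred for fewer ranges: with `max_gap=4096` the first three reads above collapse into the single range 0-5375.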

> Support HDFS as a plug-in module
> --------------------------------
>
>                 Key: ORC-17
>                 URL: https://issues.apache.org/jira/browse/ORC-17
>             Project: Orc
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Charles Pritchard
>
> Continuing from: https://github.com/hortonworks/orc/issues/149
> There are C/C++ modules for reading from and writing to HDFS. There may be additional use cases for HTTP-based protocols like WebHDFS and S3.
> As omalley states: "The C++ API has a pluggable interface for where to get the bytes. It already has a binding [to] the local file system and it would be easy to extend it to other file systems or object stores."
> It would be nice to see a build of C++ ORC which includes HDFS support, as a build flag or otherwise provided as a module/separate repository.
> I'd also like to see WebHDFS/S3 supported, once that baseline has been met.


