You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Dinesh S. Atreya (Jira)" <ji...@apache.org> on 2020/08/19 17:05:00 UTC

[jira] [Commented] (ORC-42) Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA

    [ https://issues.apache.org/jira/browse/ORC-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180682#comment-17180682 ] 

Dinesh S. Atreya commented on ORC-42:
-------------------------------------

 

Now Hadoop Distributed Data Store is an active effort, starting with _Hadoop Distributed Storage Layer (HDSL)_ HDFS-7240, *Ozone File System* (aka Hadoop Distributed Data Store) HDFS-13074 after formally being named with *Ozone: Rename HDSL to HDDS* HDFS-13405.

 

Ozone is built on a highly available, replicated block storage layer called Hadoop Distributed Data Store (HDDS) and is S3 compatible. Ozone can function effectively in containerized environments such as _*Kubernetes*_ and _*YARN*_. Ozone supports different protocols like S3 and Hadoop File System APIs.

 

Work on this JIRA and related JIRAs are using S3 compatible Ozone as a base.

> Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA
> ---------------------------------------------------------------
>
>                 Key: ORC-42
>                 URL: https://issues.apache.org/jira/browse/ORC-42
>             Project: ORC
>          Issue Type: New Feature
>            Reporter: Dinesh S. Atreya
>            Priority: Major
>
> Link to Umbrella JIRA
> https://issues.apache.org/jira/browse/HADOOP-12620
> See https://issues.apache.org/jira/browse/HADOOP-12620?focusedCommentId=15046300&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15046300 for more details. 
> This JIRA is an umbrella (parent/master) JIRA for advancing ORC through ORC update capability given https://issues.apache.org/jira/browse/HDFS-9607.
> A number of capabilities that can be added to ORC once ORC update (HDFS update) is supported may include: 
> JSON_ORC -- native processing of JSON (add MongoDB/CouchDB type capabilities in Hadoop)
> XML_ORC -- add native XML processing capability to ORC.
> RDF_ORC -- native processing of RDF documents
> MVCC_ORC -- Add Multi Version Concurrency Control (MVCC) support to ORC
> INDEX_ORC -- Create a variety of Indexes such as B-Tree, Bitmap etc. to other files in Hadoop.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)