You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@orc.apache.org by "Dinesh S. Atreya (JIRA)" <ji...@apache.org> on 2016/02/15 08:14:18 UTC

[jira] [Comment Edited] (ORC-42) Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA

    [ https://issues.apache.org/jira/browse/ORC-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15146928#comment-15146928 ] 

Dinesh S. Atreya edited comment on ORC-42 at 2/15/16 7:13 AM:
--------------------------------------------------------------

{panel:title=INDEX_ORC}
_*Once comprehensive index processing capabilities is added to ORC i.e, Hadoop, it can be used to build indexes to other types of files in Hadoop.*_

Some candidate index types are given below
* Binary-Tree
* B-Tree
* B+-Tree
* Bit-Map
* Search Indexes

Search engines such as Solr, Elastic-Search etc. can use these index processing capabilities. 
{panel}


was (Author: dinatreya):
{panel:title=INDEX_ORC}
_*Once comprehensive index processing capabilities is added to ORC i.e, Hadoop, it can be used to build indexes to other types of file in Hadoop.*_

Some candidate index types are given below
* Binary-Tree
* B-Tree
* B+-Tree
* Bit-Map
* Search Indexes

Search engines such as Solr, Elastic-Search etc. can use these index processing capabilities. 
{panel}

> Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA
> ---------------------------------------------------------------
>
>                 Key: ORC-42
>                 URL: https://issues.apache.org/jira/browse/ORC-42
>             Project: Orc
>          Issue Type: New Feature
>            Reporter: Dinesh S. Atreya
>
> Link to Umbrella JIRA
> https://issues.apache.org/jira/browse/HADOOP-12620
> See https://issues.apache.org/jira/browse/HADOOP-12620?focusedCommentId=15046300&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15046300 for more details. 
> This JIRA is an umbrella (parent/master) JIRA for advancing ORC given https://issues.apache.org/jira/browse/HDFS-9607.
> A number of capabilities that can be added to ORC once HDFS update is supported may include: 
> JSON_ORC -- native processing of JSON (add MongoDB/CouchDB type capabilities in Hadoop)
> XML_ORC -- add native XML processing capability to ORC.
> RDF_ORC -- native processing of RDF documents
> MVCC_ORC -- Add Multi Version Concurrency MVCC support to ORC
> INDEX_ORC -- Create a variety of Indexes such as B-Tree, Bitmap etc. to other files in Hadoop.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)