You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Jun Rao (JIRA)" <ji...@apache.org> on 2009/01/12 05:25:00 UTC

[jira] Created: (HBASE-1122) Leveraging HBase control layer to build a distributed text index

Leveraging HBase control layer to build a distributed text index
----------------------------------------------------------------

                 Key: HBASE-1122
                 URL: https://issues.apache.org/jira/browse/HBASE-1122
             Project: Hadoop HBase
          Issue Type: New Feature
            Reporter: Jun Rao


Hi,

A few us at IBM Almaden Research Center built a distributed text index prototype called HIndex. The key design point of HIndex is to build the index by leveraging the distributed control layer in HBase, for availability, elasticity and load balancing. In our prototype, we used Lucene to implement a new type of region for storing the text index. Attached is a research paper that we wrote and submitted to USENIX 2009. It covers the design of HIndex and a performance evaluation (some of the results are applicable to HBase too).

We are grateful for the HBase community. We welcome comments and suggestions.


Jun


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HBASE-1122) Leveraging HBase control layer to build a distributed text index

Posted by "Jun Rao (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jun Rao updated HBASE-1122:
---------------------------

    Attachment: usenix09.pdf

A research paper containing the design and the performance evaluation of HIndex.

> Leveraging HBase control layer to build a distributed text index
> ----------------------------------------------------------------
>
>                 Key: HBASE-1122
>                 URL: https://issues.apache.org/jira/browse/HBASE-1122
>             Project: Hadoop HBase
>          Issue Type: New Feature
>            Reporter: Jun Rao
>         Attachments: usenix09.pdf
>
>
> Hi,
> A few us at IBM Almaden Research Center built a distributed text index prototype called HIndex. The key design point of HIndex is to build the index by leveraging the distributed control layer in HBase, for availability, elasticity and load balancing. In our prototype, we used Lucene to implement a new type of region for storing the text index. Attached is a research paper that we wrote and submitted to USENIX 2009. It covers the design of HIndex and a performance evaluation (some of the results are applicable to HBase too).
> We are grateful for the HBase community. We welcome comments and suggestions.
> Jun

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.