You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Jun Rao (JIRA)" <ji...@apache.org> on 2009/01/12 05:25:00 UTC
[jira] Created: (HBASE-1122) Leveraging HBase control layer to
build a distributed text index
Leveraging HBase control layer to build a distributed text index
----------------------------------------------------------------
Key: HBASE-1122
URL: https://issues.apache.org/jira/browse/HBASE-1122
Project: Hadoop HBase
Issue Type: New Feature
Reporter: Jun Rao
Hi,
A few us at IBM Almaden Research Center built a distributed text index prototype called HIndex. The key design point of HIndex is to build the index by leveraging the distributed control layer in HBase, for availability, elasticity and load balancing. In our prototype, we used Lucene to implement a new type of region for storing the text index. Attached is a research paper that we wrote and submitted to USENIX 2009. It covers the design of HIndex and a performance evaluation (some of the results are applicable to HBase too).
We are grateful for the HBase community. We welcome comments and suggestions.
Jun
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HBASE-1122) Leveraging HBase control layer to
build a distributed text index
Posted by "Jun Rao (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HBASE-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jun Rao updated HBASE-1122:
---------------------------
Attachment: usenix09.pdf
A research paper containing the design and the performance evaluation of HIndex.
> Leveraging HBase control layer to build a distributed text index
> ----------------------------------------------------------------
>
> Key: HBASE-1122
> URL: https://issues.apache.org/jira/browse/HBASE-1122
> Project: Hadoop HBase
> Issue Type: New Feature
> Reporter: Jun Rao
> Attachments: usenix09.pdf
>
>
> Hi,
> A few us at IBM Almaden Research Center built a distributed text index prototype called HIndex. The key design point of HIndex is to build the index by leveraging the distributed control layer in HBase, for availability, elasticity and load balancing. In our prototype, we used Lucene to implement a new type of region for storing the text index. Attached is a research paper that we wrote and submitted to USENIX 2009. It covers the design of HIndex and a performance evaluation (some of the results are applicable to HBase too).
> We are grateful for the HBase community. We welcome comments and suggestions.
> Jun
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.