You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Nitish Upreti (JIRA)" <ji...@apache.org> on 2014/07/20 07:35:39 UTC
[jira] [Commented] (HBASE-5626) Compactions simulator tool for
proofing algorithms
[ https://issues.apache.org/jira/browse/HBASE-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14067807#comment-14067807 ]
Nitish Upreti commented on HBASE-5626:
--------------------------------------
I am a newbie (student), learning about HBase and want to get started contributing to the project. I have been scanning through the HBase "noob" tag and found this issue interesting to work on. As this issue was last updated on 24/Jan/13 21:51, is the community still interested in this Task?
I understand the overall concepts of log-structured merge tree and reducing the maximum number of disk seeks needed by compaction. I also understand how HBase has pluggable a compaction component where we can exploit performance benefits by knowing our data and request patterns in depth.
What are the relevant packages / source files / API References I should look into for this task? Any general pointers from the community for working on this task will be of great help. :)
> Compactions simulator tool for proofing algorithms
> --------------------------------------------------
>
> Key: HBASE-5626
> URL: https://issues.apache.org/jira/browse/HBASE-5626
> Project: HBase
> Issue Type: Task
> Reporter: stack
> Priority: Minor
> Labels: noob
> Attachments: cf_compact.py
>
>
> A tool to run compaction simulations would be a nice to have. We could use it to see how well an algo ran under different circumstances loaded w/ different value types with different rates of flushes and splits, etc. HBASE-2462 had one (see in patch). Or we could try doing it using something like this: http://en.wikipedia.org/wiki/Discrete_event_simulation
--
This message was sent by Atlassian JIRA
(v6.2#6252)