You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hama.apache.org by "Edward J. Yoon (JIRA)" <ji...@apache.org> on 2008/12/05 10:50:44 UTC
[jira] Created: (HAMA-125) Refactor dense matrix storage
Refactor dense matrix storage
-----------------------------
Key: HAMA-125
URL: https://issues.apache.org/jira/browse/HAMA-125
Project: Hama
Issue Type: Improvement
Components: implementation
Affects Versions: 0.1.0
Reporter: Edward J. Yoon
Fix For: 0.1.0
Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
Current table schema:
column:
=============================
rowKey1 column:0 <0.323..>
column:1 <0.413..>
... N
rowKey2
...
We can group the columns as describe below. (2D array is too complex.)
vector:
=============================
rowKey1 vector:0 <0 ~ 9999999>
vector:1 <10000000 ~ 19999999>
...
rowKey2
...
----
Should be changed things :
- DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
- DenseVector : get/set(), subVector(), ...
...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Resolved: (HAMA-125) Refactor dense matrix storage
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Edward J. Yoon resolved HAMA-125.
---------------------------------
Resolution: Won't Fix
I think we need more experiences to discuss this.
won't fix.
> Refactor dense matrix storage
> -----------------------------
>
> Key: HAMA-125
> URL: https://issues.apache.org/jira/browse/HAMA-125
> Project: Hama
> Issue Type: Improvement
> Components: implementation
> Affects Versions: 0.1.0
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
>
> Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
> Current table schema:
> column:
> =============================
> rowKey1 column:0 <0.323..>
> column:1 <0.413..>
> ... N
> rowKey2
> ...
> We can group the columns as describe below. (2D array is too complex.)
> vector:
> =============================
> rowKey1 vector:0 <0 ~ 9999999>
> vector:1 <10000000 ~ 19999999>
> ...
> rowKey2
> ...
> ----
> Should be changed things :
> - DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
> - DenseVector : get/set(), subVector(), ...
> ...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HAMA-125) Refactor dense matrix storage
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653723#action_12653723 ]
Edward J. Yoon commented on HAMA-125:
-------------------------------------
Also,
- BlockingMapred
- Reduce writers
> Refactor dense matrix storage
> -----------------------------
>
> Key: HAMA-125
> URL: https://issues.apache.org/jira/browse/HAMA-125
> Project: Hama
> Issue Type: Improvement
> Components: implementation
> Affects Versions: 0.1.0
> Reporter: Edward J. Yoon
> Fix For: 0.1.0
>
>
> Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
> Current table schema:
> column:
> =============================
> rowKey1 column:0 <0.323..>
> column:1 <0.413..>
> ... N
> rowKey2
> ...
> We can group the columns as describe below. (2D array is too complex.)
> vector:
> =============================
> rowKey1 vector:0 <0 ~ 9999999>
> vector:1 <10000000 ~ 19999999>
> ...
> rowKey2
> ...
> ----
> Should be changed things :
> - DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
> - DenseVector : get/set(), subVector(), ...
> ...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Assigned: (HAMA-125) Refactor dense matrix storage
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Edward J. Yoon reassigned HAMA-125:
-----------------------------------
Assignee: Edward J. Yoon
> Refactor dense matrix storage
> -----------------------------
>
> Key: HAMA-125
> URL: https://issues.apache.org/jira/browse/HAMA-125
> Project: Hama
> Issue Type: Improvement
> Components: implementation
> Affects Versions: 0.1.0
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
> Fix For: 0.1.0
>
>
> Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
> Current table schema:
> column:
> =============================
> rowKey1 column:0 <0.323..>
> column:1 <0.413..>
> ... N
> rowKey2
> ...
> We can group the columns as describe below. (2D array is too complex.)
> vector:
> =============================
> rowKey1 vector:0 <0 ~ 9999999>
> vector:1 <10000000 ~ 19999999>
> ...
> rowKey2
> ...
> ----
> Should be changed things :
> - DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
> - DenseVector : get/set(), subVector(), ...
> ...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Commented: (HAMA-125) Refactor dense matrix storage
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12654292#action_12654292 ]
Edward J. Yoon commented on HAMA-125:
-------------------------------------
In this case,
vector a = matrix.getColumn(2);
It'll very slow. 100 * 100 submatrix seems better.
> Refactor dense matrix storage
> -----------------------------
>
> Key: HAMA-125
> URL: https://issues.apache.org/jira/browse/HAMA-125
> Project: Hama
> Issue Type: Improvement
> Components: implementation
> Affects Versions: 0.1.0
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
> Fix For: 0.1.0
>
>
> Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
> Current table schema:
> column:
> =============================
> rowKey1 column:0 <0.323..>
> column:1 <0.413..>
> ... N
> rowKey2
> ...
> We can group the columns as describe below. (2D array is too complex.)
> vector:
> =============================
> rowKey1 vector:0 <0 ~ 9999999>
> vector:1 <10000000 ~ 19999999>
> ...
> rowKey2
> ...
> ----
> Should be changed things :
> - DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
> - DenseVector : get/set(), subVector(), ...
> ...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
[jira] Updated: (HAMA-125) Refactor dense matrix storage
Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Edward J. Yoon updated HAMA-125:
--------------------------------
Fix Version/s: (was: 0.1.0)
Fix version changed.
We need more experiences before the decision.
> Refactor dense matrix storage
> -----------------------------
>
> Key: HAMA-125
> URL: https://issues.apache.org/jira/browse/HAMA-125
> Project: Hama
> Issue Type: Improvement
> Components: implementation
> Affects Versions: 0.1.0
> Reporter: Edward J. Yoon
> Assignee: Edward J. Yoon
>
> Current entry is too small, have a lot of column keys. So, it cause too many requests over hbase.
> Current table schema:
> column:
> =============================
> rowKey1 column:0 <0.323..>
> column:1 <0.413..>
> ... N
> rowKey2
> ...
> We can group the columns as describe below. (2D array is too complex.)
> vector:
> =============================
> rowKey1 vector:0 <0 ~ 9999999>
> vector:1 <10000000 ~ 19999999>
> ...
> rowKey2
> ...
> ----
> Should be changed things :
> - DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
> - DenseVector : get/set(), subVector(), ...
> ...
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.