Posted to dev@hama.apache.org by "Edward J. Yoon (JIRA)" <ji...@apache.org> on 2008/12/05 10:50:44 UTC

[jira] Created: (HAMA-125) Refactor dense matrix storage

Refactor dense matrix storage
-----------------------------

                 Key: HAMA-125
                 URL: https://issues.apache.org/jira/browse/HAMA-125
             Project: Hama
          Issue Type: Improvement
          Components: implementation
    Affects Versions: 0.1.0
            Reporter: Edward J. Yoon
             Fix For: 0.1.0


Each entry is currently too small, so a row has a very large number of column keys. This causes too many requests to HBase.

Current table schema: 

                       column:
=============================
rowKey1       column:0   <0.323..>
                       column:1   <0.413..>
                      ...             N
rowKey2 
...

We can group the columns as described below. (A 2D array is too complex.)

                       vector:
=============================
rowKey1       vector:0   <0 ~ 9999999>
                       vector:1   <10000000 ~ 19999999>
                      ...             
rowKey2 
...
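
To illustrate the grouping above, here is a minimal sketch of how a matrix column index could be mapped to a grouped cell and an offset inside it. The class and method names are hypothetical and not part of Hama; the block size of 10,000,000 is taken from the ranges in the diagram.

```java
/** Hypothetical helper, not a Hama class: maps a column index to a grouped "vector:" cell. */
final class VectorBlockLayout {
    /** Entries per cell; 10,000,000 matches the ranges shown in the diagram. */
    public static final long BLOCK_SIZE = 10_000_000L;

    /** Index of the cell (the N in "vector:N") that holds the given column. */
    public static long blockIndex(long column) {
        return column / BLOCK_SIZE;
    }

    /** Position of the column inside its cell. */
    public static int offsetInBlock(long column) {
        return (int) (column % BLOCK_SIZE);
    }

    /** Column key for the cell that holds the given column, e.g. "vector:1". */
    public static String qualifier(long column) {
        return "vector:" + blockIndex(column);
    }

    public static void main(String[] args) {
        long column = 10_000_123L;
        // prints "vector:1 offset 123"
        System.out.println(qualifier(column) + " offset " + offsetInBlock(column));
    }
}
```

With this mapping, a row needs one column key per 10,000,000 entries instead of one per entry, which is the reduction in requests the proposal is after.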

----
Things that should be changed:
- DenseMatrix : get/set(), get/setRow(), get/setColumn() ..
- DenseVector : get/set(), subVector(), ...
...

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Resolved: (HAMA-125) Refactor dense matrix storage

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward J. Yoon resolved HAMA-125.
---------------------------------

    Resolution: Won't Fix

I think we need more experience before we can discuss this.

Won't fix.



[jira] Commented: (HAMA-125) Refactor dense matrix storage

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12653723#action_12653723 ] 

Edward J. Yoon commented on HAMA-125:
-------------------------------------

These should also be changed:

- BlockingMapred
- Reduce writers



[jira] Assigned: (HAMA-125) Refactor dense matrix storage

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward J. Yoon reassigned HAMA-125:
-----------------------------------

    Assignee: Edward J. Yoon



[jira] Commented: (HAMA-125) Refactor dense matrix storage

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12654292#action_12654292 ] 

Edward J. Yoon commented on HAMA-125:
-------------------------------------

With the grouped layout, a call such as
vector a = matrix.getColumn(2);

will be very slow. A 100 * 100 submatrix seems better.
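
A rough way to see why is to count how many values must be fetched to read a single column under each layout. The sketch below is only a back-of-the-envelope model: the class, the method names, and the example matrix dimensions are hypothetical, and it ignores HBase request overhead, caching, and serialization.

```java
/** Hypothetical cost model, not a Hama class: values fetched to read one matrix column. */
final class ColumnReadCost {
    /**
     * Row-grouped layout: each row is split into cells of blockWidth entries.
     * Reading one column touches one cell in every row, although only one
     * value per cell is actually needed.
     */
    public static long rowGroupedCost(long rows, long cols, long blockWidth, long column) {
        long blockStart = (column / blockWidth) * blockWidth;
        long cellWidth = Math.min(blockWidth, cols - blockStart); // last cell may be narrower
        return rows * cellWidth;
    }

    /**
     * Submatrix layout: the matrix is stored as blockDim x blockDim blocks.
     * Reading one column touches one block per group of blockDim rows.
     */
    public static long subMatrixCost(long rows, long blockDim) {
        long blocks = (rows + blockDim - 1) / blockDim; // ceiling division
        return blocks * blockDim * blockDim;
    }

    public static void main(String[] args) {
        long rows = 10_000L;      // assumed example size
        long cols = 10_000_000L;  // assumed example size
        System.out.println("row-grouped: "
                + rowGroupedCost(rows, cols, 10_000_000L, 2) + " values");
        System.out.println("100 x 100 blocks: "
                + subMatrixCost(rows, 100) + " values");
    }
}
```

Under these assumed dimensions the submatrix layout fetches far fewer values per column read, because each block it touches holds only 10,000 entries rather than a full row segment.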







[jira] Updated: (HAMA-125) Refactor dense matrix storage

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward J. Yoon updated HAMA-125:
--------------------------------

    Fix Version/s:     (was: 0.1.0)

Fix version changed.
We need more experience before making a decision.
