You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hama.apache.org by "Samuel Guo (JIRA)" <ji...@apache.org> on 2009/01/14 04:36:59 UTC

[jira] Created: (HAMA-144) GetProgress during MR over a matrix

GetProgress during MR over a matrix
-----------------------------------

                 Key: HAMA-144
                 URL: https://issues.apache.org/jira/browse/HAMA-144
             Project: Hama
          Issue Type: Improvement
          Components: implementation
            Reporter: Samuel Guo
            Priority: Minor


Defaultly, we don't know how to get progress during MR over a HBase table.

But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
we can compute *progress* by <no. row processed> / <total rows of the split>.

And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

test passed. 
commit it.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch, HAMA-144_v03.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12663978#action_12663978 ] 

Hudson commented on HAMA-144:
-----------------------------

-1 overall.  Here are the results of testing the latest attachment 
http://issues.apache.org/jira/secure/attachment/12397931/HAMA-144.patch
against trunk revision 732267.

    @author +1.  The patch does not contain any @author tags.

    tests included -1.  The patch doesn't appear to include any new or modified tests.
                        Please justify why no tests are needed for this patch.

    patch -1.  The patch command could not apply the patch.

Console output: http://hudson.zones.apache.org/hudson/job/Hama-Patch/142/console

This message is automatically generated.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Status: Patch Available  (was: Open)

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Attachment: HAMA-144_v02.patch

again

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Attachment: HAMA-144_v03.patch

try again. 


> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch, HAMA-144_v03.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664013#action_12664013 ] 

Hudson commented on HAMA-144:
-----------------------------

-1 overall.  Here are the results of testing the latest attachment 
http://issues.apache.org/jira/secure/attachment/12397941/HAMA-144_v01.patch
against trunk revision 734602.

    @author +1.  The patch does not contain any @author tags.

    tests included -1.  The patch doesn't appear to include any new or modified tests.
                        Please justify why no tests are needed for this patch.

    patch -1.  The patch command could not apply the patch.

Console output: http://hudson.zones.apache.org/hudson/job/Hama-Patch/144/console

This message is automatically generated.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664099#action_12664099 ] 

Hudson commented on HAMA-144:
-----------------------------

-1 overall.  Here are the results of testing the latest attachment 
http://issues.apache.org/jira/secure/attachment/12397956/HAMA-144_v03.patch
against trunk revision 734602.

    @author +1.  The patch does not contain any @author tags.

    tests included -1.  The patch doesn't appear to include any new or modified tests.
                        Please justify why no tests are needed for this patch.

    javadoc +1.  The javadoc tool did not generate any warning messages.

    javac +1.  The applied patch does not generate any new javac compiler warnings.

    release audit +1.  The applied patch does not generate any new release audit warnings.

    findbugs +1.  The patch does not introduce any new Findbugs warnings.

    core tests +1.  The patch passed core unit tests.

Test results: http://hudson.zones.apache.org/hudson/job/Hama-Patch/145/testReport/
Findbugs warnings: http://hudson.zones.apache.org/hudson/job/Hama-Patch/145/artifact/trunk/build/reports/findbugs/newPatchFindbugsWarnings.html
Checkstyle results: http://hudson.zones.apache.org/hudson/job/Hama-Patch/145/artifact/trunk/build/test/checkstyle-errors.html
Console output: http://hudson.zones.apache.org/hudson/job/Hama-Patch/145/console

This message is automatically generated.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch, HAMA-144_v03.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12663960#action_12663960 ] 

Edward J. Yoon commented on HAMA-144:
-------------------------------------

+1 for this patch.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Attachment: HAMA-144.patch

add *getProgress* method in VectorInputFormat's RecordReader.
so we can get detail progress information during processing matrix using VectorInputFormat.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664044#action_12664044 ] 

Samuel Guo commented on HAMA-144:
---------------------------------

hmm, yes, I create the patch by 'svn diff'. I also update my code reposity and create a new patch.
I will try again.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Attachment: HAMA-144_v01.patch

repatched it. 

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (HAMA-144) GetProgress during MR over a matrix

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664071#action_12664071 ] 

Edward J. Yoon commented on HAMA-144:
-------------------------------------

Looks good.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch, HAMA-144_v03.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Status: Patch Available  (was: Open)

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Status: Patch Available  (was: Open)

hope it will work.

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch, HAMA-144_v03.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Samuel Guo (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Samuel Guo updated HAMA-144:
----------------------------

    Status: Open  (was: Patch Available)

> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HAMA-144) GetProgress during MR over a matrix

Posted by "Edward J. Yoon (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HAMA-144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward J. Yoon updated HAMA-144:
--------------------------------

    Status: Open  (was: Patch Available)

02.patch also invalid.
You can create a patch as '$ svn diff > HADOOP-1234.patch'

----
[root@udanax hama-trunk]# patch -p0 < HAMA-144_v02.patch 
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/hama/algebra/RowCyclicAdditionMap.java
Hunk #1 FAILED at 43.
1 out of 1 hunk FAILED -- saving rejects to file src/java/org/apache/hama/algebra/RowCyclicAdditionMap.java.rej
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/hama/algebra/SIMDMultiplyMap.java
Hunk #1 FAILED at 49.
1 out of 1 hunk FAILED -- saving rejects to file src/java/org/apache/hama/algebra/SIMDMultiplyMap.java.rej
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/hama/mapred/VectorInputFormat.java
Hunk #1 FAILED at 26.
Hunk #2 FAILED at 50.
Hunk #3 FAILED at 115.
3 out of 3 hunks FAILED -- saving rejects to file src/java/org/apache/hama/mapred/VectorInputFormat.java.rej
(Stripping trailing CRs from patch.)
patching file src/java/org/apache/hama/SubMatrix.java
Hunk #1 FAILED at 209.
1 out of 1 hunk FAILED -- saving rejects to file src/java/org/apache/hama/SubMatrix.java.rej
[root@udanax hama-trunk]# 


> GetProgress during MR over a matrix
> -----------------------------------
>
>                 Key: HAMA-144
>                 URL: https://issues.apache.org/jira/browse/HAMA-144
>             Project: Hama
>          Issue Type: Improvement
>          Components: implementation
>            Reporter: Samuel Guo
>            Priority: Minor
>         Attachments: HAMA-144.patch, HAMA-144_v01.patch, HAMA-144_v02.patch
>
>
> Defaultly, we don't know how to get progress during MR over a HBase table.
> But in a matrix, the data are sorted by row-no. we know the rows of a split of a HBase table.
> we can compute *progress* by <no. row processed> / <total rows of the split>.
> And this is helpful that we can know the progress during blocking, multiplication and other operations.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.