Posted to dev@mahout.apache.org by "Han Hui Wen (JIRA)" <ji...@apache.org> on 2010/08/18 18:38:17 UTC

[jira] Issue Comment Edited: (MAHOUT-483) Job RowSimilarityJob-Mapper-EntriesToVectorsReducer improvement

    [ https://issues.apache.org/jira/browse/MAHOUT-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899897#action_12899897 ] 

Han Hui Wen  edited comment on MAHOUT-483 at 8/18/10 12:37 PM:
---------------------------------------------------------------

As I recall, the keys were already sorted in the previous phase.
So the work done by the original reducer can be done in the Mapper phase of the new job.
The new job then needs no reducer phase, and no data has to be copied from the mappers to the reducers.
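
A minimal sketch of the idea, not the actual Mahout code: if the entries arrive already sorted by row key, one streaming pass can collect them into vectors with no shuffle step. The names SortedEntriesToVectors, Entry, RowVector and group are hypothetical, and the sketch assumes that EntriesToVectorsReducer only gathers the entries sharing a row key into one vector, as its name suggests. It also assumes that all entries for a given row reach the same mapper; if a row were split across input splits, a map-only job would emit partial vectors. In Hadoop itself, a map-only job is configured with Job.setNumReduceTasks(0).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class SortedEntriesToVectors {

    /** Hypothetical stand-in for one (row, column, value) input record. */
    static final class Entry {
        final int row;
        final int column;
        final double value;

        Entry(int row, int column, double value) {
            this.row = row;
            this.column = column;
            this.value = value;
        }
    }

    /** Hypothetical sparse row vector: column index mapped to value. */
    static final class RowVector {
        final int row;
        final Map<Integer, Double> cells = new LinkedHashMap<>();

        RowVector(int row) {
            this.row = row;
        }
    }

    /**
     * Groups entries into one vector per row in a single pass.
     * Assumes the input is already sorted by row, so all entries of a row
     * are adjacent and no shuffle or sort step is needed.
     */
    static List<RowVector> group(List<Entry> sortedEntries) {
        List<RowVector> vectors = new ArrayList<>();
        RowVector current = null;
        for (Entry e : sortedEntries) {
            // A change of row key closes the current vector and opens a new one.
            if (current == null || current.row != e.row) {
                if (current != null) {
                    vectors.add(current);
                }
                current = new RowVector(e.row);
            }
            current.cells.put(e.column, e.value);
        }
        if (current != null) {
            vectors.add(current); // flush the last open vector
        }
        return vectors;
    }

    public static void main(String[] args) {
        List<Entry> entries = Arrays.asList(
                new Entry(1, 10, 0.5),
                new Entry(1, 11, 0.25),
                new Entry(2, 10, 1.0));
        for (RowVector v : group(entries)) {
            System.out.println(v.row + " -> " + v.cells);
        }
    }
}
```

In a real mapper the final flush would have to happen when the input is exhausted, since the last vector is only complete once no further entries for its row can arrive.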
 

      was (Author: huiwenhan):
    I remembered that we already sorted the  key in previous phase .
So we can do the task in original reducer task in the new Mapper phase,
The new job should no reducer phase,also need copy the data from mapper to reducer.
 
  
> Job RowSimilarityJob-Mapper-EntriesToVectorsReducer improvement 
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-483
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-483
>             Project: Mahout
>          Issue Type: Test
>          Components: Collaborative Filtering
>    Affects Versions: 0.4
>            Reporter: Han Hui Wen 
>             Fix For: 0.4
>
>         Attachments: screenshot-3.jpg
>
>
> The Mapper phase of the job RowSimilarityJob-Mapper-EntriesToVectorsReducer takes a long time but does no actual work.
> Can we move the EntriesToVectorsReducer task into the Mapper phase and configure the job with no Reducer phase?
> That should improve performance.
