You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2011/09/07 22:21:10 UTC

[jira] [Created] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Start Phase doesn't properly work in RecommenderJob
---------------------------------------------------

                 Key: MAHOUT-802
                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
             Project: Mahout
          Issue Type: Bug
            Reporter: Grant Ingersoll
            Priority: Minor


I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
{quote}
java.lang.IllegalArgumentException: Number of columns was not correctly set!
	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
{quote}

This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Sean Owen (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved MAHOUT-802.
------------------------------

    Resolution: Fixed

(Grant looks like this was committed.)
                
> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100279#comment-13100279 ] 

Sebastian Schelter commented on MAHOUT-802:
-------------------------------------------

I plan to change the input preparation in https://issues.apache.org/jira/browse/MAHOUT-767 I'll provide a first patch shortly maybe it will be easier to address this issue here after that.

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13099599#comment-13099599 ] 

Grant Ingersoll commented on MAHOUT-802:
----------------------------------------

patch coming either tonight or tomorrow am

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Priority: Minor
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-802:
-----------------------------------

    Attachment: MAHOUT-802b.patch

Makes the indexItemId mapping optional

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Reopened] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll reopened MAHOUT-802:
------------------------------------


This also doesn't work because it is hardcoded to accept only the item id path.

Seems to me, id mapping should be an optional step and shouldn't be inherent to the generation of recommendations.

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-802:
-----------------------------------

    Attachment: MAHOUT-802.patch

draft patch.  Has a step to count the items if they weren't already counted in an earlier phase.

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Priority: Minor
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100285#comment-13100285 ] 

Grant Ingersoll commented on MAHOUT-802:
----------------------------------------

Thanks, Sebastian.  The hard part is I'm up against a deadline ;-).

The bigger issue is I have my own input prep all together and even though the Job is built in theory to handle starting at arbitrary phases, it assumes certain things are in specific places.  

I'll try to have my dictionary output to the appropriate places.

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll resolved MAHOUT-802.
------------------------------------

       Resolution: Fixed
    Fix Version/s: 0.6
         Assignee: Grant Ingersoll

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101721#comment-13101721 ] 

Hudson commented on MAHOUT-802:
-------------------------------

Integrated in Mahout-Quality #1028 (See [https://builds.apache.org/job/Mahout-Quality/1028/])
    MAHOUT-802: make item id look ups optional

gsingers : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1167345
Files : 
* /mahout/trunk/core/src/main/java/org/apache/mahout/cf/taste/hadoop/item/AggregateAndRecommendReducer.java


> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101525#comment-13101525 ] 

Grant Ingersoll commented on MAHOUT-802:
----------------------------------------

Sebastian,

Can you detail the input changes?  My stuff that was working is now not working.  Can I still rely on just needing a VectorWritable?

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100301#comment-13100301 ] 

Grant Ingersoll commented on MAHOUT-802:
----------------------------------------

I also don't get the long to int mapping and vice versa.  If it isn't meant to be a long coming in, then why handle it?

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100683#comment-13100683 ] 

Sean Owen commented on MAHOUT-802:
----------------------------------

I think these are fine changes and Sebastian's comment is correct. What's the issue with hard-coded paths?

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101564#comment-13101564 ] 

Grant Ingersoll commented on MAHOUT-802:
----------------------------------------

What's the new RatingMatrix?  I guess I should just give up and output user, item, preference, but it just seems like such a waste when I already have everything for the user vector matrix. 

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAHOUT-802) Start Phase doesn't properly work in RecommenderJob

Posted by "Sebastian Schelter (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100305#comment-13100305 ] 

Sebastian Schelter commented on MAHOUT-802:
-------------------------------------------

The non-distributed recommender code uses longs to identify users and items. In order to stay compatible the distributed code has to support them too, although our distributed matrix operations are always keyed by ints. That's why we need the conversion.

> Start Phase doesn't properly work in RecommenderJob
> ---------------------------------------------------
>
>                 Key: MAHOUT-802
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-802
>             Project: Mahout
>          Issue Type: Bug
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-802.patch, MAHOUT-802b.patch
>
>
> I'm trying to run RecommenderJob and do --startPhase 2 since I have my prefs already in the right format.  Unfortunately, when I do that, I get:
> {quote}
> java.lang.IllegalArgumentException: Number of columns was not correctly set!
> 	at com.google.common.base.Preconditions.checkArgument(Preconditions.java:88)
> 	at org.apache.mahout.math.hadoop.similarity.RowSimilarityJob$SimilarityReducer.setup(RowSimilarityJob.java:296)
> 	at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)
> 	at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:648)
> 	at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:416)
> 	at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:256)
> {quote}
> This appears to be due to the fact that the numberOfUsers variable defaults to 0 and is only set when phase 1 is run.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira