Posted to dev@mahout.apache.org by "Sebastian Schelter (JIRA)" <ji...@apache.org> on 2014/04/20 10:39:16 UTC

[jira] [Commented] (MAHOUT-1518) Preprocessing for collaborative filtering with the Scala DSL

    [ https://issues.apache.org/jira/browse/MAHOUT-1518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13975083#comment-13975083 ] 

Sebastian Schelter commented on MAHOUT-1518:
--------------------------------------------

[~pferrel] the attached patch contains a preprocessor that should be able to handle your input. Have a look at the PreprocessorExample in the patch to see how it is applied.

Once we're happy with this, we can combine it with the cooccurrence analysis code from MAHOUT-1464.



> Preprocessing for collaborative filtering with the Scala DSL
> ------------------------------------------------------------
>
>                 Key: MAHOUT-1518
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1518
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: Sebastian Schelter
>            Assignee: Sebastian Schelter
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1518.patch
>
>
> The aim here is to provide easy-to-use machinery for running the new Cooccurrence Analysis code from MAHOUT-1464 on datasets stored in a CSV file with the schema _timestamp, userId, itemId, action_, e.g.
> {code}
> timestamp1, userIdString1, itemIdString1, "view"
> timestamp2, userIdString2, itemIdString1, "like"
> {code}
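
To make the input format above concrete, here is a minimal sketch in plain Scala of one plausible preprocessing step: parse each CSV line, map the string user and item IDs to consecutive integer indices, and group the resulting (user, item) pairs by action. This is not the code in MAHOUT-1518.patch and it does not use the Mahout DSL. All names (Interaction, PreprocessingSketch, parseLine, buildDictionary, cellsByAction) are hypothetical, and it assumes fields contain no embedded commas.

```scala
// Hypothetical sketch: none of these names come from MAHOUT-1518.patch.
case class Interaction(timestamp: String, userId: String, itemId: String, action: String)

object PreprocessingSketch {

  // Split one CSV line on commas, trim whitespace, and strip the double
  // quotes around the action. Lines without exactly four fields are skipped.
  def parseLine(line: String): Option[Interaction] =
    line.split(",").map(_.trim) match {
      case Array(ts, user, item, action) =>
        Some(Interaction(ts, user, item, action.replace("\"", "")))
      case _ => None
    }

  // Assign consecutive integer indices to string IDs in order of first appearance.
  def buildDictionary(ids: Seq[String]): Map[String, Int] =
    ids.distinct.zipWithIndex.toMap

  // For each action, collect the (userIndex, itemIndex) pairs that would
  // become the non-zero cells of that action's user-by-item matrix.
  def cellsByAction(interactions: Seq[Interaction]): Map[String, Seq[(Int, Int)]] = {
    val userIndex = buildDictionary(interactions.map(_.userId))
    val itemIndex = buildDictionary(interactions.map(_.itemId))
    interactions.groupBy(_.action).map { case (action, group) =>
      action -> group.map(i => (userIndex(i.userId), itemIndex(i.itemId)))
    }
  }

  def main(args: Array[String]): Unit = {
    val lines = Seq(
      "timestamp1, userIdString1, itemIdString1, \"view\"",
      "timestamp2, userIdString2, itemIdString1, \"like\""
    )
    val interactions = lines.flatMap(parseLine)
    cellsByAction(interactions).foreach { case (action, cells) =>
      println(s"$action -> $cells")
    }
  }
}
```

Keeping one set of index pairs per action means each action can later be turned into its own user-by-item matrix while sharing the same user and item dictionaries across actions.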



--
This message was sent by Atlassian JIRA
(v6.2#6252)