Posted to dev@mahout.apache.org by "Olivier Grisel (JIRA)" <ji...@apache.org> on 2010/01/19 15:05:54 UTC

[jira] Created: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Writable for labeled vectors for supervised learning algorithms
---------------------------------------------------------------

                 Key: MAHOUT-262
                 URL: https://issues.apache.org/jira/browse/MAHOUT-262
             Project: Mahout
          Issue Type: New Feature
          Components: Classification
    Affects Versions: 0.2
            Reporter: Olivier Grisel
             Fix For: 0.3


Implement two new classes:

 - SingleLabelVectorWritable for a singly classified vectorized data item (exactly one label index per instance)

 - MultiLabelVectorWritable for a multi-categorized vectorized data item (zero or more category indexes per instance)
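
The two classes described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the attached patches: the class names SingleLabelVector and MultiLabelVector, the dense double[] representation, and the byte layout are all hypothetical. The write/readFields method shapes mirror Hadoop's Writable contract, but the sketch uses only java.io so that it runs on its own.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.util.Arrays;

public class LabeledVectorSketch {

    /** Hypothetical: a dense vector carrying exactly one label index. */
    static class SingleLabelVector {
        int label;
        double[] values = new double[0];

        void write(DataOutput out) throws IOException {
            out.writeInt(label);
            writeValues(out, values);
        }

        void readFields(DataInput in) throws IOException {
            label = in.readInt();
            values = readValues(in);
        }
    }

    /** Hypothetical: a dense vector carrying zero or more category indexes. */
    static class MultiLabelVector {
        int[] labels = new int[0];
        double[] values = new double[0];

        void write(DataOutput out) throws IOException {
            // The label count goes first so that zero labels is representable.
            out.writeInt(labels.length);
            for (int label : labels) {
                out.writeInt(label);
            }
            writeValues(out, values);
        }

        void readFields(DataInput in) throws IOException {
            labels = new int[in.readInt()];
            for (int i = 0; i < labels.length; i++) {
                labels[i] = in.readInt();
            }
            values = readValues(in);
        }
    }

    // Assumed layout: element count followed by the raw doubles.
    static void writeValues(DataOutput out, double[] values) throws IOException {
        out.writeInt(values.length);
        for (double v : values) {
            out.writeDouble(v);
        }
    }

    static double[] readValues(DataInput in) throws IOException {
        double[] values = new double[in.readInt()];
        for (int i = 0; i < values.length; i++) {
            values[i] = in.readDouble();
        }
        return values;
    }

    public static void main(String[] args) throws IOException {
        MultiLabelVector original = new MultiLabelVector();
        original.labels = new int[] {2, 7};
        original.values = new double[] {0.5, 0.0, 1.5};

        // Serialize to bytes, then read back into a fresh instance.
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        original.write(new DataOutputStream(bytes));

        MultiLabelVector copy = new MultiLabelVector();
        copy.readFields(new DataInputStream(new ByteArrayInputStream(bytes.toByteArray())));
        System.out.println(Arrays.toString(copy.labels) + " " + Arrays.toString(copy.values));
    }
}
```

Writing the label count before the labels is what lets the multi-label case hold zero labels, while the single-label case needs only one int and therefore cannot represent an unlabeled instance.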

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Commented: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by Jake Mannix <ja...@gmail.com>.
Looks like the patch was generated with a "git diff" without the --no-prefix
option.

On Jan 22, 2010 12:10 AM, "Sean Owen (JIRA)" <ji...@apache.org> wrote:


   [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803629#action_12803629 ]

Sean Owen commented on MAHOUT-262:
----------------------------------

It looks fine to me, but for some reason I am not able to apply the patch.
My SVN client doesn't like it -- it's getting mixed up by things like the
/dev/null business and the a/ and b/ directories.

Are you able to generate a patch in a different way, like with SVN itself?

It's not terrible for me to manually apply the patch with some copy and
paste, but since I had trouble with a couple of your patches I'm wondering if
it's just me, or if we should look at the issue so your future patches work
seamlessly. I wouldn't want to make a typo when applying by hand.


[jira] Updated: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Olivier Grisel (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olivier Grisel updated MAHOUT-262:
----------------------------------

    Attachment: MAHOUT-262-2.patch

Updated version of the patch, generated with --no-prefix so that it applies against current trunk.


[jira] Commented: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803629#action_12803629 ] 

Sean Owen commented on MAHOUT-262:
----------------------------------

It looks fine to me, but for some reason I am not able to apply the patch. My SVN client doesn't like it -- it's getting mixed up by things like the /dev/null business and the a/ and b/ directories.

Are you able to generate a patch in a different way, like with SVN itself?

It's not terrible for me to manually apply the patch with some copy and paste, but since I had trouble with a couple of your patches I'm wondering if it's just me, or if we should look at the issue so your future patches work seamlessly. I wouldn't want to make a typo when applying by hand.


[jira] Commented: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Isabel Drost (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12803690#action_12803690 ] 

Isabel Drost commented on MAHOUT-262:
-------------------------------------

It should be possible to apply the patch with -p1 instead of -p0 to strip the a/ and b/ directory prefixes.
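
To illustrate why -p1 helps here, the following is a simplified sketch of the prefix stripping that the -pN option of patch performs on file names: each level removes one leading path component. The class name, method name, and example path are hypothetical, and real patch handles more cases (such as /dev/null and paths with fewer components than requested) than this sketch does.

```java
public class PatchPathStrip {

    /**
     * Drops the first 'strip' leading path components from a file name,
     * roughly the way "patch -pN" treats the paths found in a diff header.
     * If the path runs out of slashes, whatever remains is returned as-is.
     */
    static String stripComponents(String path, int strip) {
        String rest = path;
        for (int i = 0; i < strip; i++) {
            int slash = rest.indexOf('/');
            if (slash < 0) {
                return rest;
            }
            rest = rest.substring(slash + 1);
        }
        return rest;
    }

    public static void main(String[] args) {
        // Hypothetical path, prefixed the way "git diff" does by default.
        String fromGitDiff = "a/core/src/Example.java";
        System.out.println("-p0: " + stripComponents(fromGitDiff, 0));
        System.out.println("-p1: " + stripComponents(fromGitDiff, 1));
    }
}
```

With -p0 the a/ prefix stays in the path, so the file is not found in an SVN checkout; with -p1 it is dropped. Generating the diff with --no-prefix avoids the prefixes in the first place.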


[jira] Updated: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Olivier Grisel (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olivier Grisel updated MAHOUT-262:
----------------------------------

    Attachment: MAHOUT-262-1.patch

Here is a patch that implements those two classes and fixes a tiny bug in SequentialRandomAccessSparseVector found in the tests.


[jira] Updated: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-262:
-----------------------------

    Resolution: Fixed
      Assignee: Olivier Grisel
        Status: Resolved  (was: Patch Available)


[jira] Updated: (MAHOUT-262) Writable for labeled vectors for supervised learning algorithms

Posted by "Olivier Grisel (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Olivier Grisel updated MAHOUT-262:
----------------------------------

    Status: Patch Available  (was: Open)

Please feel free to review and comment on this patch.
