You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@systemml.apache.org by "Matthias Boehm (JIRA)" <ji...@apache.org> on 2016/09/24 07:08:20 UTC
[jira] [Comment Edited] (SYSTEMML-700) Inflexible category labels
for Multinomial Logistic Regression
[ https://issues.apache.org/jira/browse/SYSTEMML-700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15518574#comment-15518574 ]
Matthias Boehm edited comment on SYSTEMML-700 at 9/24/16 7:07 AM:
------------------------------------------------------------------
that's a good point - could you clarify what exactly you have in mind (e.g., scale values, non-contiguous integers, or even frames with strings).
Just to bring everybody on the same page, here is what we currently do for adjusting y (after preparation we require integers starting from 1)
{code}
if (min (Y_vec) <= 0) {
# Category labels "0", "-1" etc. are converted into the largest label
max_y = max (Y_vec);
Y_vec = Y_vec + (- Y_vec + max_y + 1) * (Y_vec <= 0);
}
{code}
One could imaging something like {{transformencode(as.frame(y))}} to recode anything into the required range.
was (Author: mboehm7):
that's a good point - could you clarify what exactly you have in mind (e.g., scale values, non-contiguous integers, or even frames with strings).
Just to bring everybody on the same page, here is what we currently do for adjusting y (after preparation we require integers starting from 1)
{code}
if (min (Y_vec) <= 0) {
# Category labels "0", "-1" etc. are converted into the largest label
max_y = max (Y_vec);
Y_vec = Y_vec + (- Y_vec + max_y + 1) * (Y_vec <= 0);
}
{code}
One could imaging something like transformencode(as.frame(y)) to recode anything into the required range.
> Inflexible category labels for Multinomial Logistic Regression
> --------------------------------------------------------------
>
> Key: SYSTEMML-700
> URL: https://issues.apache.org/jira/browse/SYSTEMML-700
> Project: SystemML
> Issue Type: Bug
> Components: Algorithms
> Reporter: Jeremy
> Priority: Minor
> Original Estimate: 4h
> Remaining Estimate: 4h
>
> The Logistic Regression algorithm requires that category labels be labeled as 0 up to the number of classes-1. It should be able to handle any set of category labels provided by the user. B_out should have the appropriate size regardless of the values of the labels given, and the algorithm should also preserve the original labeling for the user.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)