You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Nandish Jayaram (JIRA)" <ji...@apache.org> on 2019/03/28 23:36:00 UTC

[jira] [Created] (MADLIB-1313) Add 1-hot encoding support for dependent var in fit

Nandish Jayaram created MADLIB-1313:
---------------------------------------

             Summary: Add 1-hot encoding support for dependent var in fit
                 Key: MADLIB-1313
                 URL: https://issues.apache.org/jira/browse/MADLIB-1313
             Project: Apache MADlib
          Issue Type: New Feature
          Components: Deep Learning
            Reporter: Nandish Jayaram
             Fix For: v1.16


The current fit function for DL assumes dependent variable is not one-hot encoded. But fit uses data obtained after running minibatch_preprocessor_dl that returns a 1-hot encoded array for each dependent var (https://issues.apache.org/jira/browse/MADLIB-1303). 

Fit should be able to work with this 1-hot encoded data to train the model, and also create a column called class_values in the model summary table to map 1-hot index with a class value.

Predict should then be able to use class_values to figure out the label for a given row in the test table.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)