You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/03/09 01:34:40 UTC

[jira] [Closed] (MADLIB-777) SVM Regression produces different predictions on multiple runs of the same training and test sets.

     [ https://issues.apache.org/jira/browse/MADLIB-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Frank McQuillan closed MADLIB-777.
----------------------------------
    Resolution: Fixed

Resolved by writing new SVM for scratch for v1.9

Closing this JIRA.

> SVM Regression produces different predictions on multiple runs of the same training and test sets.
> --------------------------------------------------------------------------------------------------
>
>                 Key: MADLIB-777
>                 URL: https://issues.apache.org/jira/browse/MADLIB-777
>             Project: Apache MADlib
>          Issue Type: Bug
>            Reporter: Srivatsan
>            Assignee: Rahul Iyer
>            Priority: Critical
>             Fix For: v1.9
>
>
> I tested this on MADlib 0.7 but I am not sure if this is version specific.
> Attaching the training & test tables (combo_svm_train.sql and combo_svm_dev.sql) and the SQL file to train MADlib's SVM Regression and to predict using the trained model on the dev set.
> Each time you run the training & prediction, you get wildly different prediction results (the R^2 varies between -0.50 to 0.50 in the several attempts that I ran the model).
> Not sure if this is expected behavior or if there is an error I've overlooked. If it is the expected behavior, the models are unusable unless I train the multiple models in parallel and use some sort of voting to minimize the variation. But this seems serious otherwise.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)