Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/03/09 01:34:40 UTC

[jira] [Closed] (MADLIB-642) SVM Classification Performance: Classification with Kernel function can improve performance on below datasets

     [ https://issues.apache.org/jira/browse/MADLIB-642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Frank McQuillan closed MADLIB-642.
----------------------------------
    Resolution: Fixed

Resolved by writing a new SVM from scratch for v1.9.

Closing this JIRA.

> SVM Classification Performance: Classification with Kernel function can improve performance on below datasets
> -------------------------------------------------------------------------------------------------------------
>
>                 Key: MADLIB-642
>                 URL: https://issues.apache.org/jira/browse/MADLIB-642
>             Project: Apache MADlib
>          Issue Type: Bug
>            Reporter: Jiali Yao
>            Assignee: Rahul Iyer
>             Fix For: v1.9
>
>
> The data sets below do not return a result within several hours. They also fail to return a result in libsvm with similar parameters.
> Data set name	TrainSize	TestSize	Attributes	Rate(1:-1)	Missing	Source URL
> rcv1.binary	20242	677399	47236	365951:331690	N	http://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html#rcv1.binary
> URL Reputation Data Set	19000		3231961	6638:12362	N	http://archive.ics.uci.edu/ml/machine-learning-databases/url/url.names
> Test case
> {code}
> -- method: svm_cls_linear_ds_0_7_lsvm_classification_0
> SELECT madlib.lsvm_classification
>                         ( 'madlibtestdata.svm_url'::text     --input_table
>                         , 'madlibtestresult.cls_model_table'::text    --model_table
>                         , 'true'::boolean       --parallel
>                         , 'false'::boolean        --verbose
>                         , '0.1'::float8            --eta
>                         , '0.001'::float8            --reg
>                    ) AS q;
> -- method: svm_cls_dot_ds_0_1_svm_classification_0
> SELECT madlib.svm_classification
>                         ( 'madlibtestdata.svm_rcv1_binary'::text     --input_table
>                         , 'madlibtestresult.cls_model_table'::text    --model_table
>                         , 'true'::boolean       --parallel
>                         , 'madlib.svm_dot'::text    --kernel_func
>                         , 'false'::boolean        --verbose
>                         , '0.01'::float8            --eta
>                         , '0.005'::float8             --nu
>                    ) AS q;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)