You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/01/09 02:49:39 UTC

[jira] [Created] (MADLIB-952) Type error in elastic net

Frank McQuillan created MADLIB-952:
--------------------------------------

             Summary: Type error in elastic net
                 Key: MADLIB-952
                 URL: https://issues.apache.org/jira/browse/MADLIB-952
             Project: Apache MADlib
          Issue Type: Bug
          Components: Module: Regularized Regression
            Reporter: Frank McQuillan


First check the columns which are text or character using the following code: 

select 
   column_name, data_type
from 
   information_schema.columns
   where table_schema='analytics'
   and table_name='discharge_train' 
   and data_type in ('character varying','text')

It gave me the following result. 

"cohort_assignment";"text"
"providergender";"character varying"
"locationname";"character varying"
"patientkey";"character varying"

Then this is the code I used for elastic net. 

DROP TABLE IF EXISTS analytics.elastic_net_logistic;
SELECT madlib.elastic_net_train( 'analytics.discharge_train',
                                 'analytics.elastic_net_train_result',
                                 'readmission < 0',
                                 '*',
                                 'binomial',
                                 0.6,
                                 0.02,
                                 TRUE,
                                 NULL,
                                 'fista',
                                 '',
                                'totalemergencyvisits_in_current_visit  , totalgimvisits_in_current_visit  , 
                                  totalcardiologyvisits_in_current_visit  , total_time_in_medical_short_inpatient  , 
                                  totalacupulmonaryvisits_in_current_visit  , current_complain_  ,
                                  total_current_dea_ciii, curr_rx_gen_lactated_ringers  ,patientkey  , 
                                  totalobservationvisits_in_current_visit, total_unitsvisited_before_admission  , 
                                  totalpcuneurovisits_in_current_visit , totalicuvisits_in_current_visit  ,curr_rx_gen_oxycodone_w_  , 
                                  total_time_in_medical_short_in_er , totalmfmvisits_in_current_visit  , 
                                  totalmedsurgvisits_in_current_visit  , hospitalaccountrecord, inpatientadmitdatetime  ,totalongvisits_in_current_visit,
                                  no_null_days_btw_admissions,cohort_assignment,locationname,providergender,patientkey',
                                 10000,
                                 1e-6
                               );

If you check in exlcuded list of variables, at the very last I have excluded the character or text variables. 

It still gives me this error. 



                           Elastic Net error: All columns to be included in the

                           independent variables should be of the same numeric type.

Context: Traceback (most recent call last):

  PL/Python function "elastic_net_train", line 23, in <module>

    return elastic_net.elastic_net_train(**globals())

  PL/Python function "elastic_net_train", line 308, in elastic_net_train

  PL/Python function "elastic_net_train", line 427, in analyze_input_str

PL/Python function "elastic_net_train"


Please note I have also tried by dropping all the character or text variables and then passing it to elastic net code. And it still gives me the same error.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)