You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Maximilian Schleich (JIRA)" <ji...@apache.org> on 2016/12/02 11:06:59 UTC

[jira] [Created] (MADLIB-1050) Encoding of categorical variables limited to ~1600 colums?

Maximilian Schleich created MADLIB-1050:
-------------------------------------------

             Summary: Encoding of categorical variables limited to ~1600 colums? 
                 Key: MADLIB-1050
                 URL: https://issues.apache.org/jira/browse/MADLIB-1050
             Project: Apache MADlib
          Issue Type: Bug
            Reporter: Maximilian Schleich


Hello, 
I am trying to use the dummy encoding for categorical variables and feed it to a linear regression model. but my dataset has more than 1664 categories, so Postgres cannot store it in one table. Is there any other way for encoding dummy variables that does not require the creation of a new table, perhaps the function can be streamlined into the regression model? 
Thank you for your help!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)