You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Jingyi Mei (JIRA)" <ji...@apache.org> on 2018/03/08 19:18:00 UTC

[jira] [Created] (MADLIB-1213) Support logical expression for minibatch preprocessor

Jingyi Mei created MADLIB-1213:
----------------------------------

             Summary: Support logical expression for minibatch preprocessor
                 Key: MADLIB-1213
                 URL: https://issues.apache.org/jira/browse/MADLIB-1213
             Project: Apache MADlib
          Issue Type: Improvement
          Components: Module: Utilities
            Reporter: Jingyi Mei
             Fix For: v1.14


The minibatch preprocessor currently does not support all logical expressions for independent and dependent variables.
 # Independent varname does not support any logical expression.
 # Dependent varname only supports logical expression for numerical columns. For ex 'length >1' is a valid expression but it does not support creating an alias for this expression.

This is the only expression that is supported
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 'minibatch_preprocessing_out',  'y > 10',  ' x1,x2', 4);
 {code}

Not supported :
{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 'minibatch_preprocessing_out',  'y > 10 as foo',  'x1,x2', 4);
{code}

{code}
SELECT madlib.minibatch_preprocessor('minibatch_preprocessing_input', 'minibatch_preprocessing_out',  'y=''F''',  'x1,x2', 4);
{code}

Open Questions :
1. Do we need to support logical expressions for both independent and dependent varname
2. If yes, to what extent ? 
3. Should the user be allowed to create an alias for logical expressions? 
4. There might be other modules that may partially support logical expressions. Should we find out which modules ?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)