Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2018/08/08 23:08:00 UTC

[jira] [Commented] (MADLIB-1270) Unexpected behavior in vec2cols function

    [ https://issues.apache.org/jira/browse/MADLIB-1270?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16574016#comment-16574016 ] 

Frank McQuillan commented on MADLIB-1270:
-----------------------------------------

Thanks for reporting, Rashmi.  I suspect vec2cols is looking only at the length of the array in the first row and basing the expected number of output columns on that.  Let us look into it more.
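
To make that suspicion concrete, here is a rough Python sketch of the behavior I am guessing at. It is not the MADlib source: the function name vec2cols_sketch and its logic are hypothetical, written only so that it reproduces the three results in the report (error with 3 names, success with 2, error with 1).

```python
from typing import List, Optional, Sequence


def vec2cols_sketch(rows: Sequence[Sequence[str]],
                    feature_names: Sequence[str]) -> List[List[Optional[str]]]:
    """Hypothetical model of the suspected behavior, NOT MADlib's code."""
    if not rows:
        return []
    # Assumption: only the first row's array length is compared
    # against the number of requested column names.
    first_len = len(rows[0])
    if first_len != len(feature_names):
        raise ValueError("Mismatch between size of vector_col and "
                         "number of cols in feature_names.")
    out = []
    for vec in rows:
        # Longer vectors are silently truncated; shorter ones are
        # padded with None (NULL in the output table).
        out.append(list(vec[:first_len]) + [None] * (first_len - len(vec)))
    return out


# The 't' column from the report, ordered by id.
rows = [["a", "b"], ["c", "d"], ["e", "f"], ["g", "h", "i"], ["j"]]
result = vec2cols_sketch(rows, ["c1", "c2"])
```

If this guess is right, it would explain why array['c1','c2'] succeeds while dropping 'i' from row 4 and leaving c2 empty for row 5, and why any other number of names fails regardless of the longest vector in the table.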

> Unexpected behavior in vec2cols function
> ----------------------------------------
>
>                 Key: MADLIB-1270
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1270
>             Project: Apache MADlib
>          Issue Type: Bug
>          Components: Module: Utilities
>            Reporter: Rashmi Raghu
>            Priority: Minor
>             Fix For: v1.15.1
>
>
> There is some unexpected behavior when the vector column to be split contains vectors with different numbers of elements. E.g.
> Input table:
> select * from test order by id;
>  id |    t
> ----+---------
>   1 | {a,b}
>   2 | {c,d}
>   3 | {e,f}
>   4 | {g,h,i}
>   5 | {j}
> (5 rows)
>  
> select madlib.vec2cols('test','test_out_5','t',array['c1','c2','c3'],'id');
> ERROR: plpy.Error: vec2cols: Mismatch between size of vector_col and number of cols in feature_names.
> CONTEXT: Traceback (most recent call last):
>  PL/Python function "vec2cols", line 23, in <module>
>  return vec2cols_obj.vec2cols(**globals())
>  PL/Python function "vec2cols", line 149, in vec2cols
>  PL/Python function "vec2cols", line 112, in get_names_for_split_output_cols
>  PL/Python function "vec2cols", line 77, in _assert
> PL/Python function "vec2cols"
>  
> select madlib.vec2cols('test','test_out_5','t',array['c1','c2'],'id');
>  vec2cols 
> ----------
>  
> (1 row)
> select * from test_out_5 order by id;
>  id | c1 | c2
> ----+----+----
>   1 | a  | b
>   2 | c  | d
>   3 | e  | f
>   4 | g  | h
>   5 | j  |
> (5 rows)
>  
>  
> select madlib.vec2cols('test','test_out_6','t',array['c1'],'id');
> ERROR: plpy.Error: vec2cols: Mismatch between size of vector_col and number of cols in feature_names.
> CONTEXT: Traceback (most recent call last):
>  PL/Python function "vec2cols", line 23, in <module>
>  return vec2cols_obj.vec2cols(**globals())
>  PL/Python function "vec2cols", line 149, in vec2cols
>  PL/Python function "vec2cols", line 112, in get_names_for_split_output_cols
>  PL/Python function "vec2cols", line 77, in _assert
> PL/Python function "vec2cols"


