You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/11/09 21:25:58 UTC

[jira] [Updated] (MADLIB-1039) Pivoting - Phase 3

     [ https://issues.apache.org/jira/browse/MADLIB-1039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Frank McQuillan updated MADLIB-1039:
------------------------------------
    Description: 
Follow on to these JIRAs
https://issues.apache.org/jira/browse/MADLIB-908
https://issues.apache.org/jira/browse/MADLIB-1004

this capability is to carry over some good ideas from
https://issues.apache.org/jira/browse/MADLIB-1038

Candidate improvements:
* output column naming options
* adding an ‘*’ option and list of features to exclude
* pivot more than 1600 column limit, i.e., most MADlib algos take array input so pivot should support array output
* Support non-STRICT functions in Greenplum and HAWQ; this was removed in 1.9.1 since it is not handled correctly.  Does work OK for Postgres.
* others??? 

References

[1] Good data set
http://pbpython.com/pandas-pivot-table-explained.html

  was:
Follow on to these JIRAs
https://issues.apache.org/jira/browse/MADLIB-908
https://issues.apache.org/jira/browse/MADLIB-1004

this capability is to carry over some good ideas from
https://issues.apache.org/jira/browse/MADLIB-1038

Candidate improvements:
* output column naming options
* adding an ‘*’ option and list of features to exclude
* pivot more than 1600 column limit, i.e., most MADlib algos take array input so pivot should support array output
* Support non-STRICT functions in Greenplum and HAWQ; this was removed in 1.9.1 since it is not handled correctly.  Does work OK for Postgres.
* others??? 



> Pivoting - Phase 3
> ------------------
>
>                 Key: MADLIB-1039
>                 URL: https://issues.apache.org/jira/browse/MADLIB-1039
>             Project: Apache MADlib
>          Issue Type: Improvement
>          Components: Module: Utilities
>            Reporter: Frank McQuillan
>             Fix For: v1.10
>
>
> Follow on to these JIRAs
> https://issues.apache.org/jira/browse/MADLIB-908
> https://issues.apache.org/jira/browse/MADLIB-1004
> this capability is to carry over some good ideas from
> https://issues.apache.org/jira/browse/MADLIB-1038
> Candidate improvements:
> * output column naming options
> * adding an ‘*’ option and list of features to exclude
> * pivot more than 1600 column limit, i.e., most MADlib algos take array input so pivot should support array output
> * Support non-STRICT functions in Greenplum and HAWQ; this was removed in 1.9.1 since it is not handled correctly.  Does work OK for Postgres.
> * others??? 
> References
> [1] Good data set
> http://pbpython.com/pandas-pivot-table-explained.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)