You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hivemall.apache.org by "Makoto Yui (Jira)" <ji...@apache.org> on 2019/11/28 17:48:00 UTC

[jira] [Updated] (HIVEMALL-259) [BUG] feature_binning does not work properly under certain conditions

     [ https://issues.apache.org/jira/browse/HIVEMALL-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Makoto Yui updated HIVEMALL-259:
--------------------------------
    Issue Type: Bug  (was: Improvement)

> [BUG] feature_binning does not work properly under certain conditions
> ---------------------------------------------------------------------
>
>                 Key: HIVEMALL-259
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-259
>             Project: Hivemall
>          Issue Type: Bug
>    Affects Versions: 0.5.2
>            Reporter: Makoto Yui
>            Assignee: Makoto Yui
>            Priority: Trivial
>             Fix For: 0.6.0
>
>
>  
> feature_binning does not properly work in certain condition.
> It might be a bug in quantiles lookup by a different key type object at [this line|[https://github.com/apache/incubator-hivemall/blob/master/core/src/main/java/hivemall/ftvec/binning/FeatureBinningUDF.java#L133]].
>  
> {code:java}
> WITH extracted as (
>   select
>     extract_feature(feature) as index,
>     extract_weight(feature) as value
>   from
>     input l
>     LATERAL VIEW explode(features) r as feature
> ),
> bins as (
>    select
>      map(index, build_bins(value, 5, true)) as quantiles -- 5 bins with auto bin shrinking
>    from
>      extracted
>    group by
>      index
> )
> select
>   l.features as original,
>   feature_binning(l.features, r.quantiles) as features
> from
>   input l
>   cross join bins r
> ;
> {code}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)