You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hivemall.apache.org by "Makoto Yui (Jira)" <ji...@apache.org> on 2019/11/28 17:48:00 UTC
[jira] [Updated] (HIVEMALL-259) [BUG] feature_binning does not work
properly under certain conditions
[ https://issues.apache.org/jira/browse/HIVEMALL-259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Makoto Yui updated HIVEMALL-259:
--------------------------------
Issue Type: Bug (was: Improvement)
> [BUG] feature_binning does not work properly under certain conditions
> ---------------------------------------------------------------------
>
> Key: HIVEMALL-259
> URL: https://issues.apache.org/jira/browse/HIVEMALL-259
> Project: Hivemall
> Issue Type: Bug
> Affects Versions: 0.5.2
> Reporter: Makoto Yui
> Assignee: Makoto Yui
> Priority: Trivial
> Fix For: 0.6.0
>
>
>
> feature_binning does not properly work in certain condition.
> It might be a bug in quantiles lookup by a different key type object at [this line|[https://github.com/apache/incubator-hivemall/blob/master/core/src/main/java/hivemall/ftvec/binning/FeatureBinningUDF.java#L133]].
>
> {code:java}
> WITH extracted as (
> select
> extract_feature(feature) as index,
> extract_weight(feature) as value
> from
> input l
> LATERAL VIEW explode(features) r as feature
> ),
> bins as (
> select
> map(index, build_bins(value, 5, true)) as quantiles -- 5 bins with auto bin shrinking
> from
> extracted
> group by
> index
> )
> select
> l.features as original,
> feature_binning(l.features, r.quantiles) as features
> from
> input l
> cross join bins r
> ;
> {code}
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)