You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ignite.apache.org by "Alexey Zinoviev (Jira)" <ji...@apache.org> on 2019/10/01 14:05:00 UTC

[jira] [Updated] (IGNITE-7025) Implement different strategies to fill missed data in LabeledDataset during loading from file

     [ https://issues.apache.org/jira/browse/IGNITE-7025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Alexey Zinoviev updated IGNITE-7025:
------------------------------------
    Affects Version/s:     (was: 3.0)

> Implement different strategies to fill missed data in LabeledDataset during loading from file
> ---------------------------------------------------------------------------------------------
>
>                 Key: IGNITE-7025
>                 URL: https://issues.apache.org/jira/browse/IGNITE-7025
>             Project: Ignite
>          Issue Type: Task
>          Components: ml
>            Reporter: Alexey Zinoviev
>            Assignee: Alexey Zinoviev
>            Priority: Trivial
>
> For example, it can be four strategies 
> * Fill missed value with zero or empty string or default value for categorical features = ZERO
> * Fill missed value with mean on column/Requires an additional time to calculate = MEAN
> * Fill missed value with mode on column. Requires an additional time to calculate = MODE
> * Deletes observation with missed values. Transforms dataset and changes indexing = DELETE



--
This message was sent by Atlassian Jira
(v8.3.4#803005)