You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@solr.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2023/01/30 14:31:00 UTC

[jira] [Commented] (SOLR-16596) LTR MultipleAdditiveTreeModel do not support missing features' value

    [ https://issues.apache.org/jira/browse/SOLR-16596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17682117#comment-17682117 ] 

ASF subversion and git services commented on SOLR-16596:
--------------------------------------------------------

Commit 8477a83da9250e9d3dcb8d37ab43a615fc16f47c in solr's branch refs/heads/main from aruggero
[ https://gitbox.apache.org/repos/asf?p=solr.git;h=8477a83da92 ]

SOLR-16596: Introduce support for null feature values in LTR MultipleAdditiveTreeModel (#1257)



> LTR MultipleAdditiveTreeModel do not support missing features' value
> --------------------------------------------------------------------
>
>                 Key: SOLR-16596
>                 URL: https://issues.apache.org/jira/browse/SOLR-16596
>             Project: Solr
>          Issue Type: Improvement
>            Reporter: Anna
>            Priority: Minor
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> The current MultipleAdditiveTree model doesn't support missing features' values.
> When a feature value is not passed, the model directly translates it to zero.
> Other LTR model libraries, like xgboost, are able to differentiate missing values from other values and also from zero values. They learn how to treat missing values at training time and add an additional "missing" branch to the tree with the direction learned to be the best in that situation.
> It would be nice to integrate this feature also in Solr MultipleAdditiveTree models. An additional "missing" parameter should be added to the RegressionTreeNode. This will determine the direction to take in case the feature value is missing.
> This integration will allow us to differentiate between zero and missing features. 
> For example, if the feature is "hotel_avg_review" (with a ranking between zero and five stars), we would like to behave differently if the hotel has no reviews (we do not know if it is good) or if it has a review of zero stars (the hotel is bad).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@solr.apache.org
For additional commands, e-mail: issues-help@solr.apache.org