Posted to solr-dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2010/03/01 20:28:05 UTC

[jira] Commented: (SOLR-1772) UpdateProcessor to prune "empty" values

    [ https://issues.apache.org/jira/browse/SOLR-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12839812#action_12839812 ] 

Hoss Man commented on SOLR-1772:
--------------------------------

bq. I'd almost rather see the default behavior changed rather than to put another configurable component in the chain that would slow things down (slightly) for everyone.

That seems backwards -- if FieldType(s) start checking for the empty string, that's a few extra cycles of cost that everyone spends even if their indexing clients are already well behaved and only send "real" values.

Adding it as an optional UpdateProcessor makes it something that only people who need hand-holding have to spend cycles on.

bq. ... confused that the empty string was being indexed at all, for fields that aren't even numbers. They thought this was equivalent to not sending it any value. I haven't verified this first hand but I believe it.

Nope: there are many use cases for both strings and numbers where you may need to "skip" a value in a multiValued field -- parallel arrays and such. ... it's actually one of the main situations we still have where IntField comes in handy (besides just supporting completely legacy Lucene indexes)
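
To illustrate the parallel-arrays point, here is a small standalone sketch in plain Java (no Solr classes). The field names "names" and "phones" and the helper describe() are made up for illustration; the lists stand in for two multiValued fields that a client reads back by position.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: two lists stand in for parallel multiValued fields.
class ParallelFieldsExample {
    // Pairs up values by position, as a client reading two parallel fields might.
    static String describe(List<String> names, List<String> phones, int i) {
        String phone = phones.get(i);
        return names.get(i) + ": " + (phone.isEmpty() ? "(none)" : phone);
    }

    public static void main(String[] args) {
        List<String> names = Arrays.asList("alice", "bob", "carol");
        // The empty string is a deliberate placeholder: bob has no phone number.
        List<String> phones = Arrays.asList("555-0100", "", "555-0102");
        for (int i = 0; i < names.size(); i++) {
            System.out.println(describe(names, phones, i));
        }
    }
}
```

If the empty string were silently dropped from "phones", the remaining values would shift left and bob would be paired with carol's number, which is why pruning cannot simply become the default for every field.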

> UpdateProcessor to prune "empty" values
> ---------------------------------------
>
>                 Key: SOLR-1772
>                 URL: https://issues.apache.org/jira/browse/SOLR-1772
>             Project: Solr
>          Issue Type: Wish
>            Reporter: Hoss Man
>
> Users seem to frequently get confused when some FieldTypes (typically the numeric ones) complain about invalid field values when they inadvertently index an empty string.
> It would be cool to provide an UpdateProcessor that makes it easy to strip out any fields being added as empty values ... it could be configured using field (and/or field type) names or globs to select/ignore certain fields -- i haven't thought it through all that hard
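
A rough sketch of the pruning logic described in the issue, written as plain Java so it runs without Solr on the classpath. Everything here is an assumption for illustration: the class name EmptyValuePruner, the Map-of-lists stand-in for an input document, the "*"-only glob syntax, and the choice to treat null as empty. It is not an existing Solr API, and a real implementation would presumably hook into the update processor chain instead.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical stand-in for the proposed processor; not part of Solr.
class EmptyValuePruner {
    private final List<Pattern> fieldPatterns = new ArrayList<>();

    // Each glob (e.g. "price_*") selects field names to prune; "*" matches any run of characters.
    EmptyValuePruner(List<String> fieldGlobs) {
        for (String glob : fieldGlobs) {
            String[] parts = glob.split("\\*", -1);
            StringBuilder regex = new StringBuilder();
            for (int i = 0; i < parts.length; i++) {
                if (i > 0) regex.append(".*");
                regex.append(Pattern.quote(parts[i]));
            }
            fieldPatterns.add(Pattern.compile(regex.toString()));
        }
    }

    private boolean isSelected(String fieldName) {
        for (Pattern p : fieldPatterns) {
            if (p.matcher(fieldName).matches()) return true;
        }
        return false;
    }

    // Assumption: "empty" means null or a zero-length string.
    private static boolean isEmpty(Object value) {
        return value == null
            || (value instanceof CharSequence && ((CharSequence) value).length() == 0);
    }

    // Returns a copy of the document with empty values removed from selected fields.
    // A selected field left with no values is dropped; unselected fields pass through untouched.
    Map<String, List<Object>> prune(Map<String, List<Object>> doc) {
        Map<String, List<Object>> result = new LinkedHashMap<>();
        for (Map.Entry<String, List<Object>> entry : doc.entrySet()) {
            boolean selected = isSelected(entry.getKey());
            List<Object> kept = new ArrayList<>();
            for (Object value : entry.getValue()) {
                if (selected && isEmpty(value)) continue;
                kept.add(value);
            }
            if (!selected || !kept.isEmpty()) result.put(entry.getKey(), kept);
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, List<Object>> doc = new LinkedHashMap<>();
        doc.put("name", new ArrayList<Object>(List.of("widget")));
        doc.put("price_f", new ArrayList<Object>(List.of("")));
        doc.put("tags", new ArrayList<Object>(List.of("a", "", "b")));
        EmptyValuePruner pruner = new EmptyValuePruner(List.of("price_*"));
        System.out.println(pruner.prune(doc));
    }
}
```

Selecting fields by name keeps the behaviour opt-in: here only fields matching "price_*" are pruned, so a field like "tags" that uses the empty string as a positional placeholder keeps it.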
