Posted to dev@lucene.apache.org by "Areek Zillur (JIRA)" <ji...@apache.org> on 2013/11/05 10:00:26 UTC

[jira] [Created] (LUCENE-5329) Make DocumentDictionary and co more lenient to dirty documents

Areek Zillur created LUCENE-5329:
------------------------------------

             Summary: Make DocumentDictionary and co more lenient to dirty documents
                 Key: LUCENE-5329
                 URL: https://issues.apache.org/jira/browse/LUCENE-5329
             Project: Lucene - Core
          Issue Type: Improvement
          Components: core/search
            Reporter: Areek Zillur


Currently, DocumentDictionary errors out whenever a document lacks a value for any of the relevant stored fields. It would be nice to make it lenient and ignore such invalid documents instead.
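
The lenient behaviour described above can be illustrated with a minimal sketch. It does not use the Lucene API: documents are modelled as plain maps from stored field name to value, and the class and method names (LenientIterationSketch, suggestions) are hypothetical, chosen only for illustration. The point is that a document without a value for the suggestion field is skipped rather than causing an error.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: documents are modelled as maps of stored field name
// to value. This is NOT the Lucene DocumentDictionary API.
class LenientIterationSketch {

    // Returns the suggestion text of every document that has a value for
    // 'field', silently skipping documents that lack one.
    static List<String> suggestions(List<Map<String, String>> docs, String field) {
        List<String> result = new ArrayList<>();
        for (Map<String, String> doc : docs) {
            String value = doc.get(field);
            if (value == null) {
                continue; // dirty document: ignore it instead of throwing
            }
            result.add(value);
        }
        return result;
    }

    public static void main(String[] args) {
        Map<String, String> good = new HashMap<>();
        good.put("title", "lucene in action");

        Map<String, String> dirty = new HashMap<>(); // has no "title" value
        dirty.put("body", "some text");

        List<Map<String, String>> docs = new ArrayList<>();
        docs.add(good);
        docs.add(dirty);

        // Only the first document contributes a suggestion.
        System.out.println(suggestions(docs, "title"));
    }
}
```

A strict variant would throw on the null check instead of continuing, which is roughly the current behaviour this issue proposes to relax.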

Another "issue" with the DocumentDictionary is that it only allows string fields as suggestions and binary fields as payloads. When exposing these dictionaries to Solr (via https://issues.apache.org/jira/browse/SOLR-5378), it is inconvenient for the user to ensure that a suggestion field is a string field and a payload field is a binary field. It would be nice to have the dictionary "just work" whenever either a string or a binary field is passed as the suggestion or payload field. The patch provides one solution to this problem (by accepting string or binary values), though other solutions would be welcome, as long as they do not make the DocumentDictionary "too flexible".
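
As a rough illustration of "accepting string or binary values", the sketch below coerces a stored value of either kind into bytes. It is not the patch and does not use Lucene types: the stored value is modelled as a plain Object that may be a String or a byte[], and the names FieldCoercionSketch and toBytes are hypothetical. Encoding strings as UTF-8 is also an assumption of this sketch.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch: a stored field value is modelled as an Object that is
// either a String or a byte[]. This is NOT the Lucene API.
class FieldCoercionSketch {

    // Returns the value as bytes: a byte[] is returned as-is, a String is
    // encoded as UTF-8 (an assumption), and anything else (including null)
    // yields null so the caller can decide to skip the document.
    static byte[] toBytes(Object storedValue) {
        if (storedValue instanceof byte[]) {
            return (byte[]) storedValue;
        }
        if (storedValue instanceof String) {
            return ((String) storedValue).getBytes(StandardCharsets.UTF_8);
        }
        return null;
    }

    public static void main(String[] args) {
        // A string value and a binary value are both accepted.
        System.out.println(Arrays.toString(toBytes("ab")));
        System.out.println(Arrays.toString(toBytes(new byte[] {1, 2})));
        // An unsupported value type is reported as null rather than an error.
        System.out.println(toBytes(42) == null);
    }
}
```

Limiting the accepted types to exactly these two keeps the behaviour predictable, which is one way to avoid the dictionary becoming "too flexible".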



--
This message was sent by Atlassian JIRA
(v6.1#6144)
