You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Michael Gibney (Jira)" <ji...@apache.org> on 2020/05/01 21:16:00 UTC
[jira] [Created] (SOLR-14454) support for UTF-8 (string) types with
DocValuesType.BINARY
Michael Gibney created SOLR-14454:
-------------------------------------
Summary: support for UTF-8 (string) types with DocValuesType.BINARY
Key: SOLR-14454
URL: https://issues.apache.org/jira/browse/SOLR-14454
Project: Solr
Issue Type: New Feature
Security Level: Public (Default Security Level. Issues are Public)
Components: Schema and Analysis
Affects Versions: master (9.0)
Reporter: Michael Gibney
The goal is to add support for string fields with arbitrarily large values in the {{/export}} handler and streaming expressions.
{{StrField}} values are currently limited to 32766 bytes for the case where {{indexed=true}} or {{docValues=true}}. Exceeding this value triggers an "immense field" warning, and causes indexing to fail for the associated input doc.
Configuring a {{StrField}} field as "{{indexed=false docValues=false}}" removes this size limitation, so it is already possible to have large _stored_ {{StrField}} values. But the "{{docValues=true}}" prerequisite for the {{/export}} handler (and consequently for streaming expressions) limits the size of field that can be used in conjunction with these features.
Adding support for UTF-8/string field types with {{DocValuesType.BINARY}} would address this limitation and allow considerable flexibility in the implementation of custom field types. N.b.: this would address field value retrieval use cases only (e.g., {{/export}} and {{useDocValuesAsStored}}); neither sorting nor faceting would be supported.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org