You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@solr.apache.org by "Joel Bernstein (Jira)" <ji...@apache.org> on 2022/11/04 20:38:00 UTC
[jira] [Created] (SOLR-16524) Index time hash partitioning
Joel Bernstein created SOLR-16524:
-------------------------------------
Summary: Index time hash partitioning
Key: SOLR-16524
URL: https://issues.apache.org/jira/browse/SOLR-16524
Project: Solr
Issue Type: Improvement
Security Level: Public (Default Security Level. Issues are Public)
Reporter: Joel Bernstein
Both Streaming Expressions and Spark-Solr currently rely on the query time hash partitioning using the HashQParserPlugin. The query time hash partitioning, although extremely flexible, is also very slow when it build its initial filters.
This ticket will add an indexing time hash partitioner that Streaming Expressions and Spark-solr will both be able to use.
When this ticket is complete I'll also update the ParallelStream and Spark-Solr to be able to use the index time partitioning rather than the HashQParserPlugin.
This is a stepping stone towards much more performant parallel distributed joins.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@solr.apache.org
For additional commands, e-mail: issues-help@solr.apache.org