Posted to dev@lucene.apache.org by "Dennis Gove (JIRA)" <ji...@apache.org> on 2015/11/19 03:28:10 UTC

[jira] [Issue Comment Deleted] (SOLR-8281) Add RollupMergeStream to Streaming API

     [ https://issues.apache.org/jira/browse/SOLR-8281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dennis Gove updated SOLR-8281:
------------------------------
    Comment: was deleted

(was: To be honest I think this logic should live in the ParallelStream. As a user of this stream I would expect it to properly merge all workers together, including metrics calculated in those workers. )

> Add RollupMergeStream to Streaming API
> --------------------------------------
>
>                 Key: SOLR-8281
>                 URL: https://issues.apache.org/jira/browse/SOLR-8281
>             Project: Solr
>          Issue Type: Bug
>            Reporter: Joel Bernstein
>            Assignee: Joel Bernstein
>
> The RollupMergeStream merges the aggregate results emitted by the RollupStream on *worker* nodes.
> It is designed to be used together with the HashJoinStream to perform rollup aggregations on the joined tuples. The HashJoinStream requires the tuples to be partitioned on the join keys, so the same *group by* value can appear on more than one worker. To avoid repartitioning on the *group by* fields for the RollupStream, we can merge the rolled-up tuples coming from the workers.
> The construct would look like this:
> {code}
> mergeRollup (...
>     parallel (...
>         rollup (...
>             hashJoin (
>                 search(...),
>                 search(...),
>                 on="fieldA"
>             )
>         )
>     )
> )
> {code}
> The pseudocode above would push the *hashJoin* and *rollup* to the *worker* nodes. The rolled-up tuples emitted by the workers would then be merged by *mergeRollup*.
>  
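
To make the merge step in the description concrete, here is a minimal Java sketch of combining per-group partial aggregates from several workers. It is only an illustration under stated assumptions: RollupMergeSketch, Partial and mergeRollups are hypothetical names, the sketch does not use the Solr Streaming API or its Tuple class, and it is not the implementation proposed in this issue. It assumes each worker emits a count, sum, min and max per *group by* value.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch: not Solr's RollupMergeStream and not the Solr Tuple API.
public class RollupMergeSketch {

    /** Partial aggregate for one group, as one worker might emit it (assumed shape). */
    public static final class Partial {
        public final long count;
        public final double sum;
        public final double min;
        public final double max;

        public Partial(long count, double sum, double min, double max) {
            this.count = count;
            this.sum = sum;
            this.min = min;
            this.max = max;
        }

        /** Combine two partials that belong to the same group. */
        public Partial merge(Partial other) {
            return new Partial(count + other.count, sum + other.sum,
                    Math.min(min, other.min), Math.max(max, other.max));
        }

        /** The average is derived from the merged sum and count, not from worker averages. */
        public double avg() {
            return count == 0 ? 0.0 : sum / count;
        }
    }

    /** Merge the per-group partials of all workers into one map keyed by group value. */
    public static Map<String, Partial> mergeRollups(List<Map<String, Partial>> workerOutputs) {
        Map<String, Partial> merged = new TreeMap<>();
        for (Map<String, Partial> worker : workerOutputs) {
            for (Map.Entry<String, Partial> e : worker.entrySet()) {
                // Insert the partial if the group is new, otherwise combine with the existing one.
                merged.merge(e.getKey(), e.getValue(), Partial::merge);
            }
        }
        return merged;
    }

    public static void main(String[] args) {
        // Group "a" shows up on both workers because partitioning was on the join key.
        Map<String, Partial> worker1 = new TreeMap<>();
        worker1.put("a", new Partial(2, 10.0, 3.0, 7.0));
        Map<String, Partial> worker2 = new TreeMap<>();
        worker2.put("a", new Partial(1, 20.0, 20.0, 20.0));
        worker2.put("b", new Partial(1, 5.0, 5.0, 5.0));

        List<Map<String, Partial>> outputs = new ArrayList<>();
        outputs.add(worker1);
        outputs.add(worker2);

        for (Map.Entry<String, Partial> e : mergeRollups(outputs).entrySet()) {
            Partial p = e.getValue();
            System.out.println(e.getKey() + " count=" + p.count + " sum=" + p.sum
                    + " min=" + p.min + " max=" + p.max + " avg=" + p.avg());
        }
    }
}
```

Counts and sums merge by addition and min/max by comparison. An average has to be recomputed from the merged sum and count, because averaging the workers' averages would weight each worker equally regardless of how many tuples it saw.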



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
