Posted to dev@lucene.apache.org by "Joel Bernstein (JIRA)" <ji...@apache.org> on 2016/04/10 18:06:25 UTC

[jira] [Created] (SOLR-8965) Add Path reduce operation to aggregate paths taken in a session

Joel Bernstein created SOLR-8965:
------------------------------------

             Summary: Add Path reduce operation to aggregate paths taken in a session
                 Key: SOLR-8965
                 URL: https://issues.apache.org/jira/browse/SOLR-8965
             Project: Solr
          Issue Type: New Feature
            Reporter: Joel Bernstein


One of the things that is tricky to do at scale is session aggregation. MapReduce, of course, makes this easy. Now that we have MapReduce, it would be good to add some session aggregations to the base library.

The Path reduce operation can be used with the *reduce* function to concatenate the path taken in a session into a single field. These path records can then be added to another SolrCloud collection using the update stream. Once they have been consolidated in another collection, aggregations can be run on the paths using the RollupStream.
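
As a rough illustration of the idea (not the proposed Solr implementation), the sketch below groups tuples by session and joins the steps into one path field. It assumes the tuples arrive already sorted by session and then by time. The names reduce_paths, sessionId, page and path, and the "/" separator, are hypothetical and chosen only for this example.

```python
from itertools import groupby
from operator import itemgetter


def reduce_paths(tuples, session_field="sessionId", step_field="page", sep="/"):
    """Collapse consecutive tuples that share a session id into one path record.

    Assumes the input is already sorted by session id and, within a
    session, by the order in which the steps were taken.
    """
    for session_id, group in groupby(tuples, key=itemgetter(session_field)):
        # Concatenate every step seen in this session into a single field.
        steps = [t[step_field] for t in group]
        yield {session_field: session_id, "path": sep.join(steps)}


# Hypothetical click events, sorted by session and then by time.
events = [
    {"sessionId": "s1", "page": "home"},
    {"sessionId": "s1", "page": "search"},
    {"sessionId": "s1", "page": "checkout"},
    {"sessionId": "s2", "page": "home"},
    {"sessionId": "s2", "page": "search"},
]

path_records = list(reduce_paths(events))
```

Each resulting record carries one session id and one concatenated path, which is the shape that could then be written to a second collection for later aggregation.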

A HashRollupStream could also be developed to aggregate the paths as they are reduced. The HashRollupStream would keep all the paths in a hash map during the aggregation, so it would not require the paths to be received in order.
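
The hash-based rollup idea can be sketched in the same way. This is only an assumption about how such a stream might behave, not an existing Solr class: it counts path records in an in-memory map, so the result does not depend on the order in which records arrive. The function name hash_rollup and the path field are hypothetical.

```python
from collections import Counter


def hash_rollup(path_records, path_field="path"):
    """Count how many sessions took each path, using an in-memory hash map.

    Because every path is kept in the map until the input is exhausted,
    the records can arrive in any order.
    """
    counts = Counter()
    for record in path_records:
        counts[record[path_field]] += 1
    return dict(counts)


# Hypothetical path records, deliberately not sorted by path.
records = [
    {"sessionId": "s1", "path": "home/search/checkout"},
    {"sessionId": "s2", "path": "home/search"},
    {"sessionId": "s3", "path": "home/search/checkout"},
]

path_counts = hash_rollup(records)
```

The trade-off is memory: all distinct paths are held in the map at once, whereas a sort-based rollup only needs the current group.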



