You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Aroop (Jira)" <ji...@apache.org> on 2020/07/01 17:31:00 UTC

[jira] [Created] (SOLR-14614) Add Simplified Aggregation Interface to Streaming Expression

Aroop created SOLR-14614:
----------------------------

             Summary: Add Simplified Aggregation Interface to Streaming Expression
                 Key: SOLR-14614
                 URL: https://issues.apache.org/jira/browse/SOLR-14614
             Project: Solr
          Issue Type: Improvement
      Security Level: Public (Default Security Level. Issues are Public)
          Components: query, query parsers, streaming expressions
    Affects Versions: 8.4.1, 7.7.2
            Reporter: Aroop


For the Data Analytics use cases the standard use case is:
 # Find a pattern
 # Then Aggregate by certain dimensions
 # Then compute metrics (like count, sum, avg)
 # Sort by a dimension or metric
 # look at top-n

This functionality has been available over many different interfaces in the past on solr, but only streaming expressions have the ability to deliver results in a scalable, performant and stable manner for systems that have large data to the tune of Big data systems.

However, one barrier to entry is the query interface, not being simple enough in streaming expressions.

This Jira is to track the work of creating a simplified analytics endpoint augmenting streaming expressions.

a starting proposal is to have the endpoint have these query parameters:
{code:java}
/analytics&q=*:*&fq=name:alex*&dimensions=age,city&metrics=count&sort=count&sortOrder=desc&limit=10{code}
This is equivalent to a sql that an analyst would write:
{code:java}
select age, city, count(*) from collection where name like 'alex%'
group by age, city order by age desc limit 10;{code}
 

On the solr side this would get translated to the best possible streaming expression using _*rollups, top, sort, plist* etc.; b_ut all done transparently to the user.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org