You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Joel Bernstein (JIRA)" <ji...@apache.org> on 2015/06/08 04:54:01 UTC

[jira] [Updated] (SOLR-7560) Parallel SQL Support

     [ https://issues.apache.org/jira/browse/SOLR-7560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joel Bernstein updated SOLR-7560:
---------------------------------
    Attachment: SOLR-7560.patch

Initial patch implements parallel Group By with aggregates. Very basic tests included. It shows the strategy of compiling SQL statements to Streaming API objects. 

SQL Tables are mapped to SolrCloud collections using a simple table mapping configured in the SolrConfig.

Sample Group By:
{code}
select str_s, count(*), sum(field_i), min(field_i), max(field_i), avg(field_i) from mytable where text='XXXX' group by str_s order by sum(field_i) asc limit 2
{code}


> Parallel SQL Support
> --------------------
>
>                 Key: SOLR-7560
>                 URL: https://issues.apache.org/jira/browse/SOLR-7560
>             Project: Solr
>          Issue Type: New Feature
>          Components: clients - java, search
>            Reporter: Joel Bernstein
>             Fix For: 5.3
>
>         Attachments: SOLR-7560.patch
>
>
> This ticket provides support for executing *Parallel SQL* queries across SolrCloud collections. The SQL engine will be built on top of the Streaming API (SOLR-7082), which provides support for *parallel relational algebra* and *real-time map-reduce*.
> Basic design:
> 1) A new SQLHandler will be added to process SQL requests. The SQL statements will be compiled to live Streaming API objects for parallel execution across SolrCloud worker nodes.
> 2) SolrCloud collections will be abstracted as *Relational Tables*. 
> 3) The Presto SQL parser will be used to parse the SQL statements.
> 4) A JDBC thin client will be added as a Solrj client.
> This ticket will focus on putting the framework in place and providing basic SELECT support and GROUP BY aggregate support.
> Future releases will build on this framework to provide additional SQL features.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org