You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Ramkumar Aiyengar (JIRA)" <ji...@apache.org> on 2014/09/17 08:39:34 UTC

[jira] [Commented] (SOLR-6526) Solr Streaming API

    [ https://issues.apache.org/jira/browse/SOLR-6526?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14136858#comment-14136858 ] 

Ramkumar Aiyengar commented on SOLR-6526:
-----------------------------------------

Should the interface be tied to export? One nice future extension would be to have a streaming API which indexes to shard leaders directly (we currently can do direct indexing but that's not streamed).

> Solr Streaming API
> ------------------
>
>                 Key: SOLR-6526
>                 URL: https://issues.apache.org/jira/browse/SOLR-6526
>             Project: Solr
>          Issue Type: New Feature
>          Components: clients - java
>            Reporter: Joel Bernstein
>             Fix For: 5.0
>
>         Attachments: SOLR-6526.patch
>
>
> It would be great if there was a SolrJ library that could connect to Solr's /export handler (SOLR-5244) and perform streaming operations on the sorted result sets.
> This ticket defines the base interfaces and implementations for the Streaming API. The base API contains three classes:
> *SolrStream*: This represents a stream from a single Solr instance. It speaks directly to the /export handler and provides methods to read() Tuples and close() the stream
> *CloudSolrStream*: This represents a stream from a SolrCloud collection. It speaks with Zk to discover the Solr instances in the collection and then creates SolrStreams to make the requests. The results from the underlying streams are merged inline to produce a single sorted stream of tuples.
> *Tuple*: The data structure returned by the read() method of the SolrStream API. It is nested to support grouping and Cartesian product set operations.
> Once these base classes are implemented it paves the way for building *Decorator* streams that perform operations on the sorted Tuple sets. For example a CollapseStream could be created:
> {code}
> CollapseStream collapseStream = new CollapseStream(new CloudSolrStream(zkUrl, queryRequest));
> Tuple tuple = null;
> while((tuple = collapseStream.read()) != null) {
> } 
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org