You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Mark Harwood (JIRA)" <ji...@apache.org> on 2010/05/11 13:08:41 UTC

[jira] Commented: (LUCENE-2454) Nested Document query support

    [ https://issues.apache.org/jira/browse/LUCENE-2454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866128#action_12866128 ] 

Mark Harwood commented on LUCENE-2454:
--------------------------------------

Robust use of this feature is dependent on careful management of segments i.e. that all compound documents are held in the same segment.

Michael Busch suggested the introduction of a new "FlushPolicy" on IndexWriter to offer the required control. (see http://mail-archives.apache.org/mod_mbox/lucene-dev/201005.mbox/%3C4BE5A14C.6040108@gmail.com%3E )
Sounds sensible to me given that IndexWriter currently manages to muddle 2 alternative policies in the one implementation and it looks like we now need a third.

Is this the place to start the debate on "FlushPolicy" ?
My guess is this change would involve :
* Deprecating/removing IndexWriter's setMaxBufferedDocs and setRAMBufferSizeMB.
* Providing a new "FlushPolicy" abstract class that is called with a "BufferContext " class to hold number buffered docs + ram usage. FlushPolicy is asked if flushing of various structures should be triggered given the context
* Provide default implementations of FlushPolicy that are number-of-documents-based and RAM-based.
* Provide a special "NestedDocumentFlushPolicy" that can wrap any other policy (ram/num docs) but only triggers flushes when application code has primed it to say a batch of related documents is completed.

Let me know where it's best to continue the thinking on these IndexWriter changes.

> Nested Document query support
> -----------------------------
>
>                 Key: LUCENE-2454
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2454
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Search
>    Affects Versions: 3.0.2
>            Reporter: Mark Harwood
>            Assignee: Mark Harwood
>            Priority: Minor
>         Attachments: LuceneNestedDocumentSupport-1.zip
>
>
> A facility for querying nested documents in a Lucene index as outlined in http://www.slideshare.net/MarkHarwood/proposal-for-nested-document-support-in-lucene

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org