Posted to dev@lucene.apache.org by "Shai Erera (JIRA)" <ji...@apache.org> on 2011/01/27 11:17:44 UTC
[jira] Closed: (LUCENE-606) Change behavior of ParallelReader.document(int)

     [ https://issues.apache.org/jira/browse/LUCENE-606?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shai Erera closed LUCENE-606.
-----------------------------

    Resolution: Won't Fix

Closing due to long inactivity, and also because I don't think it's a bug. IndexReaders today expose a getSequentialSubReaders API which allows one to fetch a Document's fields from whatever sub-reader they want. Also, the general concept behind ParallelReader is that fields do not exist in more than 1 sub-reader.

> Change behavior of ParallelReader.document(int)
> -----------------------------------------------
>
>                 Key: LUCENE-606
>                 URL: https://issues.apache.org/jira/browse/LUCENE-606
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Index
>    Affects Versions: 2.0.0
>            Reporter: Christian Kohlschuetter
>            Priority: Minor
>         Attachments: ParallelReaderTest2.java, patch-allfields.diff
>
>
> Currently, the returned documents contain, for each field, the stored data from all enclosed IndexReaders which contain the corresponding field.
> That is, a call to ParallelReader.document(doc).getFields(fieldName) returns an array of possibly several Field objects. Since null entries are disallowed, there is no way to determine exactly which IndexReader a given piece of field data belongs to.
> On the other hand, a search for a term on that field only yields results if that term was contained in the *first* matching IndexReader which contained the field.
> Thus, when merging the ParallelReader contents to another IndexWriter, the indexed data does not correspond to the stored information.
> I am not sure whether this can be considered a bug (in some cases, this may be exactly what is required). However, I would like to see an option to change this behaviour.
> I suggest a parameter for ParallelReader which specifies whether stored data from all IndexReaders or only from the one which is responsible for the field's indexed data will be returned by ParallelReader.document(int).
> Please find my proposed implementation attached, as well as a JUnit testcase.
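
The mismatch described in the issue can be illustrated with a small toy model. This is only a sketch in plain Java, not Lucene code and not the attached patch: the class ParallelReaderModel, its methods, and the allReaders flag are hypothetical names made up for illustration, and each "reader" is reduced to a single document stored as a map from field name to stored value. It assumes, as the description states, that term searches consult only the first reader that contains a field.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Toy model (hypothetical, not Lucene's API): each reader is one document, field name -> stored value. */
public class ParallelReaderModel {
    private final List<Map<String, String>> readers = new ArrayList<>();
    // Field name -> index of the first reader that contained it; that reader answers term searches.
    private final Map<String, Integer> fieldToReader = new LinkedHashMap<>();

    /** Registers a reader; a field is owned by the first reader that declares it. */
    public void add(Map<String, String> reader) {
        int index = readers.size();
        readers.add(reader);
        for (String field : reader.keySet()) {
            fieldToReader.putIfAbsent(field, index);
        }
    }

    /**
     * Stored values for a field. allReaders = true models the behaviour described in the issue
     * (values from every reader); false models the proposed option (owning reader only).
     */
    public List<String> storedValues(String field, boolean allReaders) {
        List<String> values = new ArrayList<>();
        for (int i = 0; i < readers.size(); i++) {
            String value = readers.get(i).get(field);
            if (value == null) {
                continue; // this reader does not have the field
            }
            if (allReaders || fieldToReader.get(field) == i) {
                values.add(value);
            }
        }
        return values;
    }

    /** Term search: only the first reader containing the field is consulted. */
    public boolean matches(String field, String term) {
        Integer owner = fieldToReader.get(field);
        return owner != null && readers.get(owner).get(field).contains(term);
    }

    public static void main(String[] args) {
        ParallelReaderModel pr = new ParallelReaderModel();
        pr.add(Map.of("body", "apple"));
        pr.add(Map.of("body", "banana"));
        System.out.println(pr.storedValues("body", true));  // [apple, banana]
        System.out.println(pr.matches("body", "banana"));   // false
        System.out.println(pr.storedValues("body", false)); // [apple]
    }
}
```

In this model, "banana" is returned as stored data but cannot be found by a search, which is the inconsistency that shows up when the combined contents are re-indexed elsewhere. With the flag set to false, the stored data and the searchable data come from the same reader.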
