You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Itamar Syn-Hershko (JIRA)" <ji...@apache.org> on 2013/05/21 13:25:19 UTC
[jira] [Created] (LUCENE-5011) MemoryIndex and FVH don't play along
with multi-value fields
Itamar Syn-Hershko created LUCENE-5011:
------------------------------------------
Summary: MemoryIndex and FVH don't play along with multi-value fields
Key: LUCENE-5011
URL: https://issues.apache.org/jira/browse/LUCENE-5011
Project: Lucene - Core
Issue Type: Bug
Affects Versions: 4.3
Reporter: Itamar Syn-Hershko
When multi-value fields are indexed to a MemoryIndex, positions are computed correctly on search but the start and end offsets and the values array index aren't correct.
Comparing the same execution path for IndexReader on a Directory impl and MemoryIndex (same document, same query, same analyzer, different Index impl), the difference first shows in FieldTermStack.java line 125:
termList.add( new TermInfo( term, dpEnum.startOffset(), dpEnum.endOffset(), pos, weight ) );
dpEnum.startOffset() and dpEnum.endOffset don't match between implementations.
This looks like a bug in MemoryIndex, which doesn't seem to handle tokenized multi-value fields all too well when positions and offsets are required.
I should also mention we are using an Analyzer which outputs several tokens at a position (a la SynonymFilter), but I don't believe this is related.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org