You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Ivan Vasilev <iv...@sirma.bg> on 2008/02/01 20:00:58 UTC
Lucene File Formats web page
Hi Guys,
In the File Formats web page
(http://lucene.apache.org/java/2_3_0/fileformats.html) there is section
describing Segments File, where we read:
Segments --> Format, Version, NameCounter, ...
...
Format is -1 as of Lucene 1.4 and -3
(SemgentInfos.FORMAT_SINGLE_NORM_FILE) as of Lucene 2.1.
...
On my opinion the last sentence is not completely right. I mean it
should contain addition somthing like this:
1) "and -4 (SemgentInfos.FORMAT_SHARED_DOC_STORE) as of Lucene 2.3."
or like this:
2) "and as of Lucene 2.3 -4 (SemgentInfos.FORMAT_SHARED_DOC_STORE) in
cases when shared stored fields and term vectors exist or -3
(SemgentInfos.FORMAT_SINGLE_NORM_FILE) when no sharing exists."
So my question is what of the two suggestions is correct?
According to my tests with our IndexRecoverer tool the 1) is correct.
Here I will have to mention what does this tool. It creates segments
file out of given set of segments. I did not made a lot of tests yet but
in one single case when I created segments file for a segments set that
does not contain any shared data when I use
Format=SemgentInfos.FORMAT_SINGLE_NORM_FILE Luke can not open it using
Lucene 2.3, but when Format=SemgentInfos.FORMAT_SHARED_DOC_STORE it
opens it correctly.
Best Regards,
Ivan
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Lucene File Formats web page
Posted by Michael McCandless <lu...@mikemccandless.com>.
Woops, you are correct, the file formats doc is out of date.
It should be #1.
I'll fix it -- thank you for raising it!
Mike
Ivan Vasilev wrote:
> Hi Guys,
>
> In the File Formats web page (http://lucene.apache.org/java/2_3_0/
> fileformats.html) there is section describing Segments File, where
> we read:
>
> Segments --> Format, Version, NameCounter, ...
> ...
> Format is -1 as of Lucene 1.4 and -3
> (SemgentInfos.FORMAT_SINGLE_NORM_FILE) as of Lucene 2.1.
> ...
>
> On my opinion the last sentence is not completely right. I mean it
> should contain addition somthing like this:
> 1) "and -4 (SemgentInfos.FORMAT_SHARED_DOC_STORE) as of Lucene 2.3."
> or like this:
> 2) "and as of Lucene 2.3 -4 (SemgentInfos.FORMAT_SHARED_DOC_STORE)
> in cases when shared stored fields and term vectors exist or -3
> (SemgentInfos.FORMAT_SINGLE_NORM_FILE) when no sharing exists."
>
> So my question is what of the two suggestions is correct?
> According to my tests with our IndexRecoverer tool the 1) is
> correct. Here I will have to mention what does this tool. It
> creates segments file out of given set of segments. I did not made
> a lot of tests yet but in one single case when I created segments
> file for a segments set that does not contain any shared data when
> I use Format=SemgentInfos.FORMAT_SINGLE_NORM_FILE Luke can not open
> it using Lucene 2.3, but when
> Format=SemgentInfos.FORMAT_SHARED_DOC_STORE it opens it correctly.
>
> Best Regards,
> Ivan
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org