You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Erik Hatcher <er...@ehatchersolutions.com> on 2007/03/14 04:48:56 UTC
adding a field to every document
I'd like to add a field to every document in an index... that I'd
rather not rebuild from scratch (yet). This is behind Solr (so a
ParallelReader won't work without core modifications, right?).
Is there a way I could create an index with the same number of
documents and only the new field and "zip" it together with my
existing index?
The new field is simply the same value for every document, in order
to add new document sets segregated by "source".
Erik
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Can we extract phrase from lucene index
Posted by karl wettin <ka...@gmail.com>.
14 mar 2007 kl. 14.51 skrev Bhavin Pandya:
> what i am looking for is dictionary for spell checker.
> I am trying to customised lucene spell checker for phrase.
> so thinking if anyhow i am able to fetech phrases from the index
> itself then i can train my spellchecker.
>
> I tried with query logs but it has lot of spell mistakes...
You can try this:
https://issues.apache.org/jira/browse/LUCENE-626
--
karl
>
> Any suggestions..
>
> Thanks.
> Bhavin pandya
>
> ----- Original Message ----- From: "Erick Erickson"
> <er...@gmail.com>
> To: <ja...@lucene.apache.org>; "Bhavin Pandya"
> <bh...@rediff.co.in>
> Sent: Wednesday, March 14, 2007 6:29 PM
> Subject: Re: Can we extract phrase from lucene index
>
>
>> Your problem statement lends itself to flippant answers like "just
>> use a PhraseQuery". So I clearly don't understand what you're trying
>> to accomplish. Are you trying to find all of the occurrences of a
>> particular phrase? All the phrases (however that's defined) for
>> all the documents? What problem are you trying to solve?
>>
>>
>> Best
>> Erick
>>
>>
>> On 3/14/07, Bhavin Pandya <bh...@rediff.co.in> wrote:
>>>
>>> Hello guys,
>>>
>>> I am using lucene 1.9 and i have 3GB of index.
>>> I know we can extract tokens from index easily but can we extract
>>> phrase ?
>>>
>>> Regards.
>>> Bhavin pandya
>>>
>>> --------------------------------------------------------------------
>>> -
>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>>
>>>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Can we extract phrase from lucene index
Posted by Bhavin Pandya <bh...@rediff.co.in>.
Hi erick,
what i am looking for is dictionary for spell checker.
I am trying to customised lucene spell checker for phrase.
so thinking if anyhow i am able to fetech phrases from the index itself then
i can train my spellchecker.
I tried with query logs but it has lot of spell mistakes...
Any suggestions..
Thanks.
Bhavin pandya
----- Original Message -----
From: "Erick Erickson" <er...@gmail.com>
To: <ja...@lucene.apache.org>; "Bhavin Pandya" <bh...@rediff.co.in>
Sent: Wednesday, March 14, 2007 6:29 PM
Subject: Re: Can we extract phrase from lucene index
> Your problem statement lends itself to flippant answers like "just
> use a PhraseQuery". So I clearly don't understand what you're trying
> to accomplish. Are you trying to find all of the occurrences of a
> particular phrase? All the phrases (however that's defined) for
> all the documents? What problem are you trying to solve?
>
>
> Best
> Erick
>
>
> On 3/14/07, Bhavin Pandya <bh...@rediff.co.in> wrote:
>>
>> Hello guys,
>>
>> I am using lucene 1.9 and i have 3GB of index.
>> I know we can extract tokens from index easily but can we extract phrase
>> ?
>>
>> Regards.
>> Bhavin pandya
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: Can we extract phrase from lucene index
Posted by Erick Erickson <er...@gmail.com>.
Your problem statement lends itself to flippant answers like "just
use a PhraseQuery". So I clearly don't understand what you're trying
to accomplish. Are you trying to find all of the occurrences of a
particular phrase? All the phrases (however that's defined) for
all the documents? What problem are you trying to solve?
Best
Erick
On 3/14/07, Bhavin Pandya <bh...@rediff.co.in> wrote:
>
> Hello guys,
>
> I am using lucene 1.9 and i have 3GB of index.
> I know we can extract tokens from index easily but can we extract phrase ?
>
> Regards.
> Bhavin pandya
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>
Can we extract phrase from lucene index
Posted by Bhavin Pandya <bh...@rediff.co.in>.
Hello guys,
I am using lucene 1.9 and i have 3GB of index.
I know we can extract tokens from index easily but can we extract phrase ?
Regards.
Bhavin pandya
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org
Re: adding a field to every document
Posted by Daniel Noll <da...@nuix.com>.
> I'd like to add a field to every document in an index... that I'd rather
> not rebuild from scratch (yet). This is behind Solr (so a ParallelReader
> won't work without core modifications, right?).
>
> Is there a way I could create an index with the same number of documents
> and only the new field and "zip" it together with my existing index?
Well, IndexWriter#addIndexes does take an IndexReader. I don't suppose you
can just create a ParallelReader and then add that IndexReader to a new,
empty IndexWriter. It seems like it would work in theory.
Daniel Noll
Nuix Pty Ltd
Suite 79, 89 Jones St, Ultimo NSW 2007, Australia Ph: +61 2 9280 0699
Web: http://nuix.com/ Fax: +61 2 9212 6902
This message is intended only for the named recipient. If you are not
the intended recipient you are notified that disclosing, copying,
distributing or taking any action in reliance on the contents of this
message or attachment is strictly prohibited.
---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org