
Posted to java-user@lucene.apache.org by Erik Hatcher <er...@ehatchersolutions.com> on 2007/03/14 04:48:56 UTC

adding a field to every document

I'd like to add a field to every document in an index... that I'd  
rather not rebuild from scratch (yet).  This is behind Solr (so a  
ParallelReader won't work without core modifications, right?).

Is there a way I could create an index with the same number of  
documents and only the new field and "zip" it together with my  
existing index?

The new field is simply the same value for every document, in order  
to add new document sets segregated by "source".

	Erik


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Can we extract phrase from lucene index

Posted by karl wettin <ka...@gmail.com>.

On 14 Mar 2007, at 14:51, Bhavin Pandya wrote:

> What I am looking for is a dictionary for a spell checker.
> I am trying to customize the Lucene spell checker for phrases,
> so I am thinking that if I can somehow fetch phrases from the index
> itself, then I can train my spellchecker.
>
> I tried query logs, but they contain a lot of spelling mistakes...

You can try this:

https://issues.apache.org/jira/browse/LUCENE-626

-- 
karl




Re: Can we extract phrase from lucene index

Posted by Bhavin Pandya <bh...@rediff.co.in>.

Hi Erick,
What I am looking for is a dictionary for a spell checker.
I am trying to customize the Lucene spell checker for phrases,
so I am thinking that if I can somehow fetch phrases from the index itself,
then I can train my spellchecker.

I tried query logs, but they contain a lot of spelling mistakes...

Any suggestions?

Thanks.
Bhavin pandya
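
One possible way to collect phrase candidates for such a dictionary is to count runs of consecutive words and keep only the frequent ones. The sketch below is plain Java, not Lucene API, and it assumes the original document text (for example, stored field values) is still available to read. The class name, method names, and the minimum-count threshold are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

// Illustrative only: builds a phrase list from raw text, not from a Lucene index.
final class PhraseDictionarySketch {

    // Count every run of n consecutive words across the given texts.
    static Map<String, Integer> countPhrases(List<String> texts, int n) {
        Map<String, Integer> counts = new TreeMap<>();
        for (String text : texts) {
            // Lowercase and split on anything that is not a letter or digit.
            String[] raw = text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+");
            List<String> words = new ArrayList<>();
            for (String w : raw) {
                if (!w.isEmpty()) {
                    words.add(w);
                }
            }
            for (int i = 0; i + n <= words.size(); i++) {
                String phrase = String.join(" ", words.subList(i, i + n));
                counts.merge(phrase, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Keep only phrases seen at least minCount times, in alphabetical order.
    static List<String> dictionary(Map<String, Integer> counts, int minCount) {
        List<String> phrases = new ArrayList<>();
        for (Map.Entry<String, Integer> e : counts.entrySet()) {
            if (e.getValue() >= minCount) {
                phrases.add(e.getKey());
            }
        }
        return phrases;
    }

    public static void main(String[] args) {
        List<String> texts = new ArrayList<>();
        texts.add("New York is big");
        texts.add("I love new york");
        System.out.println(dictionary(countPhrases(texts, 2), 2));
    }
}
```

Filtering by frequency is a design choice here: rare word pairs are more likely to be noise or misspellings, which is the same problem the query logs had.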




Re: Can we extract phrase from lucene index

Posted by Erick Erickson <er...@gmail.com>.

Your problem statement lends itself to flippant answers like "just
use a PhraseQuery". So I clearly don't understand what you're trying
to accomplish. Are you trying to find all of the occurrences of a
particular phrase? All the phrases (however that's defined) for
all the documents? What problem are you trying to solve?

Best
Erick

On 3/14/07, Bhavin Pandya <bh...@rediff.co.in> wrote:
>
> Hello guys,
>
> I am using Lucene 1.9 and I have a 3 GB index.
> I know we can extract tokens from the index easily, but can we extract phrases?
>
> Regards.
> Bhavin pandya

Can we extract phrase from lucene index

Posted by Bhavin Pandya <bh...@rediff.co.in>.

Hello guys,

I am using Lucene 1.9 and I have a 3 GB index.
I know we can extract tokens from the index easily, but can we extract phrases?

Regards.
Bhavin pandya


Re: adding a field to every document

Posted by Daniel Noll <da...@nuix.com>.


> I'd like to add a field to every document in an index... that I'd rather
> not rebuild from scratch (yet). This is behind Solr (so a ParallelReader
> won't work without core modifications, right?).
>
> Is there a way I could create an index with the same number of documents
> and only the new field and "zip" it together with my existing index?


Well, IndexWriter#addIndexes does take an IndexReader. Could you just
create a ParallelReader over both indexes and then add that reader to a
new, empty IndexWriter? It seems like it would work in theory.
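
To make the "zip" idea concrete, here is a toy model in plain Java. It does not use any Lucene classes: each "index" is just a list of documents, and each document is a map of field names to values. The class and method names are hypothetical. What it illustrates is the constraint the approach relies on: the two indexes are joined purely by document position, so they need the same number of documents in the same order.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Toy model only: none of this is Lucene API.
final class ParallelZipSketch {

    // Merge two indexes document by document. Position is the only join key,
    // so a mismatch in document count is treated as an error.
    static List<Map<String, String>> zip(List<Map<String, String>> existing,
                                         List<Map<String, String>> extra) {
        if (existing.size() != extra.size()) {
            throw new IllegalArgumentException(
                "document counts differ: " + existing.size() + " vs " + extra.size());
        }
        List<Map<String, String>> merged = new ArrayList<>();
        for (int docId = 0; docId < existing.size(); docId++) {
            Map<String, String> doc = new LinkedHashMap<>(existing.get(docId));
            doc.putAll(extra.get(docId));
            merged.add(doc);
        }
        return merged;
    }

    // Build the single-field index: one document per existing document,
    // each carrying the same value.
    static List<Map<String, String>> constantField(int docCount, String field, String value) {
        List<Map<String, String>> index = new ArrayList<>();
        for (int i = 0; i < docCount; i++) {
            Map<String, String> doc = new LinkedHashMap<>();
            doc.put(field, value);
            index.add(doc);
        }
        return index;
    }

    public static void main(String[] args) {
        List<Map<String, String>> existing = new ArrayList<>();
        Map<String, String> first = new LinkedHashMap<>();
        first.put("title", "first");
        existing.add(first);
        Map<String, String> second = new LinkedHashMap<>();
        second.put("title", "second");
        existing.add(second);

        // "source" and "legacy" are made-up values for illustration.
        System.out.println(zip(existing, constantField(existing.size(), "source", "legacy")));
    }
}
```

In the real setup, the merged view would then be written out once into a new index, so that Solr only ever sees a single ordinary index.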

Daniel Noll

Nuix Pty Ltd
Suite 79, 89 Jones St, Ultimo NSW 2007, Australia    Ph: +61 2 9280 0699
Web: http://nuix.com/                               Fax: +61 2 9212 6902

