You are viewing a plain text version of this content. The canonical link for it is here.

Posted to java-user@lucene.apache.org by Peter Keegan <pe...@gmail.com> on 2007/07/12 02:19:04 UTC

Re: Payloads and PhraseQuery

I'm now looking at using payloads with SpanNearQuery but I don't see any
clear way of getting the payload(s) from the matching span terms. The term
positions for the payloads seem to be buried beneath SpanCells in the
NearSpansOrdered and NearSpansUnordered classes, which are not public. I'd
be content to be able to get the payload from just the first term of the
span.

Can anyone suggest an approach for making payloads work with SpanNearQuery?

Peter


On 6/27/07, Grant Ingersoll <gs...@apache.org> wrote:
>
> Could you get what you need combining the BoostingTermQuery with a
> SpanNearQuery to produce a score?  Just guessing here..
>
> At some point, I would like to see more Query classes around the
> payload stuff, so please submit patches/feedback if and when you get
> a solution
>
> On Jun 27, 2007, at 10:45 AM, Peter Keegan wrote:
>
> > I'm looking at the new Payload api and would like to use it in the
> > following
> > manner. Meta-data is indexed as a special phrase (all terms at same
> > position) and a payload is stored with the first term of each
> > phrase. I
> > would like to create a custom query class that extends PhraseQuery
> > and uses
> > its PhraseScorer to find matching documents. The custom query class
> > then
> > reads the payload from the first term of the matching query and
> > uses it to
> > produce a new score. However, I don't see how to get the payload
> > from the
> > PhraseScorer's TermPositions. Is this possible?
> >
> >
> > Peter
>
> --------------------------
> Grant Ingersoll
> Center for Natural Language Processing
> http://www.cnlp.org/tech/lucene.asp
>
> Read the Lucene Java FAQ at http://wiki.apache.org/lucene-java/LuceneFAQ
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Re: Payloads and PhraseQuery

Posted by Peter Keegan <pe...@gmail.com>.

Grant,

If/when you have an implementation for SpanNearQuery, I'd be happy to test
it.

Peter

On 7/12/07, Grant Ingersoll <gr...@gmail.com> wrote:
>
> Yep, totally agree.    One way to handle this initially at least is
> have isPayloadAvailable() only return true for the SpanTermQuery.
> The other option is to come up with some modification of the
> suggested methods below to return all the payloads in a span.
>
> I have a basic implementation for just the SpanTermQuery (i.e. via
> TermSpans) in the works.  I will take a crack at fleshing out the
> rest at some point soon.
>
> -Grant
>
> On Jul 12, 2007, at 1:22 PM, Paul Elschot wrote:
>
> >
> > On Thursday 12 July 2007 14:50, Grant Ingersoll wrote:
> >> That is off of the TermSpans class.  BTQ (BoostingTermQuery) is
> >> implemented to extend SpanQuery, thus SpanNearQuery isn't, w/o
> >> modification, going to have access to these things.  However, if you
> >> look at the SpanTermQuery, you will see that it's implementation of
> >> Spans is indeed the TermSpans class.  So, I think you could cast to
> >> it or handle it through instanceof.
> >>
> >> I am not completely sure here, but it seems like we may need an
> >> efficient way to access the TermPositions for each document.  That
> >> is, the Spans class doesn't provide this and maybe it should
> >> somehow.  Again, I am just thinking out loud here.
> >
> > SpanQueries can be nested, so the relationship between a span
> > and a term position can also be one to many, not only one to one.
> > For example a matching span in the Spans of a SpanNearQuery
> > can be based on two matching (near enough to match) term positions.
> >
> >>
> >> Thus, if we modified Spans to have the following methods:
> >>
> >> byte[] getPayload(byte[] data, int offset)
> >>
> >> boolean isPayloadAvailable()
> >>
> >> I think this would be useful.  Perhaps this should be discussed on
> >> dev.
> >
> > And the same holds for the payloads, there many be more than one
> > for a single Span.
> >
> > Regards,
> > Paul Elschot
> >
> >>
> >> Cheers,
> >> Grant
> >>
> >>
> >> On Jul 12, 2007, at 8:20 AM, Peter Keegan wrote:
> >>
> >>> I'm looking for Spans.getPositions(), as shown in
> >>> BoostingTermQuery, but
> >>> neither NearSpansOrdered nor NearSpansUnordered (which are the Spans
> >>> provided by SpanNearQuery) provide this method and it's not clear
> >>> to me how
> >>> to add it.
> >>>
> >>> Peter
> >>>
> >>> On 7/11/07, Chris Hostetter <ho...@fucit.org> wrote:
> >>>>
> >>>>
> >>>> : I'm now looking at using payloads with SpanNearQuery but I don't
> >>>> see any
> >>>> : clear way of getting the payload(s) from the matching span
> >>>> terms. The
> >>>> term
> >>>> : positions for the payloads seem to be buried beneath SpanCells
> >>>> in the
> >>>>
> >>>> Isn't Spans.start() and Spans.end() what you are looking for?
> >>>>
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> -Hoss
> >>>>
> >>>>
> >>>> -------------------------------------------------------------------
> >>>> --
> >>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >>>> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>>>
> >>>>
> >>
> >> --------------------------
> >> Grant Ingersoll
> >> Center for Natural Language Processing
> >> http://www.cnlp.org/tech/lucene.asp
> >>
> >> Read the Lucene Java FAQ at http://wiki.apache.org/lucene-java/
> >> LuceneFAQ
> >>
> >>
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>
> >>
> >>
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
>
> ------------------------------------------------------
> Grant Ingersoll
> http://www.grantingersoll.com/
> http://lucene.grantingersoll.com
> http://www.paperoftheweek.com/
>
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Re: Payloads and PhraseQuery

Posted by Grant Ingersoll <gr...@gmail.com>.

Yep, totally agree.    One way to handle this initially at least is  
have isPayloadAvailable() only return true for the SpanTermQuery.   
The other option is to come up with some modification of the  
suggested methods below to return all the payloads in a span.

I have a basic implementation for just the SpanTermQuery (i.e. via  
TermSpans) in the works.  I will take a crack at fleshing out the  
rest at some point soon.

-Grant

On Jul 12, 2007, at 1:22 PM, Paul Elschot wrote:

>
> On Thursday 12 July 2007 14:50, Grant Ingersoll wrote:
>> That is off of the TermSpans class.  BTQ (BoostingTermQuery) is
>> implemented to extend SpanQuery, thus SpanNearQuery isn't, w/o
>> modification, going to have access to these things.  However, if you
>> look at the SpanTermQuery, you will see that it's implementation of
>> Spans is indeed the TermSpans class.  So, I think you could cast to
>> it or handle it through instanceof.
>>
>> I am not completely sure here, but it seems like we may need an
>> efficient way to access the TermPositions for each document.  That
>> is, the Spans class doesn't provide this and maybe it should
>> somehow.  Again, I am just thinking out loud here.
>
> SpanQueries can be nested, so the relationship between a span
> and a term position can also be one to many, not only one to one.
> For example a matching span in the Spans of a SpanNearQuery
> can be based on two matching (near enough to match) term positions.
>
>>
>> Thus, if we modified Spans to have the following methods:
>>
>> byte[] getPayload(byte[] data, int offset)
>>
>> boolean isPayloadAvailable()
>>
>> I think this would be useful.  Perhaps this should be discussed on  
>> dev.
>
> And the same holds for the payloads, there many be more than one
> for a single Span.
>
> Regards,
> Paul Elschot
>
>>
>> Cheers,
>> Grant
>>
>>
>> On Jul 12, 2007, at 8:20 AM, Peter Keegan wrote:
>>
>>> I'm looking for Spans.getPositions(), as shown in
>>> BoostingTermQuery, but
>>> neither NearSpansOrdered nor NearSpansUnordered (which are the Spans
>>> provided by SpanNearQuery) provide this method and it's not clear
>>> to me how
>>> to add it.
>>>
>>> Peter
>>>
>>> On 7/11/07, Chris Hostetter <ho...@fucit.org> wrote:
>>>>
>>>>
>>>> : I'm now looking at using payloads with SpanNearQuery but I don't
>>>> see any
>>>> : clear way of getting the payload(s) from the matching span
>>>> terms. The
>>>> term
>>>> : positions for the payloads seem to be buried beneath SpanCells
>>>> in the
>>>>
>>>> Isn't Spans.start() and Spans.end() what you are looking for?
>>>>
>>>>
>>>>
>>>>
>>>>
>>>> -Hoss
>>>>
>>>>
>>>> ------------------------------------------------------------------- 
>>>> --
>>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>>>
>>>>
>>
>> --------------------------
>> Grant Ingersoll
>> Center for Natural Language Processing
>> http://www.cnlp.org/tech/lucene.asp
>>
>> Read the Lucene Java FAQ at http://wiki.apache.org/lucene-java/ 
>> LuceneFAQ
>>
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>>
>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>

------------------------------------------------------
Grant Ingersoll
http://www.grantingersoll.com/
http://lucene.grantingersoll.com
http://www.paperoftheweek.com/



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Payloads and PhraseQuery

Posted by Paul Elschot <pa...@xs4all.nl>.

On Thursday 12 July 2007 14:50, Grant Ingersoll wrote:
> That is off of the TermSpans class.  BTQ (BoostingTermQuery) is  
> implemented to extend SpanQuery, thus SpanNearQuery isn't, w/o  
> modification, going to have access to these things.  However, if you  
> look at the SpanTermQuery, you will see that it's implementation of  
> Spans is indeed the TermSpans class.  So, I think you could cast to  
> it or handle it through instanceof.
> 
> I am not completely sure here, but it seems like we may need an  
> efficient way to access the TermPositions for each document.  That  
> is, the Spans class doesn't provide this and maybe it should  
> somehow.  Again, I am just thinking out loud here.

SpanQueries can be nested, so the relationship between a span
and a term position can also be one to many, not only one to one.
For example a matching span in the Spans of a SpanNearQuery
can be based on two matching (near enough to match) term positions.

> 
> Thus, if we modified Spans to have the following methods:
> 
> byte[] getPayload(byte[] data, int offset)
> 
> boolean isPayloadAvailable()
> 
> I think this would be useful.  Perhaps this should be discussed on dev.

And the same holds for the payloads, there many be more than one
for a single Span.

Regards,
Paul Elschot

> 
> Cheers,
> Grant
> 
> 
> On Jul 12, 2007, at 8:20 AM, Peter Keegan wrote:
> 
> > I'm looking for Spans.getPositions(), as shown in  
> > BoostingTermQuery, but
> > neither NearSpansOrdered nor NearSpansUnordered (which are the Spans
> > provided by SpanNearQuery) provide this method and it's not clear  
> > to me how
> > to add it.
> >
> > Peter
> >
> > On 7/11/07, Chris Hostetter <ho...@fucit.org> wrote:
> >>
> >>
> >> : I'm now looking at using payloads with SpanNearQuery but I don't  
> >> see any
> >> : clear way of getting the payload(s) from the matching span  
> >> terms. The
> >> term
> >> : positions for the payloads seem to be buried beneath SpanCells  
> >> in the
> >>
> >> Isn't Spans.start() and Spans.end() what you are looking for?
> >>
> >>
> >>
> >>
> >>
> >> -Hoss
> >>
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> >> For additional commands, e-mail: java-user-help@lucene.apache.org
> >>
> >>
> 
> --------------------------
> Grant Ingersoll
> Center for Natural Language Processing
> http://www.cnlp.org/tech/lucene.asp
> 
> Read the Lucene Java FAQ at http://wiki.apache.org/lucene-java/LuceneFAQ
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
> 
> 
> 

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Payloads and PhraseQuery

Posted by Peter Keegan <pe...@gmail.com>.

I guess this also ties in with 'getPositionIncrementGap', which is relevant
to fields with multiple occurrences.

Peter

On 7/27/07, Peter Keegan <pe...@gmail.com> wrote:
>
> I have a question about the way fields are analyzed and inverted by the
> index writer. Currently, if a field has multiple occurrences in a document,
> each occurrence is analyzed separately (see DocumentsWriter.processField).
> Is it safe to assume that this behavior won't change in the future? The
> reason I ask is that my custom analyzer's 'tokenStream' method creates a
> custom filter which produces a payload based on the existence of each field
> occurrence. However, if DocumentsWriter was changed and combined all the
> occurrences before inversion, my scheme wouldn't work.  Since payloads are
> created by filters/tokenizers, it helps to keep things flexible.
>
> Thanks,
> Peter
>
>
> On 7/12/07, Grant Ingersoll <gs...@apache.org> wrote:
> >
> >
> > On Jul 12, 2007, at 6:12 PM, Chris Hostetter wrote:
> >
> >
> > >
> > > Hmm... okay so the issue is that in order to get the payload data, you
> > > have to have a TermPositions instance.
> > >
> > > instead of adding getPayload methods to the Spans class (which as Paul
> >
> > > points out, can have nesting issues) perhaps more general solutions
> > > would
> > > be:
> > >
> > > a) a more high level getPayload API that let's you get a payload
> > > arbitrarily for a toc/position (perhaps as part of the TernDocs
> > > API?) ...
> > > then for Spans you could use this new API with Spans.start() and
> > > Spans.end(). (and all the positions in between)
> >
> > Not sure I follow this.  I don't see the fit w/ TermDocs.
> > >
> > > b) add a variation of the TermPositions class to allow people to
> > > iterate
> > > through the terms of a TermDoc in position order (TermPosition first
> > > iterates over the Terms and then over the positions) ... then you
> > > could
> > > seek(span.start()) to get the Payload data
> > >
> > > c) add methods to the Spans API to get the subspans (if any) ... this
> > > would be the Spans corrilary to getTerms() and would always return
> > > TermSpans which would have TermPositions for getting payload data.
> >
> >
> > This could be a good alternative.
> >
> > When we first talked about payloads we wondered if we could just make
> > all Queries into SpanQueries by passing TermPositions instead of term
> > docs, but in the end decided not to do it because of performance
> > issues (some of which are lessened by lazy loading of TermPositions.
> >
> > The thing is, I think, that the Spans is already moving you along in
> > the term positions, so it just seems like a natural fit to have it
> > there, even if there is nesting.  It doesn't seem like it would be
> > that hard to then return back the nesting stuff b/c you are just
> > collating the results from the underlying SpanTermQuery.  Having said
> > that, I haven't looked into the actual code, so take that w/ a grain
> > of salt.
> >
> > I will try to do some more investigation, as others are welcome to
> > do.  Perhaps we should move this to dev?
> >
> > Cheers,
> > Grant
> >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> > For additional commands, e-mail: java-user-help@lucene.apache.org
> >
> >
>

Re: Payloads and PhraseQuery

Posted by Peter Keegan <pe...@gmail.com>.

I have a question about the way fields are analyzed and inverted by the
index writer. Currently, if a field has multiple occurrences in a document,
each occurrence is analyzed separately (see DocumentsWriter.processField).
Is it safe to assume that this behavior won't change in the future? The
reason I ask is that my custom analyzer's 'tokenStream' method creates a
custom filter which produces a payload based on the existence of each field
occurrence. However, if DocumentsWriter was changed and combined all the
occurrences before inversion, my scheme wouldn't work.  Since payloads are
created by filters/tokenizers, it helps to keep things flexible.

Thanks,
Peter


On 7/12/07, Grant Ingersoll <gs...@apache.org> wrote:
>
>
> On Jul 12, 2007, at 6:12 PM, Chris Hostetter wrote:
>
>
> >
> > Hmm... okay so the issue is that in order to get the payload data, you
> > have to have a TermPositions instance.
> >
> > instead of adding getPayload methods to the Spans class (which as Paul
> > points out, can have nesting issues) perhaps more general solutions
> > would
> > be:
> >
> > a) a more high level getPayload API that let's you get a payload
> > arbitrarily for a toc/position (perhaps as part of the TernDocs
> > API?) ...
> > then for Spans you could use this new API with Spans.start() and
> > Spans.end(). (and all the positions in between)
>
> Not sure I follow this.  I don't see the fit w/ TermDocs.
> >
> > b) add a variation of the TermPositions class to allow people to
> > iterate
> > through the terms of a TermDoc in position order (TermPosition first
> > iterates over the Terms and then over the positions) ... then you
> > could
> > seek(span.start()) to get the Payload data
> >
> > c) add methods to the Spans API to get the subspans (if any) ... this
> > would be the Spans corrilary to getTerms() and would always return
> > TermSpans which would have TermPositions for getting payload data.
>
>
> This could be a good alternative.
>
> When we first talked about payloads we wondered if we could just make
> all Queries into SpanQueries by passing TermPositions instead of term
> docs, but in the end decided not to do it because of performance
> issues (some of which are lessened by lazy loading of TermPositions.
>
> The thing is, I think, that the Spans is already moving you along in
> the term positions, so it just seems like a natural fit to have it
> there, even if there is nesting.  It doesn't seem like it would be
> that hard to then return back the nesting stuff b/c you are just
> collating the results from the underlying SpanTermQuery.  Having said
> that, I haven't looked into the actual code, so take that w/ a grain
> of salt.
>
> I will try to do some more investigation, as others are welcome to
> do.  Perhaps we should move this to dev?
>
> Cheers,
> Grant
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Re: Payloads and PhraseQuery

Posted by Grant Ingersoll <gs...@apache.org>.

On Jul 12, 2007, at 6:12 PM, Chris Hostetter wrote:

>
> Hmm... okay so the issue is that in order to get the payload data, you
> have to have a TermPositions instance.
>
> instead of adding getPayload methods to the Spans class (which as Paul
> points out, can have nesting issues) perhaps more general solutions  
> would
> be:
>
> a) a more high level getPayload API that let's you get a payload
> arbitrarily for a toc/position (perhaps as part of the TernDocs  
> API?) ...
> then for Spans you could use this new API with Spans.start() and
> Spans.end(). (and all the positions in between)

Not sure I follow this.  I don't see the fit w/ TermDocs.
>
> b) add a variation of the TermPositions class to allow people to  
> iterate
> through the terms of a TermDoc in position order (TermPosition first
> iterates over the Terms and then over the positions) ... then you  
> could
> seek(span.start()) to get the Payload data
>
> c) add methods to the Spans API to get the subspans (if any) ... this
> would be the Spans corrilary to getTerms() and would always return
> TermSpans which would have TermPositions for getting payload data.

This could be a good alternative.

When we first talked about payloads we wondered if we could just make  
all Queries into SpanQueries by passing TermPositions instead of term  
docs, but in the end decided not to do it because of performance  
issues (some of which are lessened by lazy loading of TermPositions.

The thing is, I think, that the Spans is already moving you along in  
the term positions, so it just seems like a natural fit to have it  
there, even if there is nesting.  It doesn't seem like it would be  
that hard to then return back the nesting stuff b/c you are just  
collating the results from the underlying SpanTermQuery.  Having said  
that, I haven't looked into the actual code, so take that w/ a grain  
of salt.

I will try to do some more investigation, as others are welcome to  
do.  Perhaps we should move this to dev?

Cheers,
Grant

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Payloads and PhraseQuery

Posted by Chris Hostetter <ho...@fucit.org>.

: That is off of the TermSpans class.  BTQ (BoostingTermQuery) is
	...
: I am not completely sure here, but it seems like we may need an
: efficient way to access the TermPositions for each document.  That
: is, the Spans class doesn't provide this and maybe it should
	...
: > I'm looking for Spans.getPositions(), as shown in
	...
: >> : I'm now looking at using payloads with SpanNearQuery but I don't
: >> see any
: >> : clear way of getting the payload(s) from the matching span
: >> terms. The

Hmm... okay so the issue is that in order to get the payload data, you
have to have a TermPositions instance.

instead of adding getPayload methods to the Spans class (which as Paul
points out, can have nesting issues) perhaps more general solutions would
be:

a) a more high level getPayload API that let's you get a payload
arbitrarily for a toc/position (perhaps as part of the TernDocs API?) ...
then for Spans you could use this new API with Spans.start() and
Spans.end(). (and all the positions in between)

b) add a variation of the TermPositions class to allow people to iterate
through the terms of a TermDoc in position order (TermPosition first
iterates over the Terms and then over the positions) ... then you could
seek(span.start()) to get the Payload data

c) add methods to the Spans API to get the subspans (if any) ... this
would be the Spans corrilary to getTerms() and would always return
TermSpans which would have TermPositions for getting payload data.



-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Payloads and PhraseQuery

Posted by Grant Ingersoll <gs...@apache.org>.

That is off of the TermSpans class.  BTQ (BoostingTermQuery) is  
implemented to extend SpanQuery, thus SpanNearQuery isn't, w/o  
modification, going to have access to these things.  However, if you  
look at the SpanTermQuery, you will see that it's implementation of  
Spans is indeed the TermSpans class.  So, I think you could cast to  
it or handle it through instanceof.

I am not completely sure here, but it seems like we may need an  
efficient way to access the TermPositions for each document.  That  
is, the Spans class doesn't provide this and maybe it should  
somehow.  Again, I am just thinking out loud here.

Thus, if we modified Spans to have the following methods:

byte[] getPayload(byte[] data, int offset)

boolean isPayloadAvailable()

I think this would be useful.  Perhaps this should be discussed on dev.

Cheers,
Grant

On Jul 12, 2007, at 8:20 AM, Peter Keegan wrote:

> I'm looking for Spans.getPositions(), as shown in  
> BoostingTermQuery, but
> neither NearSpansOrdered nor NearSpansUnordered (which are the Spans
> provided by SpanNearQuery) provide this method and it's not clear  
> to me how
> to add it.
>
> Peter
>
> On 7/11/07, Chris Hostetter <ho...@fucit.org> wrote:
>>
>>
>> : I'm now looking at using payloads with SpanNearQuery but I don't  
>> see any
>> : clear way of getting the payload(s) from the matching span  
>> terms. The
>> term
>> : positions for the payloads seem to be buried beneath SpanCells  
>> in the
>>
>> Isn't Spans.start() and Spans.end() what you are looking for?
>>
>>
>>
>>
>>
>> -Hoss
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>> For additional commands, e-mail: java-user-help@lucene.apache.org
>>
>>

--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org/tech/lucene.asp

Read the Lucene Java FAQ at http://wiki.apache.org/lucene-java/LuceneFAQ

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org

Re: Payloads and PhraseQuery

Posted by Peter Keegan <pe...@gmail.com>.

I'm looking for Spans.getPositions(), as shown in BoostingTermQuery, but
neither NearSpansOrdered nor NearSpansUnordered (which are the Spans
provided by SpanNearQuery) provide this method and it's not clear to me how
to add it.

Peter

On 7/11/07, Chris Hostetter <ho...@fucit.org> wrote:
>
>
> : I'm now looking at using payloads with SpanNearQuery but I don't see any
> : clear way of getting the payload(s) from the matching span terms. The
> term
> : positions for the payloads seem to be buried beneath SpanCells in the
>
> Isn't Spans.start() and Spans.end() what you are looking for?
>
>
>
>
>
> -Hoss
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>

Re: Payloads and PhraseQuery

Posted by Chris Hostetter <ho...@fucit.org>.

: I'm now looking at using payloads with SpanNearQuery but I don't see any
: clear way of getting the payload(s) from the matching span terms. The term
: positions for the payloads seem to be buried beneath SpanCells in the

Isn't Spans.start() and Spans.end() what you are looking for?





-Hoss


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org