You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@lucenenet.apache.org by Patrick Burrows <pb...@gmail.com> on 2007/07/31 20:39:32 UTC

test

this is not spam, please stop bouncing it

-- 
-
P

unsubscribe

Posted by Sergiy Savchenko <ma...@gmail.com>.

unsubscribe
unsubscribe

Re: test

Posted by feran <fe...@whereistand.com>.

It's also a lot easier.

IndexSearcher searcher = new IndexSearcher(directory);
Hits hits = search.Search(new TermQuery(new Term("key", "value")));
// loop through hits

----- Original Message ----- 
From: "Kurt Mackey" <ku...@mubble.net>
To: <lu...@incubator.apache.org>
Sent: Tuesday, July 31, 2007 3:10 PM
Subject: RE: test


Not only can you, but it's preferred.  The QueryParser really only exists to 
handle human input.  If you can do it programmatically, things are much 
easier.

-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 2:04 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

Yeah. That's exactly what is happening.

Didn't realize I could use my own query without going through the parser.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Are you using the query parser thing for that?  It will split on the
> various special characters in a URL, and (by default) give you something
> like this for http://www.microsoft.com/windows:
>
> field:(http OR www OR microsoft OR come OR windows)
>
> For things like that, you'll need to build your own queries, not use the
> parser.
>
> -Kurt
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 1:45 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> hmm.... this seems to have made it through. My previous posts kept getting
> bounced for being spam.
>
> I had been trying to ask a question on searching for URLs. One of the
> fields
> in my index is called link. It holds nothing but URLs. There may be more
> than one link field per document.
>
> When I search on the url, though (using field:fullurl syntax) it returns a
> hit on every field in the database.
>
> Is there special syntax for searching for a url?
>
>
>
>
> On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> >
> > this is not spam, please stop bouncing it
> >
> > --
> > -
> > P
>
>
>
>
> --
> -
> P
>



--
-
P

Remove

Posted by "Shepherd, Shane" <Sh...@tylertech.com>.

Please remove me from the mailing list.

Thanks.


-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com] 
Sent: Tuesday, July 31, 2007 2:21 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

Hm...

I do this:

Query q = new TermQuery(new Term("link", args[1]));
(from a command line test app I made) and it still does not find the url
that Luke is showing me in the index. args[1] has the exact url copied
out
of Luke. I get 0 hits back -- which is probably better than getting
*every*
document back... but still not ideal.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Not only can you, but it's preferred.  The QueryParser really only
exists
> to handle human input.  If you can do it programmatically, things are
much
> easier.
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 2:04 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> Yeah. That's exactly what is happening.
>
> Didn't realize I could use my own query without going through the
parser.
>
>
> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
> >
> > Are you using the query parser thing for that?  It will split on the
> > various special characters in a URL, and (by default) give you
something
> > like this for http://www.microsoft.com/windows:
> >
> > field:(http OR www OR microsoft OR come OR windows)
> >
> > For things like that, you'll need to build your own queries, not use
the
> > parser.
> >
> > -Kurt
> >
> > -----Original Message-----
> > From: Patrick Burrows [mailto:pburrows@gmail.com]
> > Sent: Tuesday, July 31, 2007 1:45 PM
> > To: lucene-net-user@incubator.apache.org
> > Subject: Re: test
> >
> > hmm.... this seems to have made it through. My previous posts kept
> getting
> > bounced for being spam.
> >
> > I had been trying to ask a question on searching for URLs. One of
the
> > fields
> > in my index is called link. It holds nothing but URLs. There may be
more
> > than one link field per document.
> >
> > When I search on the url, though (using field:fullurl syntax) it
returns
> a
> > hit on every field in the database.
> >
> > Is there special syntax for searching for a url?
> >
> >
> >
> >
> > On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> > >
> > > this is not spam, please stop bouncing it
> > >
> > > --
> > > -
> > > P
> >
> >
> >
> >
> > --
> > -
> > P
> >
>
>
>
> --
> -
> P
>



-- 
-
P

RE: test

Posted by Kurt Mackey <ku...@mubble.net>.

How's the URL stored in the index?  Make sure it's indexed but not tokenized.

-Kurt

-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 2:21 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

Hm...

I do this:

Query q = new TermQuery(new Term("link", args[1]));
(from a command line test app I made) and it still does not find the url
that Luke is showing me in the index. args[1] has the exact url copied out
of Luke. I get 0 hits back -- which is probably better than getting *every*
document back... but still not ideal.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Not only can you, but it's preferred.  The QueryParser really only exists
> to handle human input.  If you can do it programmatically, things are much
> easier.
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 2:04 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> Yeah. That's exactly what is happening.
>
> Didn't realize I could use my own query without going through the parser.
>
>
> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
> >
> > Are you using the query parser thing for that?  It will split on the
> > various special characters in a URL, and (by default) give you something
> > like this for http://www.microsoft.com/windows:
> >
> > field:(http OR www OR microsoft OR come OR windows)
> >
> > For things like that, you'll need to build your own queries, not use the
> > parser.
> >
> > -Kurt
> >
> > -----Original Message-----
> > From: Patrick Burrows [mailto:pburrows@gmail.com]
> > Sent: Tuesday, July 31, 2007 1:45 PM
> > To: lucene-net-user@incubator.apache.org
> > Subject: Re: test
> >
> > hmm.... this seems to have made it through. My previous posts kept
> getting
> > bounced for being spam.
> >
> > I had been trying to ask a question on searching for URLs. One of the
> > fields
> > in my index is called link. It holds nothing but URLs. There may be more
> > than one link field per document.
> >
> > When I search on the url, though (using field:fullurl syntax) it returns
> a
> > hit on every field in the database.
> >
> > Is there special syntax for searching for a url?
> >
> >
> >
> >
> > On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> > >
> > > this is not spam, please stop bouncing it
> > >
> > > --
> > > -
> > > P
> >
> >
> >
> >
> > --
> > -
> > P
> >
>
>
>
> --
> -
> P
>



--
-
P

Re: Lucene in Action (relevance and usefulness)

Posted by Ciaran Roarty <ci...@gmail.com>.

Agreed, to understand what Lucene does and how it works, Lucene in Action is
invaluable. The API may have moved on but the core is still similar.

On 01/08/07, Vijay Santhanam <vi...@spectrumwired.com> wrote:
>
> >The API may be a little different, but the javadocs are good as an API
> > reference. Lucene in Action is great as an architectural introduction.
>
> That's a better way of putting it.
>
> -Vijay
>
>
>
>
> -----Original Message-----
> From: Dean Harding [mailto:dean.harding@dload.com.au]
> Sent: Wednesday, 1 August 2007 1:37 PM
> To: lucene-net-user@incubator.apache.org
> Subject: RE: Lucene in Action (relevance and usefulness)
>
> > The book is still immensely useful, but API differences may cause new
> > readers some confusion.
>
>
> Dean.
>
>
>
> __________ NOD32 2220 (20070426) Information __________
>
> This message was checked by NOD32 antivirus system.
> http://www.eset.com
>
>
>

RE: Lucene in Action (relevance and usefulness)

Posted by Vijay Santhanam <vi...@spectrumwired.com>.

>The API may be a little different, but the javadocs are good as an API
> reference. Lucene in Action is great as an architectural introduction.

That's a better way of putting it.

-Vijay
 



-----Original Message-----
From: Dean Harding [mailto:dean.harding@dload.com.au] 
Sent: Wednesday, 1 August 2007 1:37 PM
To: lucene-net-user@incubator.apache.org
Subject: RE: Lucene in Action (relevance and usefulness)

> The book is still immensely useful, but API differences may cause new
> readers some confusion.


Dean.



__________ NOD32 2220 (20070426) Information __________

This message was checked by NOD32 antivirus system.
http://www.eset.com

RE: Lucene in Action (relevance and usefulness)

Posted by Dean Harding <de...@dload.com.au>.

> The book is still immensely useful, but API differences may cause new
> readers some confusion.

The API may be a little different, but the javadocs are good as an API
reference. Lucene in Action is great as an architectural introduction.

Dean.

Lucene in Action (relevance and usefulness)

Posted by Vijay Santhanam <vi...@spectrumwired.com>.

I found Lucene in Action perfect for 1.9 dev, but 2.0+ deviates a bit from
the API used in "Lucene in Action".

 

The book is still immensely useful, but API differences may cause new
readers some confusion.

 

I believe "Lucene in Action" is written for 1.4 java package.

 

 

Regards,

-Vijay

 

From: Simone Busoli [mailto:simone.busoli@gmail.com] 
Sent: Wednesday, 1 August 2007 7:40 AM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

 

To everyone, please read Lucene in Action, it will teach you most of the
things you need to know about Lucene.

Simone

Patrick Burrows wrote: 

Ah. that's got it. Thanks!
 
On 7/31/07, feran  <ma...@whereistand.com> <fe...@whereistand.com>
wrote:
  

Is the Field UN_TOKENIZED?
 
If it's TOKENIZED you may not find it because the value will have been
split
into terms.
 
----- Original Message -----
From: "Patrick Burrows"  <ma...@gmail.com> <pb...@gmail.com>
To:  <ma...@incubator.apache.org>
<lu...@incubator.apache.org>
Sent: Tuesday, July 31, 2007 3:21 PM
Subject: Re: test
 
 
    

Hm...
 
I do this:
 
Query q = new TermQuery(new Term("link", args[1]));
(from a command line test app I made) and it still does not find the url
that Luke is showing me in the index. args[1] has the exact url copied
      

out
    

of Luke. I get 0 hits back -- which is probably better than getting
*every*
document back... but still not ideal.
 
 
On 7/31/07, Kurt Mackey  <ma...@mubble.net> <ku...@mubble.net> wrote:
      

Not only can you, but it's preferred.  The QueryParser really only
        

exists
    

to handle human input.  If you can do it programmatically, things are
much
easier.
 
-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 2:04 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test
 
Yeah. That's exactly what is happening.
 
Didn't realize I could use my own query without going through the
        

parser.
    

 
On 7/31/07, Kurt Mackey  <ma...@mubble.net> <ku...@mubble.net> wrote:
        

Are you using the query parser thing for that?  It will split on the
various special characters in a URL, and (by default) give you
something
like this for http://www.microsoft.com/windows:
 
field:(http OR www OR microsoft OR come OR windows)
 
For things like that, you'll need to build your own queries, not use
the
parser.
 
-Kurt
 
-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 1:45 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test
 
hmm.... this seems to have made it through. My previous posts kept
          

getting
        

bounced for being spam.
 
I had been trying to ask a question on searching for URLs. One of the
fields
in my index is called link. It holds nothing but URLs. There may be
more
than one link field per document.
 
When I search on the url, though (using field:fullurl syntax) it
returns
          

a
        

hit on every field in the database.
 
Is there special syntax for searching for a url?
 
 
 
 
On 7/31/07, Patrick Burrows  <ma...@gmail.com>
<pb...@gmail.com> wrote:
          

this is not spam, please stop bouncing it
 
--
-
P
            

 
 
 
--
-
P
 
          

 
 
--
-
P
 
        

 
 
--
-
P
 
      

 
 
    

 
 
  



__________ NOD32 2220 (20070426) Information __________

This message was checked by NOD32 antivirus system.
http://www.eset.com

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

Well, alright. Yet, I checked the index for URL and it wasn't in there.

This is why all books should be electronic. Any desktop search tool
(including lucene) would have found that easily.


On 7/31/07, Michael Mitiaguin <mi...@gmail.com> wrote:
>
> Actually there is.  Page 21 (  1.5.5 Field )
> Though book describes Lucene 1.4  indexing which is obsolete in 1.9
> and probably doesn't work in  2.0 , it should give a clear idea how to
> transform to newer methods.
>
> Regards
> Michael
>
> On 8/1/07, Patrick Burrows <pb...@gmail.com> wrote:
> > This part wasn't in that book. I looked.
> >
> > Well... the parts were there. But the arrangement is the art, isn't it?
> >
> >
> > On 7/31/07, Simone Busoli <si...@gmail.com> wrote:
> > >
> > > To everyone, please read *Lucene in Action*, it will teach you most of
> the
> > > things you need to know about Lucene.
> > >
> > > Simone
> > >
> > > Patrick Burrows wrote:
> > >
> > > Ah. that's got it. Thanks!
> > >
> > > On 7/31/07, feran <fe...@whereistand.com> <fe...@whereistand.com>
> wrote:
> > >
> > >
> > > Is the Field UN_TOKENIZED?
> > >
> > > If it's TOKENIZED you may not find it because the value will have been
> > > split
> > > into terms.
> > >
> > > ----- Original Message -----
> > > From: "Patrick Burrows" <pb...@gmail.com> <pb...@gmail.com>
> > > To: <lu...@incubator.apache.org> <
> lucene-net-user@incubator.apache.org>
> > > Sent: Tuesday, July 31, 2007 3:21 PM
> > > Subject: Re: test
> > >
> > >
> > >
> > >
> > > Hm...
> > >
> > > I do this:
> > >
> > > Query q = new TermQuery(new Term("link", args[1]));
> > > (from a command line test app I made) and it still does not find the
> url
> > > that Luke is showing me in the index. args[1] has the exact url copied
> > >
> > >
> > > out
> > >
> > >
> > > of Luke. I get 0 hits back -- which is probably better than getting
> > > *every*
> > > document back... but still not ideal.
> > >
> > >
> > > On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
> > >
> > >
> > > Not only can you, but it's preferred.  The QueryParser really only
> > >
> > >
> > > exists
> > >
> > >
> > >  to handle human input.  If you can do it programmatically, things are
> > > much
> > > easier.
> > >
> > > -----Original Message-----
> > > From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> > > Sent: Tuesday, July 31, 2007 2:04 PM
> > > To: lucene-net-user@incubator.apache.org
> > > Subject: Re: test
> > >
> > > Yeah. That's exactly what is happening.
> > >
> > > Didn't realize I could use my own query without going through the
> > >
> > >
> > > parser.
> > >
> > >
> > >  On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
> > >
> > >
> > > Are you using the query parser thing for that?  It will split on the
> > > various special characters in a URL, and (by default) give you
> > > something
> > > like this for http://www.microsoft.com/windows:
> > >
> > > field:(http OR www OR microsoft OR come OR windows)
> > >
> > > For things like that, you'll need to build your own queries, not use
> > > the
> > > parser.
> > >
> > > -Kurt
> > >
> > > -----Original Message-----
> > > From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> > > Sent: Tuesday, July 31, 2007 1:45 PM
> > > To: lucene-net-user@incubator.apache.org
> > > Subject: Re: test
> > >
> > > hmm.... this seems to have made it through. My previous posts kept
> > >
> > >
> > > getting
> > >
> > >
> > > bounced for being spam.
> > >
> > > I had been trying to ask a question on searching for URLs. One of the
> > > fields
> > > in my index is called link. It holds nothing but URLs. There may be
> > > more
> > > than one link field per document.
> > >
> > > When I search on the url, though (using field:fullurl syntax) it
> > > returns
> > >
> > >
> > > a
> > >
> > >
> > > hit on every field in the database.
> > >
> > > Is there special syntax for searching for a url?
> > >
> > >
> > >
> > >
> > > On 7/31/07, Patrick Burrows <pb...@gmail.com> <pb...@gmail.com>
> wrote:
> > >
> > >
> > > this is not spam, please stop bouncing it
> > >
> > > --
> > > -
> > > P
> > >
> > >
> > >
> > > --
> > > -
> > > P
> > >
> > >
> > >
> > > --
> > > -
> > > P
> > >
> > >
> > >
> > > --
> > > -
> > > P
> > >
> > >
> > >
> > >
> >
> >
> > --
> > -
> > P
> >
>



-- 
-
P

Re: test

Posted by Michael Mitiaguin <mi...@gmail.com>.

Actually there is.  Page 21 (  1.5.5 Field )
Though book describes Lucene 1.4  indexing which is obsolete in 1.9
and probably doesn't work in  2.0 , it should give a clear idea how to
transform to newer methods.

Regards
Michael

On 8/1/07, Patrick Burrows <pb...@gmail.com> wrote:
> This part wasn't in that book. I looked.
>
> Well... the parts were there. But the arrangement is the art, isn't it?
>
>
> On 7/31/07, Simone Busoli <si...@gmail.com> wrote:
> >
> > To everyone, please read *Lucene in Action*, it will teach you most of the
> > things you need to know about Lucene.
> >
> > Simone
> >
> > Patrick Burrows wrote:
> >
> > Ah. that's got it. Thanks!
> >
> > On 7/31/07, feran <fe...@whereistand.com> <fe...@whereistand.com> wrote:
> >
> >
> > Is the Field UN_TOKENIZED?
> >
> > If it's TOKENIZED you may not find it because the value will have been
> > split
> > into terms.
> >
> > ----- Original Message -----
> > From: "Patrick Burrows" <pb...@gmail.com> <pb...@gmail.com>
> > To: <lu...@incubator.apache.org> <lu...@incubator.apache.org>
> > Sent: Tuesday, July 31, 2007 3:21 PM
> > Subject: Re: test
> >
> >
> >
> >
> > Hm...
> >
> > I do this:
> >
> > Query q = new TermQuery(new Term("link", args[1]));
> > (from a command line test app I made) and it still does not find the url
> > that Luke is showing me in the index. args[1] has the exact url copied
> >
> >
> > out
> >
> >
> > of Luke. I get 0 hits back -- which is probably better than getting
> > *every*
> > document back... but still not ideal.
> >
> >
> > On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
> >
> >
> > Not only can you, but it's preferred.  The QueryParser really only
> >
> >
> > exists
> >
> >
> >  to handle human input.  If you can do it programmatically, things are
> > much
> > easier.
> >
> > -----Original Message-----
> > From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> > Sent: Tuesday, July 31, 2007 2:04 PM
> > To: lucene-net-user@incubator.apache.org
> > Subject: Re: test
> >
> > Yeah. That's exactly what is happening.
> >
> > Didn't realize I could use my own query without going through the
> >
> >
> > parser.
> >
> >
> >  On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
> >
> >
> > Are you using the query parser thing for that?  It will split on the
> > various special characters in a URL, and (by default) give you
> > something
> > like this for http://www.microsoft.com/windows:
> >
> > field:(http OR www OR microsoft OR come OR windows)
> >
> > For things like that, you'll need to build your own queries, not use
> > the
> > parser.
> >
> > -Kurt
> >
> > -----Original Message-----
> > From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> > Sent: Tuesday, July 31, 2007 1:45 PM
> > To: lucene-net-user@incubator.apache.org
> > Subject: Re: test
> >
> > hmm.... this seems to have made it through. My previous posts kept
> >
> >
> > getting
> >
> >
> > bounced for being spam.
> >
> > I had been trying to ask a question on searching for URLs. One of the
> > fields
> > in my index is called link. It holds nothing but URLs. There may be
> > more
> > than one link field per document.
> >
> > When I search on the url, though (using field:fullurl syntax) it
> > returns
> >
> >
> > a
> >
> >
> > hit on every field in the database.
> >
> > Is there special syntax for searching for a url?
> >
> >
> >
> >
> > On 7/31/07, Patrick Burrows <pb...@gmail.com> <pb...@gmail.com> wrote:
> >
> >
> > this is not spam, please stop bouncing it
> >
> > --
> > -
> > P
> >
> >
> >
> > --
> > -
> > P
> >
> >
> >
> > --
> > -
> > P
> >
> >
> >
> > --
> > -
> > P
> >
> >
> >
> >
>
>
> --
> -
> P
>

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

This part wasn't in that book. I looked.

Well... the parts were there. But the arrangement is the art, isn't it?


On 7/31/07, Simone Busoli <si...@gmail.com> wrote:
>
> To everyone, please read *Lucene in Action*, it will teach you most of the
> things you need to know about Lucene.
>
> Simone
>
> Patrick Burrows wrote:
>
> Ah. that's got it. Thanks!
>
> On 7/31/07, feran <fe...@whereistand.com> <fe...@whereistand.com> wrote:
>
>
> Is the Field UN_TOKENIZED?
>
> If it's TOKENIZED you may not find it because the value will have been
> split
> into terms.
>
> ----- Original Message -----
> From: "Patrick Burrows" <pb...@gmail.com> <pb...@gmail.com>
> To: <lu...@incubator.apache.org> <lu...@incubator.apache.org>
> Sent: Tuesday, July 31, 2007 3:21 PM
> Subject: Re: test
>
>
>
>
> Hm...
>
> I do this:
>
> Query q = new TermQuery(new Term("link", args[1]));
> (from a command line test app I made) and it still does not find the url
> that Luke is showing me in the index. args[1] has the exact url copied
>
>
> out
>
>
> of Luke. I get 0 hits back -- which is probably better than getting
> *every*
> document back... but still not ideal.
>
>
> On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
>
>
> Not only can you, but it's preferred.  The QueryParser really only
>
>
> exists
>
>
>  to handle human input.  If you can do it programmatically, things are
> much
> easier.
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> Sent: Tuesday, July 31, 2007 2:04 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> Yeah. That's exactly what is happening.
>
> Didn't realize I could use my own query without going through the
>
>
> parser.
>
>
>  On 7/31/07, Kurt Mackey <ku...@mubble.net> <ku...@mubble.net> wrote:
>
>
> Are you using the query parser thing for that?  It will split on the
> various special characters in a URL, and (by default) give you
> something
> like this for http://www.microsoft.com/windows:
>
> field:(http OR www OR microsoft OR come OR windows)
>
> For things like that, you'll need to build your own queries, not use
> the
> parser.
>
> -Kurt
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com <pb...@gmail.com>]
> Sent: Tuesday, July 31, 2007 1:45 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> hmm.... this seems to have made it through. My previous posts kept
>
>
> getting
>
>
> bounced for being spam.
>
> I had been trying to ask a question on searching for URLs. One of the
> fields
> in my index is called link. It holds nothing but URLs. There may be
> more
> than one link field per document.
>
> When I search on the url, though (using field:fullurl syntax) it
> returns
>
>
> a
>
>
> hit on every field in the database.
>
> Is there special syntax for searching for a url?
>
>
>
>
> On 7/31/07, Patrick Burrows <pb...@gmail.com> <pb...@gmail.com> wrote:
>
>
> this is not spam, please stop bouncing it
>
> --
> -
> P
>
>
>
> --
> -
> P
>
>
>
> --
> -
> P
>
>
>
> --
> -
> P
>
>
>
>


-- 
-
P

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

Ah. that's got it. Thanks!

On 7/31/07, feran <fe...@whereistand.com> wrote:
>
> Is the Field UN_TOKENIZED?
>
> If it's TOKENIZED you may not find it because the value will have been
> split
> into terms.
>
> ----- Original Message -----
> From: "Patrick Burrows" <pb...@gmail.com>
> To: <lu...@incubator.apache.org>
> Sent: Tuesday, July 31, 2007 3:21 PM
> Subject: Re: test
>
>
> > Hm...
> >
> > I do this:
> >
> > Query q = new TermQuery(new Term("link", args[1]));
> > (from a command line test app I made) and it still does not find the url
> > that Luke is showing me in the index. args[1] has the exact url copied
> out
> > of Luke. I get 0 hits back -- which is probably better than getting
> > *every*
> > document back... but still not ideal.
> >
> >
> > On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
> >>
> >> Not only can you, but it's preferred.  The QueryParser really only
> exists
> >> to handle human input.  If you can do it programmatically, things are
> >> much
> >> easier.
> >>
> >> -----Original Message-----
> >> From: Patrick Burrows [mailto:pburrows@gmail.com]
> >> Sent: Tuesday, July 31, 2007 2:04 PM
> >> To: lucene-net-user@incubator.apache.org
> >> Subject: Re: test
> >>
> >> Yeah. That's exactly what is happening.
> >>
> >> Didn't realize I could use my own query without going through the
> parser.
> >>
> >>
> >> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
> >> >
> >> > Are you using the query parser thing for that?  It will split on the
> >> > various special characters in a URL, and (by default) give you
> >> > something
> >> > like this for http://www.microsoft.com/windows:
> >> >
> >> > field:(http OR www OR microsoft OR come OR windows)
> >> >
> >> > For things like that, you'll need to build your own queries, not use
> >> > the
> >> > parser.
> >> >
> >> > -Kurt
> >> >
> >> > -----Original Message-----
> >> > From: Patrick Burrows [mailto:pburrows@gmail.com]
> >> > Sent: Tuesday, July 31, 2007 1:45 PM
> >> > To: lucene-net-user@incubator.apache.org
> >> > Subject: Re: test
> >> >
> >> > hmm.... this seems to have made it through. My previous posts kept
> >> getting
> >> > bounced for being spam.
> >> >
> >> > I had been trying to ask a question on searching for URLs. One of the
> >> > fields
> >> > in my index is called link. It holds nothing but URLs. There may be
> >> > more
> >> > than one link field per document.
> >> >
> >> > When I search on the url, though (using field:fullurl syntax) it
> >> > returns
> >> a
> >> > hit on every field in the database.
> >> >
> >> > Is there special syntax for searching for a url?
> >> >
> >> >
> >> >
> >> >
> >> > On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> >> > >
> >> > > this is not spam, please stop bouncing it
> >> > >
> >> > > --
> >> > > -
> >> > > P
> >> >
> >> >
> >> >
> >> >
> >> > --
> >> > -
> >> > P
> >> >
> >>
> >>
> >>
> >> --
> >> -
> >> P
> >>
> >
> >
> >
> > --
> > -
> > P
> >
>
>
>


-- 
-
P

Re: test

Posted by feran <fe...@whereistand.com>.

Is the Field UN_TOKENIZED?

If it's TOKENIZED you may not find it because the value will have been split 
into terms.

----- Original Message ----- 
From: "Patrick Burrows" <pb...@gmail.com>
To: <lu...@incubator.apache.org>
Sent: Tuesday, July 31, 2007 3:21 PM
Subject: Re: test


> Hm...
>
> I do this:
>
> Query q = new TermQuery(new Term("link", args[1]));
> (from a command line test app I made) and it still does not find the url
> that Luke is showing me in the index. args[1] has the exact url copied out
> of Luke. I get 0 hits back -- which is probably better than getting 
> *every*
> document back... but still not ideal.
>
>
> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>>
>> Not only can you, but it's preferred.  The QueryParser really only exists
>> to handle human input.  If you can do it programmatically, things are 
>> much
>> easier.
>>
>> -----Original Message-----
>> From: Patrick Burrows [mailto:pburrows@gmail.com]
>> Sent: Tuesday, July 31, 2007 2:04 PM
>> To: lucene-net-user@incubator.apache.org
>> Subject: Re: test
>>
>> Yeah. That's exactly what is happening.
>>
>> Didn't realize I could use my own query without going through the parser.
>>
>>
>> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>> >
>> > Are you using the query parser thing for that?  It will split on the
>> > various special characters in a URL, and (by default) give you 
>> > something
>> > like this for http://www.microsoft.com/windows:
>> >
>> > field:(http OR www OR microsoft OR come OR windows)
>> >
>> > For things like that, you'll need to build your own queries, not use 
>> > the
>> > parser.
>> >
>> > -Kurt
>> >
>> > -----Original Message-----
>> > From: Patrick Burrows [mailto:pburrows@gmail.com]
>> > Sent: Tuesday, July 31, 2007 1:45 PM
>> > To: lucene-net-user@incubator.apache.org
>> > Subject: Re: test
>> >
>> > hmm.... this seems to have made it through. My previous posts kept
>> getting
>> > bounced for being spam.
>> >
>> > I had been trying to ask a question on searching for URLs. One of the
>> > fields
>> > in my index is called link. It holds nothing but URLs. There may be 
>> > more
>> > than one link field per document.
>> >
>> > When I search on the url, though (using field:fullurl syntax) it 
>> > returns
>> a
>> > hit on every field in the database.
>> >
>> > Is there special syntax for searching for a url?
>> >
>> >
>> >
>> >
>> > On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
>> > >
>> > > this is not spam, please stop bouncing it
>> > >
>> > > --
>> > > -
>> > > P
>> >
>> >
>> >
>> >
>> > --
>> > -
>> > P
>> >
>>
>>
>>
>> --
>> -
>> P
>>
>
>
>
> -- 
> -
> P
>

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

Hm...

I do this:

Query q = new TermQuery(new Term("link", args[1]));
(from a command line test app I made) and it still does not find the url
that Luke is showing me in the index. args[1] has the exact url copied out
of Luke. I get 0 hits back -- which is probably better than getting *every*
document back... but still not ideal.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Not only can you, but it's preferred.  The QueryParser really only exists
> to handle human input.  If you can do it programmatically, things are much
> easier.
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 2:04 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> Yeah. That's exactly what is happening.
>
> Didn't realize I could use my own query without going through the parser.
>
>
> On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
> >
> > Are you using the query parser thing for that?  It will split on the
> > various special characters in a URL, and (by default) give you something
> > like this for http://www.microsoft.com/windows:
> >
> > field:(http OR www OR microsoft OR come OR windows)
> >
> > For things like that, you'll need to build your own queries, not use the
> > parser.
> >
> > -Kurt
> >
> > -----Original Message-----
> > From: Patrick Burrows [mailto:pburrows@gmail.com]
> > Sent: Tuesday, July 31, 2007 1:45 PM
> > To: lucene-net-user@incubator.apache.org
> > Subject: Re: test
> >
> > hmm.... this seems to have made it through. My previous posts kept
> getting
> > bounced for being spam.
> >
> > I had been trying to ask a question on searching for URLs. One of the
> > fields
> > in my index is called link. It holds nothing but URLs. There may be more
> > than one link field per document.
> >
> > When I search on the url, though (using field:fullurl syntax) it returns
> a
> > hit on every field in the database.
> >
> > Is there special syntax for searching for a url?
> >
> >
> >
> >
> > On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> > >
> > > this is not spam, please stop bouncing it
> > >
> > > --
> > > -
> > > P
> >
> >
> >
> >
> > --
> > -
> > P
> >
>
>
>
> --
> -
> P
>



-- 
-
P

RE: test

Posted by Kurt Mackey <ku...@mubble.net>.

Not only can you, but it's preferred.  The QueryParser really only exists to handle human input.  If you can do it programmatically, things are much easier.

-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 2:04 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

Yeah. That's exactly what is happening.

Didn't realize I could use my own query without going through the parser.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Are you using the query parser thing for that?  It will split on the
> various special characters in a URL, and (by default) give you something
> like this for http://www.microsoft.com/windows:
>
> field:(http OR www OR microsoft OR come OR windows)
>
> For things like that, you'll need to build your own queries, not use the
> parser.
>
> -Kurt
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 1:45 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> hmm.... this seems to have made it through. My previous posts kept getting
> bounced for being spam.
>
> I had been trying to ask a question on searching for URLs. One of the
> fields
> in my index is called link. It holds nothing but URLs. There may be more
> than one link field per document.
>
> When I search on the url, though (using field:fullurl syntax) it returns a
> hit on every field in the database.
>
> Is there special syntax for searching for a url?
>
>
>
>
> On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> >
> > this is not spam, please stop bouncing it
> >
> > --
> > -
> > P
>
>
>
>
> --
> -
> P
>



--
-
P

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

Yeah. That's exactly what is happening.

Didn't realize I could use my own query without going through the parser.


On 7/31/07, Kurt Mackey <ku...@mubble.net> wrote:
>
> Are you using the query parser thing for that?  It will split on the
> various special characters in a URL, and (by default) give you something
> like this for http://www.microsoft.com/windows:
>
> field:(http OR www OR microsoft OR come OR windows)
>
> For things like that, you'll need to build your own queries, not use the
> parser.
>
> -Kurt
>
> -----Original Message-----
> From: Patrick Burrows [mailto:pburrows@gmail.com]
> Sent: Tuesday, July 31, 2007 1:45 PM
> To: lucene-net-user@incubator.apache.org
> Subject: Re: test
>
> hmm.... this seems to have made it through. My previous posts kept getting
> bounced for being spam.
>
> I had been trying to ask a question on searching for URLs. One of the
> fields
> in my index is called link. It holds nothing but URLs. There may be more
> than one link field per document.
>
> When I search on the url, though (using field:fullurl syntax) it returns a
> hit on every field in the database.
>
> Is there special syntax for searching for a url?
>
>
>
>
> On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
> >
> > this is not spam, please stop bouncing it
> >
> > --
> > -
> > P
>
>
>
>
> --
> -
> P
>



-- 
-
P

RE: test

Posted by Kurt Mackey <ku...@mubble.net>.

Are you using the query parser thing for that?  It will split on the various special characters in a URL, and (by default) give you something like this for http://www.microsoft.com/windows:

field:(http OR www OR microsoft OR come OR windows)

For things like that, you'll need to build your own queries, not use the parser.

-Kurt

-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com]
Sent: Tuesday, July 31, 2007 1:45 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

hmm.... this seems to have made it through. My previous posts kept getting
bounced for being spam.

I had been trying to ask a question on searching for URLs. One of the fields
in my index is called link. It holds nothing but URLs. There may be more
than one link field per document.

When I search on the url, though (using field:fullurl syntax) it returns a
hit on every field in the database.

Is there special syntax for searching for a url?

On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
>
> this is not spam, please stop bouncing it
>
> --
> -
> P

--
-
P

RE: test

Posted by DIGY <di...@gmail.com>.

Your question is not clear for me. Can you send a simple sample code for
that?

DIGY

-----Original Message-----
From: Patrick Burrows [mailto:pburrows@gmail.com] 
Sent: Tuesday, July 31, 2007 9:45 PM
To: lucene-net-user@incubator.apache.org
Subject: Re: test

hmm.... this seems to have made it through. My previous posts kept getting
bounced for being spam.

I had been trying to ask a question on searching for URLs. One of the fields
in my index is called link. It holds nothing but URLs. There may be more
than one link field per document.

When I search on the url, though (using field:fullurl syntax) it returns a
hit on every field in the database.

Is there special syntax for searching for a url?

On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
>
> this is not spam, please stop bouncing it
>
> --
> -
> P

-- 
-
P

Re: test

Posted by Patrick Burrows <pb...@gmail.com>.

hmm.... this seems to have made it through. My previous posts kept getting
bounced for being spam.

I had been trying to ask a question on searching for URLs. One of the fields
in my index is called link. It holds nothing but URLs. There may be more
than one link field per document.

When I search on the url, though (using field:fullurl syntax) it returns a
hit on every field in the database.

Is there special syntax for searching for a url?

On 7/31/07, Patrick Burrows <pb...@gmail.com> wrote:
>
> this is not spam, please stop bouncing it
>
> --
> -
> P

-- 
-
P