You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "Naess, Ronny" <Ro...@avinor.no> on 2007/05/23 20:27:30 UTC

Filtering hits

 
Is it possible to filter out url: hits?

With this I mean the following.

Query: "sometext url:myurl" (quotes is not part of search only to show
the query)
This query gives me hits where "sometext" is found for given url. The
problem with this is that myurl is also displayed as a hit. So if I have
only one page with "sometext" but total 40 with myurl I will recive 40
hits, but I want only the one with sometext. Said in another way. If i
do the query "url:myurl" I do not want any hits at all.

Is it possible make it behave like I want?

I guess that 'url' is not only indexed but also stored in the document
and that might be the case why url strings is returned as hits?

Regards,
Ronny