Posted to java-user@lucene.apache.org by gnixinfosoft <ma...@gmail.com> on 2009/05/27 11:37:45 UTC

Apache Lucene Crawler search

How can I implement crawler-based search with Apache Lucene?

I am currently using the FAST search engine in my project, which provides a
crawler facility.

How can I do the same with Apache Lucene? I read somewhere that Lucene has no
direct functionality for this, but that it can be implemented with some
tweaks.

Please help!


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Apache Lucene Crawler search

Posted by Mark Miller <ma...@gmail.com>.
Lucene is more like a search utility library than a full-blown search 
engine like FAST. The Lucene subproject, Solr, is more comparable to 
FAST, but Solr does not have a built-in crawler available either (though 
it's easy enough to do basic crawls).

There are many open-source crawlers you could use to pass documents to 
Lucene, though.

There is Droids, as Michael mentions below, which goes well with Tika 
http://lucene.apache.org/tika/.

There is also Heritrix http://crawler.archive.org/, the Nutch crawler 
http://lucene.apache.org/nutch/, and Aperture 
http://aperture.sourceforge.net/.

There are also commercial options, of course.
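
To make the "crawl, then pass documents to Lucene" idea concrete, here is a
minimal sketch of a basic breadth-first crawl. It is only an illustration,
not how Droids, Nutch, or Heritrix work internally. The names SimpleCrawler,
fetcher, and indexer are hypothetical, and the code uses only the JDK: the
fetcher is assumed to map a URL to its page body (or null if unavailable),
and the indexer callback marks the place where you would build a Lucene
document and hand it to an IndexWriter.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.BiConsumer;
import java.util.function.Function;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical minimal crawler: fetch a page, hand it to an indexer, follow its links. */
final class SimpleCrawler {
    // Naive href extraction for illustration; a real crawler would use an HTML parser.
    private static final Pattern HREF = Pattern.compile("href=\"([^\"]+)\"");

    private final Function<String, String> fetcher;   // URL -> page body, or null if unavailable
    private final BiConsumer<String, String> indexer; // receives (url, body); Lucene indexing would go here

    SimpleCrawler(Function<String, String> fetcher, BiConsumer<String, String> indexer) {
        this.fetcher = fetcher;
        this.indexer = indexer;
    }

    /** Breadth-first crawl from the seed; indexes at most maxPages pages and returns their URLs in visit order. */
    List<String> crawl(String seed, int maxPages) {
        Deque<String> frontier = new ArrayDeque<>(); // URLs waiting to be fetched
        Set<String> seen = new HashSet<>();          // URLs already queued, to avoid loops
        List<String> visited = new ArrayList<>();
        frontier.add(seed);
        seen.add(seed);
        while (!frontier.isEmpty() && visited.size() < maxPages) {
            String url = frontier.poll();
            String body = fetcher.apply(url);
            if (body == null) {
                continue; // page could not be fetched; skip it
            }
            visited.add(url);
            indexer.accept(url, body);
            Matcher m = HREF.matcher(body);
            while (m.find()) {
                String link = m.group(1);
                if (seen.add(link)) { // add() returns false if the link was already queued
                    frontier.add(link);
                }
            }
        }
        return visited;
    }
}
```

Keeping the fetcher and indexer as plain callbacks means the crawl loop can
be tried against an in-memory map of pages before any HTTP client or Lucene
index is wired in. A real crawl would also need relative-URL resolution,
robots.txt handling, and politeness delays, which the crawlers listed above
already provide.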

- Mark

Michael McCandless wrote:
> Have a look at Apache droids?
>
>     http://incubator.apache.org/droids/
>
> Mike


-- 
- Mark

http://www.lucidimagination.com






Re: Apache Lucene Crawler search

Posted by Michael McCandless <lu...@mikemccandless.com>.
Have a look at Apache droids?

    http://incubator.apache.org/droids/

Mike

