You are viewing a plain text version of this content. The canonical link for it is here.

Posted to java-user@lucene.apache.org by Clemens Marschner <cm...@lanlab.de> on 2003/06/30 12:46:01 UTC

Re: Lucene crawler plan

There's an experimental webcrawler in the lucene-sandbox area called
larm-webcrawler (see
http://jakarta.apache.org/lucene/docs/lucene-sandbox/larm/overview.html),

and a project on Sourceforge (http://larm.sf.net) that tries to leverage
this on a higher level. I want to encourage you to go on that side and read
through the specs in sourceforge's CVS.

It concludes pretty much everything that Andy wrote in his proposal, and
more. The project only contains conceptual documents at this time, but if
you're willing to contribute actively, that's very appreciated.

Unfortunately I have to stop my efforts regarding LARM. Long story short: My
future employer says it's too close to their business. But in contrast to
other open source projects, there's already lots of ideas in that document
and lots of code in the old crawler. If you wish to contribute, it's now up
to you.

Clemens



----- Original Message ----- 
From: "Andrew C. Oliver" <ac...@apache.org>
To: "Peter Becker" <pb...@dstc.edu.au>
Cc: "Lucene Developers List" <lu...@jakarta.apache.org>
Sent: Friday, June 27, 2003 2:53 AM
Subject: Re: Lucene crawler plan


> On 6/26/03 8:33 PM, "Peter Becker" <pb...@dstc.edu.au> wrote:
>
> > Hi Andrew,
> >
> > are you the Andy signing this:
> > http://jakarta.apache.org/lucene/docs/luceneplan.html? If no -- do you
> > know who wrote the page and could you forward this email? Thanks. BTW:
> > your website link on http://jakarta.apache.org/lucene/docs/whoweare.html
> > is dead.
> >
>
> Yes I wrote it.
>
> >
> > The question is: is there some code already? If yes: can we get it? Can
> > we join the effort? If no: what are things we should consider doing to
> > increase our chances that you guys accept our code in the end? We are
> > not really interested in maintaining the crawler bits and pieces, our
> > interest is in the visualization. We are happy to get something going as
> > part of our little demonstrator, but then we'd give it to you and hope
> > someone picks up maintenance.
> >
>
> I never wrote any code, but there is code in lucene-contrib which realized
> most of what is in this document.  I was going to write code, but someone
> beat me to the punch and I was like "wow I have things I can do that
others
> won't do for me" and moved on :-)
>
> I'm cc'ing lucene developers list.  You'll find plenty of folks interested
> in working with you on this.
>
> -Andy
> > Is this all an option anyway? It is ok to say no ;-)
> >
> > Regards,
> >  Peter
> >
>
> -- 
> Andrew C. Oliver
> http://www.superlinksoftware.com/poi.jsp
> Custom enhancements and Commercial Implementation for Jakarta POI
>
> http://jakarta.apache.org/poi
> For Java and Excel, Got POI?
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
> For additional commands, e-mail: lucene-dev-help@jakarta.apache.org
>


---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-user-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-user-help@jakarta.apache.org