You are viewing a plain text version of this content. The canonical link for it is here.
Posted to java-user@lucene.apache.org by Paul Hill <pa...@metajure.com> on 2012/06/09 01:24:07 UTC

RE: CodeMaps updates for Lucene

As text retrieval geeks, we hate manual tagging :-) We want you to analyze the content (might I suggest using Lucene and Mahout) and categorize it for us. :-)
But jokes aside, a major category of tags would be "(text) analysis" or "tokenization" or "term processing for indexing" -- all that stuff related to Analysis and all the language specific stuff that is needed before something is put in the index.

And then there are all the classes that represent the information from the index, they are used in indexing (putting in) and searching (taking out), but they really are index related, not specifically indexing or searching.  Oh the problems coming up with a good set of tags...

Note, I added a handful of tags anyway.

-Paul

> -----Original Message-----
> From: Seth Rosen [mailto:seth@architexa.com]
> We were wondering if you guys could do us a quick favor. Would you take 5-minutes, go to the site, and
> tag code as being performance-sensitive, query-related, indexing, needs-refactoring, or something similar.
> We believe that with people tagging the Lucene codebase - it will become an awesome community
> resource.

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: CodeMaps updates for Lucene

Posted by Abhishek Rakshit <ab...@architexa.com>.
So I just looked at the tags added by Paul and quickly tagged a few other
related items. You can see them here:
http://www.codemaps.org/s/Lucene/t/query-related

Do you think this might be helpful to people trying to learn more about
queries?

(It seems that the site has a bug. I tagged some of the existing content on
the site, but it is right now saying that the content was created today)

-Abhishek

Disclaimer: I have been working on CodeMaps for a bit - hoping that it will
be helpful on my code.


On Fri, Jun 8, 2012 at 7:24 PM, Paul Hill <pa...@metajure.com> wrote:

> As text retrieval geeks, we hate manual tagging :-) We want you to analyze
> the content (might I suggest using Lucene and Mahout) and categorize it for
> us. :-)
> But jokes aside, a major category of tags would be "(text) analysis" or
> "tokenization" or "term processing for indexing" -- all that stuff related
> to Analysis and all the language specific stuff that is needed before
> something is put in the index.
>
> And then there are all the classes that represent the information from the
> index, they are used in indexing (putting in) and searching (taking out),
> but they really are index related, not specifically indexing or searching.
>  Oh the problems coming up with a good set of tags...
>
> Note, I added a handful of tags anyway.
>
> -Paul
>
> > -----Original Message-----
> > From: Seth Rosen [mailto:seth@architexa.com]
> > We were wondering if you guys could do us a quick favor. Would you take
> 5-minutes, go to the site, and
> > tag code as being performance-sensitive, query-related, indexing,
> needs-refactoring, or something similar.
> > We believe that with people tagging the Lucene codebase - it will become
> an awesome community
> > resource.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-user-help@lucene.apache.org
>
>