You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by lucenenew <mi...@yahoo.com> on 2009/11/09 16:02:27 UTC

Lucene - Text Classification.

i want to classify sentences stored as strings to a bunch of keywords related
to a certain category.

so i will have 10 strings which will be a sentence long. and i will want to
compare each string to a set of 30 keywords stored somewhere, and then
compare with another set of 30 keywords, so on.

i want to rank each string based on the number of times it matches a set of
keywords. so basically i want to categorize each sentence.

is this possible with lucene, or would any other approach be more efficient.

will this process take long? in terms of speed of program.

and what tools would i need?

any help would be great.

thanks.
-- 
View this message in context: http://old.nabble.com/Lucene---Text-Classification.-tp26267794p26267794.html
Sent from the Lucene - Java Developer mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org


Re: Lucene - Text Classification.

Posted by Erick Erickson <er...@gmail.com>.
Please re-post this question on the lucene user's list, this list is
intended for development discussions....

Best
Erick

On Mon, Nov 9, 2009 at 10:02 AM, lucenenew <mi...@yahoo.com> wrote:

>
> i want to classify sentences stored as strings to a bunch of keywords
> related
> to a certain category.
>
> so i will have 10 strings which will be a sentence long. and i will want to
> compare each string to a set of 30 keywords stored somewhere, and then
> compare with another set of 30 keywords, so on.
>
> i want to rank each string based on the number of times it matches a set of
> keywords. so basically i want to categorize each sentence.
>
> is this possible with lucene, or would any other approach be more
> efficient.
>
> will this process take long? in terms of speed of program.
>
> and what tools would i need?
>
> any help would be great.
>
> thanks.
> --
> View this message in context:
> http://old.nabble.com/Lucene---Text-Classification.-tp26267794p26267794.html
> Sent from the Lucene - Java Developer mailing list archive at Nabble.com.
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
> For additional commands, e-mail: java-dev-help@lucene.apache.org
>
>