You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@mahout.apache.org by Marko Ciric <ci...@gmail.com> on 2011/07/21 14:49:31 UTC

Evaluating boolean preference data sets

Hi guys,

I wonder if Mahout should have a "precision and recall" evaluator that
calculates the relevant items data set without looking to the relevance
threshold. This would be suitable for data sets with boolean preference
nature. In addition, the relevant items can be removed from the training
data set by random (removing first couple of preferred items every time
wouldn't be a great idea).

On the other hand, having relevance threshold
with RecommenderIRStatsEvaluator set to 1.0 removes exactly "at" number of
items. As the recommender returns that number of items, the precision and
recall would have the same value. Is this Ok or is it a bug, given that
  precision = intersection / num_recommended_items (where
num_recommended_items is almost always "at")
  recall = intersection / num_relevant_items (also "at" as the previously
mentioned why relevanceThreshold is 1.0)?


--
Marko Ćirić
ciric.marko@gmail.com

Re: Evaluating boolean preference data sets

Posted by Marko Ciric <ci...@gmail.com>.

Also the evaluation could be done per user, and thus manually running
multiple times per each user. Or simple defining a matrix with relevant
items per each user..
On Jul 21, 2011 4:18 PM, "Marko Ciric" <ci...@gmail.com> wrote:
> Yes, there should exist an evaluation that allows you to pass which items
> are relevant. On the other hand, generally speaking, I am also trying to
> evaluate with having relevant items all chosen randomly. Maybe both
> implementations should exist.
>
> On 21 July 2011 15:59, Sean Owen <sr...@gmail.com> wrote:
>
>> You mean, have the user specify all items that are considered relevant?
yes
>> that could be useful. Do you have a patch in mind?
>>
>> Your analysis is correct, and I would not call it a bug. It's a symptom
of
>> how little information the evaluation has to work with here without
>> ratings.
>> It has to pick random items as "relevant", for starters. It's another
>> reason
>> your idea is good, to let the user specify those relevant items.
>>
>> On Thu, Jul 21, 2011 at 1:49 PM, Marko Ciric <ci...@gmail.com>
>> wrote:
>>
>> > Hi guys,
>> >
>> > I wonder if Mahout should have a "precision and recall" evaluator that
>> > calculates the relevant items data set without looking to the relevance
>> > threshold. This would be suitable for data sets with boolean preference
>> > nature. In addition, the relevant items can be removed from the
training
>> > data set by random (removing first couple of preferred items every time
>> > wouldn't be a great idea).
>> >
>> > On the other hand, having relevance threshold
>> > with RecommenderIRStatsEvaluator set to 1.0 removes exactly "at" number
>> of
>> > items. As the recommender returns that number of items, the precision
and
>> > recall would have the same value. Is this Ok or is it a bug, given that
>> > precision = intersection / num_recommended_items (where
>> > num_recommended_items is almost always "at")
>> > recall = intersection / num_relevant_items (also "at" as the previously
>> > mentioned why relevanceThreshold is 1.0)?
>> >
>> >
>> > --
>> > Marko Ćirić
>> > ciric.marko@gmail.com
>> >
>>
>
>
>
> --
> --
> Marko Ćirić
> ciric.marko@gmail.com

Re: Evaluating boolean preference data sets

Posted by Marko Ciric <ci...@gmail.com>.

Yes, there should exist an evaluation that allows you to pass which items
are relevant. On the other hand, generally speaking, I am also trying to
evaluate with having relevant items all chosen randomly. Maybe both
implementations should exist.

On 21 July 2011 15:59, Sean Owen <sr...@gmail.com> wrote:

> You mean, have the user specify all items that are considered relevant? yes
> that could be useful. Do you have a patch in mind?
>
> Your analysis is correct, and I would not call it a bug. It's a symptom of
> how little information the evaluation has to work with here without
> ratings.
> It has to pick random items as "relevant", for starters. It's another
> reason
> your idea is good, to let the user specify those relevant items.
>
> On Thu, Jul 21, 2011 at 1:49 PM, Marko Ciric <ci...@gmail.com>
> wrote:
>
> > Hi guys,
> >
> > I wonder if Mahout should have a "precision and recall" evaluator that
> > calculates the relevant items data set without looking to the relevance
> > threshold. This would be suitable for data sets with boolean preference
> > nature. In addition, the relevant items can be removed from the training
> > data set by random (removing first couple of preferred items every time
> > wouldn't be a great idea).
> >
> > On the other hand, having relevance threshold
> > with RecommenderIRStatsEvaluator set to 1.0 removes exactly "at" number
> of
> > items. As the recommender returns that number of items, the precision and
> > recall would have the same value. Is this Ok or is it a bug, given that
> >  precision = intersection / num_recommended_items (where
> > num_recommended_items is almost always "at")
> >  recall = intersection / num_relevant_items (also "at" as the previously
> > mentioned why relevanceThreshold is 1.0)?
> >
> >
> > --
> > Marko Ćirić
> > ciric.marko@gmail.com
> >
>



-- 
--
Marko Ćirić
ciric.marko@gmail.com

Re: Evaluating boolean preference data sets

Posted by Sean Owen <sr...@gmail.com>.

You mean, have the user specify all items that are considered relevant? yes
that could be useful. Do you have a patch in mind?

Your analysis is correct, and I would not call it a bug. It's a symptom of
how little information the evaluation has to work with here without ratings.
It has to pick random items as "relevant", for starters. It's another reason
your idea is good, to let the user specify those relevant items.

On Thu, Jul 21, 2011 at 1:49 PM, Marko Ciric <ci...@gmail.com> wrote:

> Hi guys,
>
> I wonder if Mahout should have a "precision and recall" evaluator that
> calculates the relevant items data set without looking to the relevance
> threshold. This would be suitable for data sets with boolean preference
> nature. In addition, the relevant items can be removed from the training
> data set by random (removing first couple of preferred items every time
> wouldn't be a great idea).
>
> On the other hand, having relevance threshold
> with RecommenderIRStatsEvaluator set to 1.0 removes exactly "at" number of
> items. As the recommender returns that number of items, the precision and
> recall would have the same value. Is this Ok or is it a bug, given that
>  precision = intersection / num_recommended_items (where
> num_recommended_items is almost always "at")
>  recall = intersection / num_relevant_items (also "at" as the previously
> mentioned why relevanceThreshold is 1.0)?
>
>
> --
> Marko Ćirić
> ciric.marko@gmail.com
>