You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@mahout.apache.org by Byung-Gon Chun <bg...@gmail.com> on 2010/04/14 19:50:13 UTC
workload with highly variable execution time
Hi,
I'm interested in looking at workload with highly variable mapper/reducer
execution time.
Which mahout workload do you recommend to run? I see clustering,
classification, recommendation, etc. Also, could you recommend what dataset
to use?
Thanks.
-Brian
Re: workload with highly variable execution time
Posted by Ted Dunning <te...@gmail.com>.
These are all very predictable, I think, since they are the sum of a lot of
small operations that don't vary much.
On Wed, Apr 14, 2010 at 2:16 PM, Byung-Gon Chun <bg...@gmail.com> wrote:
> > This may be too dumb but if your goal is to run a mapper/reducer that
> > takes a long time sometimes, just have then run a mapper that sleeps a
> > long time some times? a random number of milliseconds with wide range?
> >
> >
> Yes. But I'd like to find realistic workload. How about classification or
> clustering examples?
Re: workload with highly variable execution time
Posted by Byung-Gon Chun <bg...@gmail.com>.
Thanks, Sean.
On Wed, Apr 14, 2010 at 12:38 PM, Sean Owen <sr...@gmail.com> wrote:
> (I can tell you the recommender bits are pretty steady and
> predictable, so maybe not your best bet.)
>
> This may be too dumb but if your goal is to run a mapper/reducer that
> takes a long time sometimes, just have then run a mapper that sleeps a
> long time some times? a random number of milliseconds with wide range?
>
>
Yes. But I'd like to find realistic workload. How about classification or
clustering examples?
Thanks.
- Brian
> On Wed, Apr 14, 2010 at 6:50 PM, Byung-Gon Chun <bg...@gmail.com> wrote:
> > Hi,
> >
> > I'm interested in looking at workload with highly variable mapper/reducer
> > execution time.
> > Which mahout workload do you recommend to run? I see clustering,
> > classification, recommendation, etc. Also, could you recommend what
> dataset
> > to use?
> >
> > Thanks.
> > -Brian
> >
>
Re: workload with highly variable execution time
Posted by Sean Owen <sr...@gmail.com>.
(I can tell you the recommender bits are pretty steady and
predictable, so maybe not your best bet.)
This may be too dumb but if your goal is to run a mapper/reducer that
takes a long time sometimes, just have then run a mapper that sleeps a
long time some times? a random number of milliseconds with wide range?
On Wed, Apr 14, 2010 at 6:50 PM, Byung-Gon Chun <bg...@gmail.com> wrote:
> Hi,
>
> I'm interested in looking at workload with highly variable mapper/reducer
> execution time.
> Which mahout workload do you recommend to run? I see clustering,
> classification, recommendation, etc. Also, could you recommend what dataset
> to use?
>
> Thanks.
> -Brian
>