You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by sirvan paraste <si...@gmail.com> on 2013/07/05 21:42:48 UTC

How to evaluate StreamingKmeans in mahout .8

Hi,
I asked this question last week,

In generated output for streamingkmenas , there is now information of
elements of each clusters,
I think it should be implemented, please guide me, what I must do? I am new
to mahout

thanks,

Re: How to evaluate StreamingKmeans in mahout .8

Posted by Ted Dunning <te...@gmail.com>.
Sirvan,

If you have general questions about the code, sending email to the dev list
is ideal.

To track a specific effort and especially to collect code patches, you
should file a JIRA as you suggest.



On Fri, Jul 5, 2013 at 9:59 PM, Suneel Marthi <su...@yahoo.com>wrote:

> Create a Jira and submit a patch for review.
>
>
>
>
> ________________________________
>  From: sirvan paraste <si...@gmail.com>
> To: dev@mahout.apache.org
> Sent: Saturday, July 6, 2013 12:30 AM
> Subject: Re: How to evaluate StreamingKmeans in mahout .8
>
>
> Dear Ted,
>
>
> It will be done and I will report the resulting code.
> Therefore, How to ask question about code and report it? and where to
> attach the implemented code? I think it should be done in jira.
> Thank you.
>
>
>
> On Sat, Jul 6, 2013 at 12:21 AM, Ted Dunning <te...@gmail.com>
> wrote:
>
> > That information is discarded in order to make the streaming job
> possible.
> >
> > To recreate this information, you would need to do another pass over the
> > data in order to assign records to the closest centroid.  There may
> already
> > be such a job, but it is not likely to be compatible with the streaming
> > k-means.
> >
> > Making these pieces fit (or writing them) would be an excellent project
> and
> > I am sure that the community would help you in doing so.
> >
> >
> >
> > On Fri, Jul 5, 2013 at 12:42 PM, sirvan paraste <
> sirvan.paraste@gmail.com
> > >wrote:
> >
> > > Hi,
> > > I asked this question last week,
> > >
> > > In generated output for streamingkmenas , there is now information of
> > > elements of each clusters,
> > > I think it should be implemented, please guide me, what I must do? I am
> > new
> > > to mahout
> > >
> > > thanks,
> > >
> >
>

Re: How to evaluate StreamingKmeans in mahout .8

Posted by Suneel Marthi <su...@yahoo.com>.
Create a Jira and submit a patch for review.




________________________________
 From: sirvan paraste <si...@gmail.com>
To: dev@mahout.apache.org 
Sent: Saturday, July 6, 2013 12:30 AM
Subject: Re: How to evaluate StreamingKmeans in mahout .8
 

Dear Ted,


It will be done and I will report the resulting code.
Therefore, How to ask question about code and report it? and where to
attach the implemented code? I think it should be done in jira.
Thank you.



On Sat, Jul 6, 2013 at 12:21 AM, Ted Dunning <te...@gmail.com> wrote:

> That information is discarded in order to make the streaming job possible.
>
> To recreate this information, you would need to do another pass over the
> data in order to assign records to the closest centroid.  There may already
> be such a job, but it is not likely to be compatible with the streaming
> k-means.
>
> Making these pieces fit (or writing them) would be an excellent project and
> I am sure that the community would help you in doing so.
>
>
>
> On Fri, Jul 5, 2013 at 12:42 PM, sirvan paraste <sirvan.paraste@gmail.com
> >wrote:
>
> > Hi,
> > I asked this question last week,
> >
> > In generated output for streamingkmenas , there is now information of
> > elements of each clusters,
> > I think it should be implemented, please guide me, what I must do? I am
> new
> > to mahout
> >
> > thanks,
> >
>

Re: How to evaluate StreamingKmeans in mahout .8

Posted by sirvan paraste <si...@gmail.com>.
Dear Ted,


It will be done and I will report the resulting code.
Therefore, How to ask question about code and report it? and where to
attach the implemented code? I think it should be done in jira.
Thank you.



On Sat, Jul 6, 2013 at 12:21 AM, Ted Dunning <te...@gmail.com> wrote:

> That information is discarded in order to make the streaming job possible.
>
> To recreate this information, you would need to do another pass over the
> data in order to assign records to the closest centroid.  There may already
> be such a job, but it is not likely to be compatible with the streaming
> k-means.
>
> Making these pieces fit (or writing them) would be an excellent project and
> I am sure that the community would help you in doing so.
>
>
>
> On Fri, Jul 5, 2013 at 12:42 PM, sirvan paraste <sirvan.paraste@gmail.com
> >wrote:
>
> > Hi,
> > I asked this question last week,
> >
> > In generated output for streamingkmenas , there is now information of
> > elements of each clusters,
> > I think it should be implemented, please guide me, what I must do? I am
> new
> > to mahout
> >
> > thanks,
> >
>

Re: How to evaluate StreamingKmeans in mahout .8

Posted by Ted Dunning <te...@gmail.com>.
That information is discarded in order to make the streaming job possible.

To recreate this information, you would need to do another pass over the
data in order to assign records to the closest centroid.  There may already
be such a job, but it is not likely to be compatible with the streaming
k-means.

Making these pieces fit (or writing them) would be an excellent project and
I am sure that the community would help you in doing so.



On Fri, Jul 5, 2013 at 12:42 PM, sirvan paraste <si...@gmail.com>wrote:

> Hi,
> I asked this question last week,
>
> In generated output for streamingkmenas , there is now information of
> elements of each clusters,
> I think it should be implemented, please guide me, what I must do? I am new
> to mahout
>
> thanks,
>