You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Robin Anil (JIRA)" <ji...@apache.org> on 2009/10/06 15:01:31 UTC

[jira] Created: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Add mahout shell script for easy launching of various algorithms
----------------------------------------------------------------

                 Key: MAHOUT-185
                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
             Project: Mahout
          Issue Type: New Feature
    Affects Versions: 0.2
         Environment: linux, bash
            Reporter: Robin Anil
             Fix For: 0.2


Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following

mahout classify -algorithm bayes [OPTIONS]
mahout cluster -algorithm canopy  [OPTIONS]
mahout fpm -algorithm pfpgrowth [OPTIONS]
mahout taste -algorithm slopeone [OPTIONS] 
mahout misc -algorithm createVectorsFromText [OPTIONS]
mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Ted Dunning <te...@gmail.com>.
Surely there is a clever way to use annotations for this.  Not that I know
what it might be.

On Fri, Feb 5, 2010 at 4:05 AM, Robin Anil (JIRA) <ji...@apache.org> wrote:

> If we go like this we might have too many options. Any way to streamline
> this ?
>
> One thought i have is to have package level Main classes in Core like
> org.apache.mahout.Clustering.java which internally calls the different main
> functions ?




-- 
Ted Dunning, CTO
DeepDyve

[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Drew Farris (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Drew Farris updated MAHOUT-185:
-------------------------------

    Attachment: MAHOUT-185.patch

This patch adds bin/mahout, a simple bash script based heavily on similar scripts found in hadoop and nutch. Doesn't follow Robin's original spec to the letter, but perhaps this is a reasonable start upon which we can build. 

I really put this together because I'm tired of typing 'mvn exec:java -D [...]' all the time. 




> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Jake Mannix (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12832239#action_12832239 ] 

Jake Mannix commented on MAHOUT-185:
------------------------------------

Why don't we just commit the shell script and close this for now - it's useful as is.  

We can open another ticket for 0.4 around doing something more around the lines that Robin mentions above (which I've got partially complete on my local git repo).

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.4
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Ted Dunning (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797277#action_12797277 ] 

Ted Dunning commented on MAHOUT-185:
------------------------------------

Regarding the properties file idea, I have had very good luck with a convention that I now use pretty ubiquitously.  Each application has a default properties file that is baked into the jar file.  This allows slow changes subject to recompilation.  All of these default properties are subject to over-ride in an external property file found in the class path or the current working directory.  These over-rides are monitored for changes to allow on-the-fly reconfiguration of long-running processes.

For transaction systems (not Mahout-like stuff), I also allow requests to contain an additional over-ride map of properties.  This allows certain things to be changed on a request by request basis.  This helps enormously because it allows almost anything to be the subject of A/B testing.

 

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12832661#action_12832661 ] 

Grant Ingersoll commented on MAHOUT-185:
----------------------------------------

Committed revision 909120.

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>         Environment: linux, bash
>            Reporter: Robin Anil
>            Assignee: Grant Ingersoll
>             Fix For: 0.3
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll updated MAHOUT-185:
-----------------------------------

    Affects Version/s:     (was: 0.2)
        Fix Version/s:     (was: 0.4)
                       0.3

I'll take it.

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>         Environment: linux, bash
>            Reporter: Robin Anil
>            Assignee: Grant Ingersoll
>             Fix For: 0.3
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-185:
-----------------------------

    Fix Version/s:     (was: 0.3)
                   0.4

This timed out for 0.3 methinks

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.4
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Robin Anil (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12830077#action_12830077 ] 

Robin Anil commented on MAHOUT-185:
-----------------------------------

I like the script as i am running k-means these days :)
{code}
if [ "$COMMAND" = "vectordump" ] ; then
  CLASS=org.apache.mahout.utils.vectors.VectorDumper
elif [ "$COMMAND" = "clusterdump" ] ; then
  CLASS=org.apache.mahout.utils.clustering.ClusterDumper
elif [ "$COMMAND" = "seqdump" ] ; then
  CLASS=org.apache.mahout.utils.SequenceFileDumper
elif [ "$COMMAND" = "kmeans" ] ; then
  CLASS=org.apache.mahout.clustering.kmeans.KMeansDriver
elif [ "$COMMAND" = "canopy" ] ; then
  CLASS=org.apache.mahout.clustering.canopy.CanopyDriver
elif [ "$COMMAND" = "lucenevector" ]; then
  CLASS=org.apache.mahout.utils.vectors.lucene.Driver
elif [ "$COMMAND" = "seqdirectory" ]; then
  CLASS=org.apache.mahout.text.SequenceFilesFromDirectory
elif [ "$COMMAND" = "seqwiki" ]; then
  CLASS=org.apache.mahout.text.WikipediaToSequenceFile
{code}

If we go like this we might have too many options. Any way to streamline this ?

One thought i have is to have package level Main classes in Core like org.apache.mahout.Clustering.java which internally calls the different main functions ?
Similarly in examples and util we can keep One Entry class each Examples.java and Util.java

So with this limited set we can keep a global conf object which implements Tool and the fs object which is the default filesystem as specified by the conf
This way each algorithm can request a conf object (which copies everything Tool has set)
How does that sound? I can whip up all the main classes tonight











> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12795991#action_12795991 ] 

Sean Owen commented on MAHOUT-185:
----------------------------------

I like the idea. Is there any elaboration or movement on this?

I wonder to what extent we can make every "thing" in Mahout a command-line program? For example the CF bits aren't quite like that. Well, you could make about 15 different sets of args for the 15 different variations of a CF algorithm. Or you could make some general framework for taking a class with main() and args, but then we approach just reproducing "java". 

And then there are the Hadoop-related versions of everything, which already provide a "Job" class or "Driver" class to run it from the command line. It might be undesirable to duplicate this.

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Jake Mannix (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12797233#action_12797233 ] 

Jake Mannix commented on MAHOUT-185:
------------------------------------

As a note on this:  one of the things I've sometimes done (and we do for managing our Hadoop jobs at LinkedIn) to make dealing with messy CLI stuff more managable, is to also allow for Properties files with default arguments for various jobs (makes for much more easily reproducible results, and it's self documenting - just have "mahout classify" look first in classify.props to see if default args are defined, go from there...).

Using a base class like hadoop's Tool, you can leverage ToolRunner and GenericOptionsParser as well, and then hooking in a Properties-based way to run it as well makes it pretty flexible.

It would be really nice to consolidate all of our Driver/Job classes into this issue, so that it's a) not duplicated, but b) in one place.  

This issue should get some priority - it will seriously help with our usability if there's an easy way to launch all the various tasks from one simple place.  I'd love to have a little jruby script to run some of this stuff too, because when I was first writing decomposer, I found it invaluable to be able to just drop into jirb's REPL and start issuing java commands to run the various Hadoop jobs I was testing.

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12832626#action_12832626 ] 

Grant Ingersoll commented on MAHOUT-185:
----------------------------------------

Looks like a good start.  Longer term, we might want to integrate launching EC2, etc.: http://openbixo.org/documentation/running-bixo-in-ec2/



> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.4
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Assigned: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Grant Ingersoll (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll reassigned MAHOUT-185:
--------------------------------------

    Assignee: Grant Ingersoll

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>            Assignee: Grant Ingersoll
>             Fix For: 0.4
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Robin Anil <ro...@gmail.com>.
Yeah for now the former. Am I right deneche?

On Sat, Feb 13, 2010 at 12:11 AM, Drew Farris <dr...@gmail.com> wrote:

> Ok, this is better: http://pastebin.com/d3783fa0f
>
> The only overlap is:
>
> BuildForest org.apache.mahout.df.mapred.BuildForest
> BuildForest org.apache.mahout.df.mapreduce.BuildForest
>
> We only want to choose one of those anyway, right?
>
> On Fri, Feb 12, 2010 at 1:36 PM, Drew Farris <dr...@gmail.com>
> wrote:
> > bah, no there are some dupes in there.
> >
> > On Fri, Feb 12, 2010 at 1:35 PM, Drew Farris <dr...@gmail.com>
> wrote:
> >> how about this:
> >> http://pastebin.com/m55f8dced
> >>
> >> On Fri, Feb 12, 2010 at 1:26 PM, Robin Anil <ro...@gmail.com>
> wrote:
> >>> We can always rename them. Its an easier way out. The other way out is
> to
> >>> keep a mapping list.
> >>>
> >>> Plenty of options on the table. Just need to choose one.
> >>>
> >
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Drew Farris <dr...@gmail.com>.
Ok, this is better: http://pastebin.com/d3783fa0f

The only overlap is:

BuildForest org.apache.mahout.df.mapred.BuildForest
BuildForest org.apache.mahout.df.mapreduce.BuildForest

We only want to choose one of those anyway, right?

On Fri, Feb 12, 2010 at 1:36 PM, Drew Farris <dr...@gmail.com> wrote:
> bah, no there are some dupes in there.
>
> On Fri, Feb 12, 2010 at 1:35 PM, Drew Farris <dr...@gmail.com> wrote:
>> how about this:
>> http://pastebin.com/m55f8dced
>>
>> On Fri, Feb 12, 2010 at 1:26 PM, Robin Anil <ro...@gmail.com> wrote:
>>> We can always rename them. Its an easier way out. The other way out is to
>>> keep a mapping list.
>>>
>>> Plenty of options on the table. Just need to choose one.
>>>
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Drew Farris <dr...@gmail.com>.
bah, no there are some dupes in there.

On Fri, Feb 12, 2010 at 1:35 PM, Drew Farris <dr...@gmail.com> wrote:
> how about this:
> http://pastebin.com/m55f8dced
>
> On Fri, Feb 12, 2010 at 1:26 PM, Robin Anil <ro...@gmail.com> wrote:
>> We can always rename them. Its an easier way out. The other way out is to
>> keep a mapping list.
>>
>> Plenty of options on the table. Just need to choose one.
>>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Drew Farris <dr...@gmail.com>.
how about this:
http://pastebin.com/m55f8dced

On Fri, Feb 12, 2010 at 1:26 PM, Robin Anil <ro...@gmail.com> wrote:
> We can always rename them. Its an easier way out. The other way out is to
> keep a mapping list.
>
> Plenty of options on the table. Just need to choose one.
>
> On Fri, Feb 12, 2010 at 11:53 PM, Jake Mannix <ja...@gmail.com> wrote:
>
>> Don't we have a few examples of "Driver" as the class name?
>>
>> On Fri, Feb 12, 2010 at 10:21 AM, Robin Anil <ro...@gmail.com> wrote:
>>
>> > To Simply things how about just use the ClassName. Everything seems
>> unique?
>> >
>> > mahout TestClassifier
>> > mahout KmeansDriver
>> > mahout ClusterDumper.
>> >
>> > its easier to explain to people if the EntryPoints a verbose enough. Like
>> > SparseVectorsFromSequenceFiles
>> >
>> > and have a script to generate mappings and detect collisions.
>> >
>> > Robin
>> >
>>
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Robin Anil <ro...@gmail.com>.
We can always rename them. Its an easier way out. The other way out is to
keep a mapping list.

Plenty of options on the table. Just need to choose one.

On Fri, Feb 12, 2010 at 11:53 PM, Jake Mannix <ja...@gmail.com> wrote:

> Don't we have a few examples of "Driver" as the class name?
>
> On Fri, Feb 12, 2010 at 10:21 AM, Robin Anil <ro...@gmail.com> wrote:
>
> > To Simply things how about just use the ClassName. Everything seems
> unique?
> >
> > mahout TestClassifier
> > mahout KmeansDriver
> > mahout ClusterDumper.
> >
> > its easier to explain to people if the EntryPoints a verbose enough. Like
> > SparseVectorsFromSequenceFiles
> >
> > and have a script to generate mappings and detect collisions.
> >
> > Robin
> >
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Drew Farris <dr...@gmail.com>.
And 'Job' in the synthetic control stuff.

On Fri, Feb 12, 2010 at 1:23 PM, Jake Mannix <ja...@gmail.com> wrote:
> Don't we have a few examples of "Driver" as the class name?
>
> On Fri, Feb 12, 2010 at 10:21 AM, Robin Anil <ro...@gmail.com> wrote:
>
>> To Simply things how about just use the ClassName. Everything seems unique?
>>
>> mahout TestClassifier
>> mahout KmeansDriver
>> mahout ClusterDumper.
>>
>> its easier to explain to people if the EntryPoints a verbose enough. Like
>> SparseVectorsFromSequenceFiles
>>
>> and have a script to generate mappings and detect collisions.
>>
>> Robin
>>
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Jake Mannix <ja...@gmail.com>.
Don't we have a few examples of "Driver" as the class name?

On Fri, Feb 12, 2010 at 10:21 AM, Robin Anil <ro...@gmail.com> wrote:

> To Simply things how about just use the ClassName. Everything seems unique?
>
> mahout TestClassifier
> mahout KmeansDriver
> mahout ClusterDumper.
>
> its easier to explain to people if the EntryPoints a verbose enough. Like
> SparseVectorsFromSequenceFiles
>
> and have a script to generate mappings and detect collisions.
>
> Robin
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Robin Anil <ro...@gmail.com>.
To Simply things how about just use the ClassName. Everything seems unique?

mahout TestClassifier
mahout KmeansDriver
mahout ClusterDumper.

its easier to explain to people if the EntryPoints a verbose enough. Like
SparseVectorsFromSequenceFiles

and have a script to generate mappings and detect collisions.

Robin

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Drew Farris <dr...@gmail.com>.
For what it is worth, if we don't get aliases for all the classes in
there, you can still to stuff like

mahout org.apache.mahout.fpm.pfpgrowth.DeliciousTagsExample ...

..and the script will setup the classpath properly and execute the
class -- just sayin' :)

I can take some of these, but what sort of naming pattern should we
use? Something like the following?:

dfBriemanExample
dfBuildForest
fpmDeliciousTagsExample
scKmeans (for syntheticcontrol.kmeans.Job)
kmeans

Do we need all of the examples?

What's the difference between the syntheticcontrol and other
clustering stuff, e.g:

org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
org.apache.mahout.clustering.kmeans.KMeansDriver

While we're at it, If someone's editing and committing, there's a
spelling error in the comments:

HADOOP_CONFIF_DIR which should be HADOOP_CONF_DIR

On Fri, Feb 12, 2010 at 1:00 PM, Robin Anil <ro...@gmail.com> wrote:
> A Sorted Version here http://pastebin.com/m25252110. Just add the name next
> to the class
>
> Robin
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Sean Owen <sr...@gmail.com>.
Oops I marked it Fixed for 0.4. It should be Fixed for 0.3 -- or in
progress for 0.4 if you want to keep it open. Change accordingly.

On Fri, Feb 12, 2010 at 6:03 PM, Jake Mannix <ja...@gmail.com> wrote:
> I have code for a more maintainable way to do this, at least on the level of
> not
> having to change code / recompile to add more, using some property files,
> but
> it's not done yet / tested fully yet.  We can open a new ticket for
> continuing,
> as it's not really a shell script.
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Jake Mannix <ja...@gmail.com>.
I have code for a more maintainable way to do this, at least on the level of
not
having to change code / recompile to add more, using some property files,
but
it's not done yet / tested fully yet.  We can open a new ticket for
continuing,
as it's not really a shell script.

  -jake

On Fri, Feb 12, 2010 at 10:00 AM, Robin Anil <ro...@gmail.com> wrote:

> A Sorted Version here http://pastebin.com/m25252110. Just add the name
> next
> to the class
>
> Robin
>

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Robin Anil <ro...@gmail.com>.
A Sorted Version here http://pastebin.com/m25252110. Just add the name next
to the class

Robin

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Grant Ingersoll <gs...@apache.org>.
On Feb 12, 2010, at 2:38 PM, Grant Ingersoll wrote:

> FWIW, the script has a way that you can pass in the classname of the program to run.    

D'oh, should always finish reading the thread before responding ;-)

-Grant

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Grant Ingersoll <gs...@apache.org>.
FWIW, the script has a way that you can pass in the classname of the program to run.    

-Grant

On Feb 12, 2010, at 12:57 PM, Robin Anil wrote:

> It not complete to really say its done. How about someone picking out all
> main classes and putting it there.
> I just did this
> 
> find . |grep -v svn |grep "java$" | xargs grep -i "main(String" | cut -d " "
> -f1|cut -d':' -f1 |cut -d "." -f2  |cut -d"/" -f 2-| sed 's/\//./g' |cut
> -d"." -f 5-
> 
> Can someone help naming them
> 
> org.apache.mahout.cf.taste.hadoop.item.RecommenderJob
> org.apache.mahout.cf.taste.hadoop.pseudo.RecommenderJob
> org.apache.mahout.cf.taste.hadoop.slopeone.SlopeOneAverageDiffsJob
> org.apache.mahout.classifier.bayes.TestClassifier
> org.apache.mahout.classifier.bayes.TrainClassifier
> org.apache.mahout.classifier.BayesFileFormatter
> org.apache.mahout.classifier.Classify
> org.apache.mahout.clustering.canopy.CanopyClusteringJob
> org.apache.mahout.clustering.canopy.CanopyDriver
> org.apache.mahout.clustering.canopy.ClusterDriver
> org.apache.mahout.clustering.dirichlet.DirichletDriver
> org.apache.mahout.clustering.dirichlet.DirichletJob
> org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver
> org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansJob
> org.apache.mahout.clustering.kmeans.KMeansDriver
> org.apache.mahout.clustering.lda.LDADriver
> org.apache.mahout.clustering.lda.LDAPrintTopics
> org.apache.mahout.clustering.meanshift.MeanShiftCanopyDriver
> org.apache.mahout.clustering.meanshift.MeanShiftCanopyJob
> org.apache.mahout.df.tools.Describe
> org.apache.mahout.df.tools.Frequencies
> org.apache.mahout.df.tools.UDistrib
> org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver
> org.apache.mahout.cf.taste.impl.LoadTest
> org.apache.mahout.clustering.kmeans.SimpleKMeansClustering
> org.apache.mahout.cf.taste.example.bookcrossing.BookCrossingRecommenderEvaluatorRunner
> org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner
> org.apache.mahout.cf.taste.example.jester.JesterRecommenderEvaluatorRunner
> org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner
> org.apache.mahout.cf.taste.example.netflix.TransposeToByUser
> org.apache.mahout.classifier.bayes.PrepareARFFBinaryFeatures
> org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups
> org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorDriver
> org.apache.mahout.classifier.bayes.WikipediaXmlSplitter
> org.apache.mahout.clustering.canopy.DisplayCanopy
> org.apache.mahout.clustering.dirichlet.Display2dASNDirichlet
> org.apache.mahout.clustering.dirichlet.DisplayASNDirichlet
> org.apache.mahout.clustering.dirichlet.DisplayASNOutputState
> org.apache.mahout.clustering.dirichlet.DisplayDirichlet
> org.apache.mahout.clustering.dirichlet.DisplayNDirichlet
> org.apache.mahout.clustering.dirichlet.DisplayOutputState
> org.apache.mahout.clustering.dirichlet.DisplaySNDirichlet
> org.apache.mahout.clustering.fuzzykmeans.DisplayFuzzyKMeans
> org.apache.mahout.clustering.kmeans.DisplayKMeans
> org.apache.mahout.clustering.meanshift.DisplayMeanShift
> org.apache.mahout.clustering.syntheticcontrol.canopy.InputDriver
> org.apache.mahout.clustering.syntheticcontrol.canopy.Job
> org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
> org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
> org.apache.mahout.clustering.syntheticcontrol.meanshift.InputDriver
> org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
> org.apache.mahout.clustering.syntheticcontrol.meanshift.OutputDriver
> org.apache.mahout.df.BreimanExample
> org.apache.mahout.df.mapred.BuildForest
> org.apache.mahout.df.mapreduce.BuildForest
> org.apache.mahout.fpm.pfpgrowth.DeliciousTagsExample
> org.apache.mahout.ga.watchmaker.cd.CDGA
> org.apache.mahout.ga.watchmaker.cd.tool.CDInfosTool
> org.apache.mahout.ga.watchmaker.travellingsalesman.TravellingSalesman
> org.apache.mahout.text.SequenceFilesFromDirectory
> org.apache.mahout.text.WikipediaToSequenceFile
> org.apache.mahout.math.decomposer.hebbian.HebbianSolver
> org.apache.mahout.math.jet.stat.quantile.QuantileCalc
> org.apache.mahout.text.SparseVectorsFromSequenceFiles
> org.apache.mahout.utils.clustering.ClusterDumper
> org.apache.mahout.utils.nlp.collocations.llr.CollocDriver
> org.apache.mahout.utils.SequenceFileDumper
> org.apache.mahout.utils.vectors.arff.Driver
> org.apache.mahout.utils.vectors.lucene.ClusterLabels
> org.apache.mahout.utils.vectors.lucene.Driver
> org.apache.mahout.utils.vectors.VectorDumper

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search


Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Robin Anil <ro...@gmail.com>.
It not complete to really say its done. How about someone picking out all
main classes and putting it there.
I just did this

find . |grep -v svn |grep "java$" | xargs grep -i "main(String" | cut -d " "
-f1|cut -d':' -f1 |cut -d "." -f2  |cut -d"/" -f 2-| sed 's/\//./g' |cut
-d"." -f 5-

Can someone help naming them

org.apache.mahout.cf.taste.hadoop.item.RecommenderJob
org.apache.mahout.cf.taste.hadoop.pseudo.RecommenderJob
org.apache.mahout.cf.taste.hadoop.slopeone.SlopeOneAverageDiffsJob
org.apache.mahout.classifier.bayes.TestClassifier
org.apache.mahout.classifier.bayes.TrainClassifier
org.apache.mahout.classifier.BayesFileFormatter
org.apache.mahout.classifier.Classify
org.apache.mahout.clustering.canopy.CanopyClusteringJob
org.apache.mahout.clustering.canopy.CanopyDriver
org.apache.mahout.clustering.canopy.ClusterDriver
org.apache.mahout.clustering.dirichlet.DirichletDriver
org.apache.mahout.clustering.dirichlet.DirichletJob
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansDriver
org.apache.mahout.clustering.fuzzykmeans.FuzzyKMeansJob
org.apache.mahout.clustering.kmeans.KMeansDriver
org.apache.mahout.clustering.lda.LDADriver
org.apache.mahout.clustering.lda.LDAPrintTopics
org.apache.mahout.clustering.meanshift.MeanShiftCanopyDriver
org.apache.mahout.clustering.meanshift.MeanShiftCanopyJob
org.apache.mahout.df.tools.Describe
org.apache.mahout.df.tools.Frequencies
org.apache.mahout.df.tools.UDistrib
org.apache.mahout.fpm.pfpgrowth.FPGrowthDriver
org.apache.mahout.cf.taste.impl.LoadTest
org.apache.mahout.clustering.kmeans.SimpleKMeansClustering
org.apache.mahout.cf.taste.example.bookcrossing.BookCrossingRecommenderEvaluatorRunner
org.apache.mahout.cf.taste.example.grouplens.GroupLensRecommenderEvaluatorRunner
org.apache.mahout.cf.taste.example.jester.JesterRecommenderEvaluatorRunner
org.apache.mahout.cf.taste.example.netflix.NetflixRecommenderEvaluatorRunner
org.apache.mahout.cf.taste.example.netflix.TransposeToByUser
org.apache.mahout.classifier.bayes.PrepareARFFBinaryFeatures
org.apache.mahout.classifier.bayes.PrepareTwentyNewsgroups
org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorDriver
org.apache.mahout.classifier.bayes.WikipediaXmlSplitter
org.apache.mahout.clustering.canopy.DisplayCanopy
org.apache.mahout.clustering.dirichlet.Display2dASNDirichlet
org.apache.mahout.clustering.dirichlet.DisplayASNDirichlet
org.apache.mahout.clustering.dirichlet.DisplayASNOutputState
org.apache.mahout.clustering.dirichlet.DisplayDirichlet
org.apache.mahout.clustering.dirichlet.DisplayNDirichlet
org.apache.mahout.clustering.dirichlet.DisplayOutputState
org.apache.mahout.clustering.dirichlet.DisplaySNDirichlet
org.apache.mahout.clustering.fuzzykmeans.DisplayFuzzyKMeans
org.apache.mahout.clustering.kmeans.DisplayKMeans
org.apache.mahout.clustering.meanshift.DisplayMeanShift
org.apache.mahout.clustering.syntheticcontrol.canopy.InputDriver
org.apache.mahout.clustering.syntheticcontrol.canopy.Job
org.apache.mahout.clustering.syntheticcontrol.dirichlet.Job
org.apache.mahout.clustering.syntheticcontrol.kmeans.Job
org.apache.mahout.clustering.syntheticcontrol.meanshift.InputDriver
org.apache.mahout.clustering.syntheticcontrol.meanshift.Job
org.apache.mahout.clustering.syntheticcontrol.meanshift.OutputDriver
org.apache.mahout.df.BreimanExample
org.apache.mahout.df.mapred.BuildForest
org.apache.mahout.df.mapreduce.BuildForest
org.apache.mahout.fpm.pfpgrowth.DeliciousTagsExample
org.apache.mahout.ga.watchmaker.cd.CDGA
org.apache.mahout.ga.watchmaker.cd.tool.CDInfosTool
org.apache.mahout.ga.watchmaker.travellingsalesman.TravellingSalesman
org.apache.mahout.text.SequenceFilesFromDirectory
org.apache.mahout.text.WikipediaToSequenceFile
org.apache.mahout.math.decomposer.hebbian.HebbianSolver
org.apache.mahout.math.jet.stat.quantile.QuantileCalc
org.apache.mahout.text.SparseVectorsFromSequenceFiles
org.apache.mahout.utils.clustering.ClusterDumper
org.apache.mahout.utils.nlp.collocations.llr.CollocDriver
org.apache.mahout.utils.SequenceFileDumper
org.apache.mahout.utils.vectors.arff.Driver
org.apache.mahout.utils.vectors.lucene.ClusterLabels
org.apache.mahout.utils.vectors.lucene.Driver
org.apache.mahout.utils.vectors.VectorDumper

Re: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Jake Mannix <ja...@gmail.com>.
Sean was pushing out tickets to 0.4 he felt had timed out, but now that
you've checked that one it, it clearly is done for 0.3.

  -jake

On Fri, Feb 12, 2010 at 9:48 AM, Grant Ingersoll <gs...@apache.org>wrote:

> Wasn't this 0.3 or did I miss a release?
>
> Begin forwarded message:
>
> > From: "Sean Owen (JIRA)" <ji...@apache.org>
> > Date: February 12, 2010 8:58:27 AM EST
> > To: mahout-dev@lucene.apache.org
> > Subject: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy
> launching of various algorithms
> > Reply-To: mahout-dev@lucene.apache.org
> >
> >
> >     [
> https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
> >
> > Sean Owen resolved MAHOUT-185.
> > ------------------------------
> >
> >       Resolution: Fixed
> >    Fix Version/s:     (was: 0.3)
> >                   0.4
> >         Assignee: Robin Anil  (was: Grant Ingersoll)
> >
> >> Add mahout shell script for easy launching of various algorithms
> >> ----------------------------------------------------------------
> >>
> >>                Key: MAHOUT-185
> >>                URL: https://issues.apache.org/jira/browse/MAHOUT-185
> >>            Project: Mahout
> >>         Issue Type: New Feature
> >>        Environment: linux, bash
> >>           Reporter: Robin Anil
> >>           Assignee: Robin Anil
> >>            Fix For: 0.4
> >>
> >>        Attachments: MAHOUT-185.patch
> >>
> >>
> >> Currently, Each algorithm has a different point of entry. At its too
> complicated to understand and launch each one.  A mahout shell script needs
> to be made in the bin directory which does something like the following
> >> mahout classify -algorithm bayes [OPTIONS]
> >> mahout cluster -algorithm canopy  [OPTIONS]
> >> mahout fpm -algorithm pfpgrowth [OPTIONS]
> >> mahout taste -algorithm slopeone [OPTIONS]
> >> mahout misc -algorithm createVectorsFromText [OPTIONS]
> >> mahout examples WikipediaExample
> >
> > --
> > This message is automatically generated by JIRA.
> > -
> > You can reply to this email to add a comment to the issue online.
> >
>
> --------------------------
> Grant Ingersoll
> http://www.lucidimagination.com/
>
> Search the Lucene ecosystem using Solr/Lucene:
> http://www.lucidimagination.com/search
>
>

Fwd: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by Grant Ingersoll <gs...@apache.org>.
Wasn't this 0.3 or did I miss a release?

Begin forwarded message:

> From: "Sean Owen (JIRA)" <ji...@apache.org>
> Date: February 12, 2010 8:58:27 AM EST
> To: mahout-dev@lucene.apache.org
> Subject: [jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms
> Reply-To: mahout-dev@lucene.apache.org
> 
> 
>     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
> 
> Sean Owen resolved MAHOUT-185.
> ------------------------------
> 
>       Resolution: Fixed
>    Fix Version/s:     (was: 0.3)
>                   0.4
>         Assignee: Robin Anil  (was: Grant Ingersoll)
> 
>> Add mahout shell script for easy launching of various algorithms
>> ----------------------------------------------------------------
>> 
>>                Key: MAHOUT-185
>>                URL: https://issues.apache.org/jira/browse/MAHOUT-185
>>            Project: Mahout
>>         Issue Type: New Feature
>>        Environment: linux, bash
>>           Reporter: Robin Anil
>>           Assignee: Robin Anil
>>            Fix For: 0.4
>> 
>>        Attachments: MAHOUT-185.patch
>> 
>> 
>> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
>> mahout classify -algorithm bayes [OPTIONS]
>> mahout cluster -algorithm canopy  [OPTIONS]
>> mahout fpm -algorithm pfpgrowth [OPTIONS]
>> mahout taste -algorithm slopeone [OPTIONS] 
>> mahout misc -algorithm createVectorsFromText [OPTIONS]
>> mahout examples WikipediaExample
> 
> -- 
> This message is automatically generated by JIRA.
> -
> You can reply to this email to add a comment to the issue online.
> 

--------------------------
Grant Ingersoll
http://www.lucidimagination.com/

Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search


[jira] Resolved: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen resolved MAHOUT-185.
------------------------------

       Resolution: Fixed
    Fix Version/s:     (was: 0.3)
                   0.4
         Assignee: Robin Anil  (was: Grant Ingersoll)

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>         Environment: linux, bash
>            Reporter: Robin Anil
>            Assignee: Robin Anil
>             Fix For: 0.4
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Robin Anil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robin Anil updated MAHOUT-185:
------------------------------

    Affects Version/s: 0.3
        Fix Version/s:     (was: 0.4)
                       0.3

Moving fixed issues to 0.3. If further work is required, will open a new one for 0.4

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.3
>         Environment: linux, bash
>            Reporter: Robin Anil
>            Assignee: Robin Anil
>             Fix For: 0.3
>
>         Attachments: MAHOUT-185.patch
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (MAHOUT-185) Add mahout shell script for easy launching of various algorithms

Posted by "Sean Owen (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAHOUT-185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-185:
-----------------------------

    Fix Version/s:     (was: 0.2)
                   0.3

> Add mahout shell script for easy launching of various algorithms
> ----------------------------------------------------------------
>
>                 Key: MAHOUT-185
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-185
>             Project: Mahout
>          Issue Type: New Feature
>    Affects Versions: 0.2
>         Environment: linux, bash
>            Reporter: Robin Anil
>             Fix For: 0.3
>
>
> Currently, Each algorithm has a different point of entry. At its too complicated to understand and launch each one.  A mahout shell script needs to be made in the bin directory which does something like the following
> mahout classify -algorithm bayes [OPTIONS]
> mahout cluster -algorithm canopy  [OPTIONS]
> mahout fpm -algorithm pfpgrowth [OPTIONS]
> mahout taste -algorithm slopeone [OPTIONS] 
> mahout misc -algorithm createVectorsFromText [OPTIONS]
> mahout examples WikipediaExample

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.