You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Videnova, Svetlana" <sv...@logica.com> on 2012/07/06 09:43:41 UTC

RE: train mahout ex

Hi guys.

I join pom.xml for mahout-distribution-0.7.
Im following this tutorial: http://cloudblog.8kmiles.com/2012/01/31/apache-mahout-a-clustering-example/

I still have errors when I execute this step: user1@ubuntu-server:~$ mvn clean install

I can't understand what's wrong about the pom.xml
This is the output:



################

/usr/local/mahout-distribution-0.7$ mvn clean install
[INFO] Scanning for projects...
[INFO] ------------------------------------------------------------------------
[ERROR] FATAL ERROR
[INFO] ------------------------------------------------------------------------
[INFO] Error building POM (may not be this project's POM).


Project ID: unknown
POM Location: /usr/local/mahout-distribution-0.7/pom.xml

Reason: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml


[INFO] ------------------------------------------------------------------------
[INFO] Trace
org.apache.maven.reactor.MavenExecutionException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
	at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:404)
	at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:272)
	at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138)
	at org.apache.maven.cli.MavenCli.main(MavenCli.java:362)
	at org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315)
	at org.codehaus.classworlds.Launcher.launch(Launcher.java:255)
	at org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430)
	at org.codehaus.classworlds.Launcher.main(Launcher.java:375)
Caused by: org.apache.maven.project.InvalidProjectModelException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1610)
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1571)
	at org.apache.maven.project.DefaultMavenProjectBuilder.buildFromSourceFileInternal(DefaultMavenProjectBuilder.java:506)
	at org.apache.maven.project.DefaultMavenProjectBuilder.build(DefaultMavenProjectBuilder.java:200)
	at org.apache.maven.DefaultMaven.getProject(DefaultMaven.java:604)
	at org.apache.maven.DefaultMaven.collectProjects(DefaultMaven.java:487)
	at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:391)
	... 12 more
Caused by: org.codehaus.plexus.util.xml.pull.XmlPullParserException: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17) 
	at org.apache.maven.model.io.xpp3.MavenXpp3Reader.parseModel(MavenXpp3Reader.java:2131)
	at org.apache.maven.model.io.xpp3.MavenXpp3Reader.read(MavenXpp3Reader.java:3913)
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1606)
	... 18 more
[INFO] ------------------------------------------------------------------------
[INFO] Total time: < 1 second
[INFO] Finished at: Fri Jul 06 08:41:33 CEST 2012
[INFO] Final Memory: 1M/15M
[INFO] ------------------------------------------------------------------------

################



Thanks


Regards


-----Message d'origine-----
De : Dave Stuart [mailto:dave@axistwelve.com] 
Envoyé : vendredi 22 juin 2012 16:44
À : Videnova, Svetlana
Objet : Re: train mahout ex 

Try double dashes for the options as per below 
 [--dir <dir> --idField <idField> --output <output> --delimiter <delimiter>    
--help --field <field> --max <max> --dictOut <dictOut> --norm <norm>           
--maxDFPercent <maxDFPercent> --weight <weight> --minDF <minDF>

> bin/mahout lucene.vector -dir /home/csi/Téléchargement/apache-solr-3.6.0/tcp/solr/data/index/ -output /tmp/part-out.vec -field text -idField id -dictOut /tmp/dict.out -norm 2


Think green - keep it on the screen.

This e-mail and any attachment is for authorised use by the intended recipient(s) only. It may contain proprietary material, confidential information and/or be subject to legal privilege. It should not be copied, disclosed to, retained or used by, any other party. If you are not an intended recipient then please promptly delete this e-mail and any attachment and all copies and inform the sender. Thank you.


Re: general mahout working / some solr questions / last version tests

Posted by Frank Scholten <fr...@frankscholten.nl>.
First make sure you can do a normal build.

It seems you have some local changes to the pom because trunk builds
fine on my machine. Do a clean checkout and run

$ mvn clean install -DskipTests=true

Second, the type of input and output depends on the job you want to run.

If you want to do clustering you run several jobs in sequence. Try the
clustering example on the Reuters news dataset.

Have a look at the folder examples/bin/cluster-reuters.sh, run it and
look at the script to see what kind of jobs it runs.

Frank

On Fri, Jul 6, 2012 at 11:45 AM, Videnova, Svetlana
<sv...@logica.com> wrote:
>
>
> Can someone please ask me to following questions:
> 1)What the input of mahout (a xml file? Which is the output of solr, that what it interests me!)?
> 2)What the output of mahout, I mean after clusterisation with k-means for exemple (a xml file again? )?
> 3)Where the output is store?
> 4)Can somebody please give me an exemple of code line command on unix ubuntu?
>  I tried this already :
>  $ $MAHOUT_HOME/bin/mahout --input my_file.txt --output output.txt
> Is that makes any sense for you?
>
>
> I know that there are some script to make solr and mahout working together, and create a connection between both, but no tutorials on this subject(either not very clear, nor too old...). Some ideas,tutoriels,forums ..?
>
>
>
>
> I'm still using this tuto: http://cloudblog.8kmiles.com/2012/01/31/apache-mahout-a-clustering-example/
> But with the implemented code from here: http://zoekja.nl/proxy/index.php?q=aHR0cHM6Ly9naXRodWIuY29tL2FwYWNoZS9tYWhvdXQ%3D
> Ps:hadoop is running OK, java set OK
>
> ################
> BUILD SUCCESSFUL
> [INFO] ------------------------------------------------------------------------
> [INFO] Total time: 77 minutes 5 seconds
> [INFO] Finished at: Fri Jul 06 10:48:45 CEST 2012
> [INFO] Final Memory: 67M/170M
> ################
> :):):):):):):):):):)
> Then thanks to : Sean Owen and his updates on http://zoekja.nl/proxy/index.php?q=aHR0cHM6Ly9naXRodWIuY29tL2FwYWNoZS9tYWhvdXQ%3D
>
>
>
>
>
>
>
>
> -----Message d'origine-----
> De : Videnova, Svetlana [mailto:svetlana.videnova@logica.com]
> Envoyé : vendredi 6 juillet 2012 09:44
> À : dev@mahout.apache.org
> Objet : RE: train mahout ex
>
> Hi guys.
>
> I join pom.xml for mahout-distribution-0.7.
> Im following this tutorial: http://cloudblog.8kmiles.com/2012/01/31/apache-mahout-a-clustering-example/
>
> I still have errors when I execute this step: user1@ubuntu-server:~$ mvn clean install
>
> I can't understand what's wrong about the pom.xml This is the output:
>
>
>
> ################
>
> /usr/local/mahout-distribution-0.7$ mvn clean install [INFO] Scanning for projects...
> [INFO] ------------------------------------------------------------------------
> [ERROR] FATAL ERROR
> [INFO] ------------------------------------------------------------------------
> [INFO] Error building POM (may not be this project's POM).
>
>
> Project ID: unknown
> POM Location: /usr/local/mahout-distribution-0.7/pom.xml
>
> Reason: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
>
>
> [INFO] ------------------------------------------------------------------------
> [INFO] Trace
> org.apache.maven.reactor.MavenExecutionException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
>         at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:404)
>         at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:272)
>         at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138)
>         at org.apache.maven.cli.MavenCli.main(MavenCli.java:362)
>         at org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>         at java.lang.reflect.Method.invoke(Method.java:597)
>         at org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315)
>         at org.codehaus.classworlds.Launcher.launch(Launcher.java:255)
>         at org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430)
>         at org.codehaus.classworlds.Launcher.main(Launcher.java:375)
> Caused by: org.apache.maven.project.InvalidProjectModelException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
>         at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1610)
>         at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1571)
>         at org.apache.maven.project.DefaultMavenProjectBuilder.buildFromSourceFileInternal(DefaultMavenProjectBuilder.java:506)
>         at org.apache.maven.project.DefaultMavenProjectBuilder.build(DefaultMavenProjectBuilder.java:200)
>         at org.apache.maven.DefaultMaven.getProject(DefaultMaven.java:604)
>         at org.apache.maven.DefaultMaven.collectProjects(DefaultMaven.java:487)
>         at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:391)
>         ... 12 more
> Caused by: org.codehaus.plexus.util.xml.pull.XmlPullParserException: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)
>         at org.apache.maven.model.io.xpp3.MavenXpp3Reader.parseModel(MavenXpp3Reader.java:2131)
>         at org.apache.maven.model.io.xpp3.MavenXpp3Reader.read(MavenXpp3Reader.java:3913)
>         at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1606)
>         ... 18 more
> [INFO] ------------------------------------------------------------------------
> [INFO] Total time: < 1 second
> [INFO] Finished at: Fri Jul 06 08:41:33 CEST 2012 [INFO] Final Memory: 1M/15M [INFO] ------------------------------------------------------------------------
>
> ################
>
>
>
> Thanks
>
>
> Regards
>
>
>
>
>
>
>
>
>
> Think green - keep it on the screen.
>
> This e-mail and any attachment is for authorised use by the intended recipient(s) only. It may contain proprietary material, confidential information and/or be subject to legal privilege. It should not be copied, disclosed to, retained or used by, any other party. If you are not an intended recipient then please promptly delete this e-mail and any attachment and all copies and inform the sender. Thank you.
>

general mahout working / some solr questions / last version tests

Posted by "Videnova, Svetlana" <sv...@logica.com>.

Can someone please ask me to following questions:
1)What the input of mahout (a xml file? Which is the output of solr, that what it interests me!)? 
2)What the output of mahout, I mean after clusterisation with k-means for exemple (a xml file again? )?
3)Where the output is store?
4)Can somebody please give me an exemple of code line command on unix ubuntu?
 I tried this already :
 $ $MAHOUT_HOME/bin/mahout --input my_file.txt --output output.txt
Is that makes any sense for you?


I know that there are some script to make solr and mahout working together, and create a connection between both, but no tutorials on this subject(either not very clear, nor too old...). Some ideas,tutoriels,forums ..?




I'm still using this tuto: http://cloudblog.8kmiles.com/2012/01/31/apache-mahout-a-clustering-example/
But with the implemented code from here: http://zoekja.nl/proxy/index.php?q=aHR0cHM6Ly9naXRodWIuY29tL2FwYWNoZS9tYWhvdXQ%3D
Ps:hadoop is running OK, java set OK

################
BUILD SUCCESSFUL
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 77 minutes 5 seconds
[INFO] Finished at: Fri Jul 06 10:48:45 CEST 2012
[INFO] Final Memory: 67M/170M
################
:):):):):):):):):):)
Then thanks to : Sean Owen and his updates on http://zoekja.nl/proxy/index.php?q=aHR0cHM6Ly9naXRodWIuY29tL2FwYWNoZS9tYWhvdXQ%3D








-----Message d'origine-----
De : Videnova, Svetlana [mailto:svetlana.videnova@logica.com] 
Envoyé : vendredi 6 juillet 2012 09:44
À : dev@mahout.apache.org
Objet : RE: train mahout ex 

Hi guys.

I join pom.xml for mahout-distribution-0.7.
Im following this tutorial: http://cloudblog.8kmiles.com/2012/01/31/apache-mahout-a-clustering-example/

I still have errors when I execute this step: user1@ubuntu-server:~$ mvn clean install

I can't understand what's wrong about the pom.xml This is the output:



################

/usr/local/mahout-distribution-0.7$ mvn clean install [INFO] Scanning for projects...
[INFO] ------------------------------------------------------------------------
[ERROR] FATAL ERROR
[INFO] ------------------------------------------------------------------------
[INFO] Error building POM (may not be this project's POM).


Project ID: unknown
POM Location: /usr/local/mahout-distribution-0.7/pom.xml

Reason: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml


[INFO] ------------------------------------------------------------------------
[INFO] Trace
org.apache.maven.reactor.MavenExecutionException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
	at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:404)
	at org.apache.maven.DefaultMaven.doExecute(DefaultMaven.java:272)
	at org.apache.maven.DefaultMaven.execute(DefaultMaven.java:138)
	at org.apache.maven.cli.MavenCli.main(MavenCli.java:362)
	at org.apache.maven.cli.compat.CompatibleMain.main(CompatibleMain.java:60)
	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597)
	at org.codehaus.classworlds.Launcher.launchEnhanced(Launcher.java:315)
	at org.codehaus.classworlds.Launcher.launch(Launcher.java:255)
	at org.codehaus.classworlds.Launcher.mainWithExitCode(Launcher.java:430)
	at org.codehaus.classworlds.Launcher.main(Launcher.java:375)
Caused by: org.apache.maven.project.InvalidProjectModelException: Parse error reading POM. Reason: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17)  for project unknown at /usr/local/mahout-distribution-0.7/pom.xml
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1610)
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1571)
	at org.apache.maven.project.DefaultMavenProjectBuilder.buildFromSourceFileInternal(DefaultMavenProjectBuilder.java:506)
	at org.apache.maven.project.DefaultMavenProjectBuilder.build(DefaultMavenProjectBuilder.java:200)
	at org.apache.maven.DefaultMaven.getProject(DefaultMaven.java:604)
	at org.apache.maven.DefaultMaven.collectProjects(DefaultMaven.java:487)
	at org.apache.maven.DefaultMaven.getProjects(DefaultMaven.java:391)
	... 12 more
Caused by: org.codehaus.plexus.util.xml.pull.XmlPullParserException: Unrecognised tag: 'relativePath' (position: START_TAG seen ...</version>\r\n  <relativePath>... @24:17) 
	at org.apache.maven.model.io.xpp3.MavenXpp3Reader.parseModel(MavenXpp3Reader.java:2131)
	at org.apache.maven.model.io.xpp3.MavenXpp3Reader.read(MavenXpp3Reader.java:3913)
	at org.apache.maven.project.DefaultMavenProjectBuilder.readModel(DefaultMavenProjectBuilder.java:1606)
	... 18 more
[INFO] ------------------------------------------------------------------------
[INFO] Total time: < 1 second
[INFO] Finished at: Fri Jul 06 08:41:33 CEST 2012 [INFO] Final Memory: 1M/15M [INFO] ------------------------------------------------------------------------

################



Thanks


Regards









Think green - keep it on the screen.

This e-mail and any attachment is for authorised use by the intended recipient(s) only. It may contain proprietary material, confidential information and/or be subject to legal privilege. It should not be copied, disclosed to, retained or used by, any other party. If you are not an intended recipient then please promptly delete this e-mail and any attachment and all copies and inform the sender. Thank you.