Posted to user@mahout.apache.org by lukee155 <lu...@hotmail.cz> on 2012/04/01 17:43:45 UTC
Mahout's K-means returns error when processing output/clusters-2
Hello, I am trying to run the k-means example from the Mahout in Action book
(Listing 7.2) on a single-node Hadoop cluster. Everything goes fine until the
clusters-2 part is processed:
12/04/01 17:23:09 INFO kmeans.KMeansDriver: K-Means Iteration 3
12/04/01 17:23:09 INFO common.HadoopUtil: Deleting output/clusters-3
12/04/01 17:23:09 WARN mapreduce.JobSubmitter: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
12/04/01 17:23:10 INFO input.FileInputFormat: Total input paths to process : 1
12/04/01 17:23:10 INFO mapreduce.JobSubmitter: number of splits:1
12/04/01 17:23:11 INFO mapred.ResourceMgrDelegate: Submitted application application_1333286828058_0009 to ResourceManager at /0.0.0.0:8040
12/04/01 17:23:11 INFO mapreduce.Job: The url to track the job: http://2668G1U:8088/proxy/application_1333286828058_0009/
12/04/01 17:23:11 INFO mapreduce.Job: Running job: job_1333286828058_0009
12/04/01 17:23:25 INFO mapreduce.Job: Job job_1333286828058_0009 running in uber mode : false
12/04/01 17:23:25 INFO mapreduce.Job: map 0% reduce 0%
12/04/01 17:23:31 INFO mapreduce.Job: map 100% reduce 0%
12/04/01 17:23:32 INFO mapreduce.Job: Task Id : attempt_1333286828058_0009_r_000000_0, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.Counter, but class was expected
12/04/01 17:23:32 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_0&filter=stdout
12/04/01 17:23:32 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_0&filter=stderr
12/04/01 17:23:37 INFO mapreduce.Job: Task Id : attempt_1333286828058_0009_r_000000_1, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.Counter, but class was expected
12/04/01 17:23:37 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_1&filter=stdout
12/04/01 17:23:37 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_1&filter=stderr
12/04/01 17:23:42 INFO mapreduce.Job: Task Id : attempt_1333286828058_0009_r_000000_2, Status : FAILED
Error: Found interface org.apache.hadoop.mapreduce.Counter, but class was expected
12/04/01 17:23:42 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_2&filter=stdout
12/04/01 17:23:42 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_2&filter=stderr
12/04/01 17:23:47 INFO mapreduce.Job: Job job_1333286828058_0009 failed with state FAILED due to:
12/04/01 17:23:47 INFO mapreduce.Job: Counters: 32
File System Counters
FILE: Number of bytes read=120
FILE: Number of bytes written=44559
FILE: Number of read operations=0
FILE: Number of large read operations=0
FILE: Number of write operations=0
HDFS: Number of bytes read=904
HDFS: Number of bytes written=0
HDFS: Number of read operations=7
HDFS: Number of large read operations=0
HDFS: Number of write operations=0
Job Counters
Failed reduce tasks=4
Launched map tasks=1
Launched reduce tasks=4
Data-local map tasks=1
Total time spent by all maps in occupied slots (ms)=4871
Total time spent by all reduces in occupied slots (ms)=12798
Map-Reduce Framework
Map input records=9
Map output records=9
Map output bytes=531
Map output materialized bytes=128
Input split bytes=121
Combine input records=9
Combine output records=2
Spilled Records=2
Failed Shuffles=0
Merged Map outputs=0
GC time elapsed (ms)=200
CPU time spent (ms)=570
Physical memory (bytes) snapshot=158449664
Virtual memory (bytes) snapshot=384389120
Total committed heap usage (bytes)=121180160
File Input Format Counters
Bytes Read=431
Exception in thread "main" java.lang.InterruptedException: K-Means Iteration failed processing output/clusters-2
at org.apache.mahout.clustering.kmeans.KMeansDriver.runIteration(KMeansDriver.java:371)
at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClustersMR(KMeansDriver.java:316)
at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:239)
at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:154)
at org.bakalarka.clusteringsample1.HelloWorldClustering.main(HelloWorldClustering.java:84)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:616)
at org.apache.hadoop.util.RunJar.main(RunJar.java:200)
Can you check this log and tell me what I am doing wrong? Thank you very
much.
RE: Mahout's K-means returns error when processing output/clusters-2
Posted by WangRamon <ra...@hotmail.com>.
Check this line of the log:
12/04/01 17:23:32 WARN mapreduce.Job: Error reading task output Server returned HTTP response code: 400 for URL: http://2668G1U:8080/tasklog?plaintext=true&attemptid=attempt_1333286828058_0009_r_000000_0&filter=stdout
How did you submit the job? It seems this node 2668G1U is configured differently, but that is just a guess.
Regards,
Ramon
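Another lead worth checking, offered as an assumption rather than a confirmed diagnosis: the line "Error: Found interface org.apache.hadoop.mapreduce.Counter, but class was expected" reads like a binary incompatibility, where the job code was compiled against a Hadoop release in which Counter is a class but runs on a release in which it is an interface. The ResourceManager lines in the log suggest a YARN-based Hadoop, which would fit that reading. The sketch below only reports how a type is declared on the current classpath; the class name ClassKindCheck and the method kindOf are hypothetical and not part of Mahout or Hadoop.

```java
public class ClassKindCheck {
    // Returns "interface", "class", or "missing" for a fully qualified type name.
    public static String kindOf(String className) {
        try {
            // initialize=false: load the type without running its static initializers.
            Class<?> c = Class.forName(className, false, ClassKindCheck.class.getClassLoader());
            return c.isInterface() ? "interface" : "class";
        } catch (ClassNotFoundException | LinkageError e) {
            // The type (or something it depends on) is not on this classpath.
            return "missing";
        }
    }

    public static void main(String[] args) {
        // Default to the type named in the failing reduce task's error message.
        String name = args.length > 0 ? args[0] : "org.apache.hadoop.mapreduce.Counter";
        System.out.println(name + " -> " + kindOf(name));
    }
}
```

Run with the cluster's Hadoop jars on the classpath, it prints whether Counter is an interface there. If it is, and the Mahout build in use was compiled against a Hadoop line where Counter is a class, rebuilding Mahout against the cluster's Hadoop version (or running on a matching Hadoop) would be the thing to try.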