You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-user@hadoop.apache.org by "ion.badita" <io...@mcr.ro> on 2007/03/10 08:22:55 UTC

Unreported Exception

Hi,

I have 17 computers cluster running hadoop 0.12.0 with jdk 6.

I runned a job with 14 maps an 3 reducers. On the Job's status map is 100% completed with 0 Failures. When click the 0 link (for the map) it shows this stack traces:

Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
	at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
	at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
	at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
	at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
	at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
	at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
	at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
	at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
	at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
	at org.mortbay.http.HttpServer.service(HttpServer.java:954)
	at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
	at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
	at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
	at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
	at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
	at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-------
Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
	at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
	at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
	at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
	at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
	at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
	at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
	at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
	at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
	at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
	at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
	at org.mortbay.http.HttpServer.service(HttpServer.java:954)
	at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
	at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
	at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
	at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
	at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
	at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)

-------



On reduces task list 2 reduces are finished and one seems to be hannged on copy phase with this status: reduce > copy (13 of 14 at 0.06 MB/s) > 

Any help will be apreciated!
Thanks
John.



Re: Unreported Exception

Posted by Gaurav Agarwal <ga...@yahoo.com>.
Hi John,
I had similar problem on Hadoop 0.12.0, where my jobs were getting hung in
reduce copy phase. I suspect it is a bug in 0.12.0 (I assume that it is not
stable yet.) for which a patch has been released on dev forum (Devraj Das
released the patch). I moved back to 0.11.2 to solve the problem.

thanks,
gaurav


Ion Badita wrote:
> 
> Hi,
> 
> I have 17 computers cluster running hadoop 0.12.0 with jdk 6.
> 
> I runned a job with 14 maps an 3 reducers. On the Job's status map is 100%
> completed with 0 Failures. When click the 0 link (for the map) it shows
> this stack traces:
> 
> Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed
> :
> java.io.FileNotFoundException:
> /dfs/mapred/local/task_0003_m_000009_0/file.out
> 	at
> org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
> 	at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
> 	at
> org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
> 	at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
> 	at
> org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
> 	at
> org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
> 	at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
> 	at
> org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
> 	at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
> 	at org.mortbay.http.HttpServer.service(HttpServer.java:954)
> 	at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
> 	at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
> 	at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
> 	at
> org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
> 	at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
> 	at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
> 
> -------
> Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed
> :
> java.io.FileNotFoundException:
> /dfs/mapred/local/task_0003_m_000009_0/file.out
> 	at
> org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
> 	at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
> 	at
> org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
> 	at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
> 	at
> org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
> 	at
> org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
> 	at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
> 	at
> org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
> 	at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
> 	at org.mortbay.http.HttpServer.service(HttpServer.java:954)
> 	at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
> 	at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
> 	at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
> 	at
> org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
> 	at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
> 	at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
> 
> -------
> 
> 
> 
> On reduces task list 2 reduces are finished and one seems to be hannged on
> copy phase with this status: reduce > copy (13 of 14 at 0.06 MB/s) > 
> 
> Any help will be apreciated!
> Thanks
> John.
> 
> 
> 
> 

-- 
View this message in context: http://www.nabble.com/Unreported-Exception-tf3380081.html#a9413104
Sent from the Hadoop Users mailing list archive at Nabble.com.