Posted to common-user@hadoop.apache.org by "ion.badita" <io...@mcr.ro> on 2007/03/10 08:22:55 UTC
Unreported Exception
Hi,
I have a 17-computer cluster running Hadoop 0.12.0 with JDK 6.
I ran a job with 14 maps and 3 reducers. On the job's status page, the map phase shows 100% complete with 0 failures. When I click the 0 link (for the map), it shows these stack traces:
Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
at org.mortbay.http.HttpServer.service(HttpServer.java:954)
at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
-------
Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :
java.io.FileNotFoundException: /dfs/mapred/local/task_0003_m_000009_0/file.out
at org.apache.hadoop.fs.ChecksumFileSystem.open(ChecksumFileSystem.java:324)
at org.apache.hadoop.fs.FileSystem.open(FileSystem.java:245)
at org.apache.hadoop.mapred.TaskTracker$MapOutputServlet.doGet(TaskTracker.java:1654)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:689)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:802)
at org.mortbay.jetty.servlet.ServletHolder.handle(ServletHolder.java:427)
at org.mortbay.jetty.servlet.WebApplicationHandler.dispatch(WebApplicationHandler.java:475)
at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:567)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1565)
at org.mortbay.jetty.servlet.WebApplicationContext.handle(WebApplicationContext.java:635)
at org.mortbay.http.HttpContext.handle(HttpContext.java:1517)
at org.mortbay.http.HttpServer.service(HttpServer.java:954)
at org.mortbay.http.HttpConnection.service(HttpConnection.java:814)
at org.mortbay.http.HttpConnection.handleNext(HttpConnection.java:981)
at org.mortbay.http.HttpConnection.handle(HttpConnection.java:831)
at org.mortbay.http.SocketListener.handleConnection(SocketListener.java:244)
at org.mortbay.util.ThreadedServer.handle(ThreadedServer.java:357)
at org.mortbay.util.ThreadPool$PoolThread.run(ThreadPool.java:534)
-------
In the reduce task list, 2 reduces have finished and one seems to be hung in the copy phase with this status: reduce > copy (13 of 14 at 0.06 MB/s) >
Any help will be appreciated!
Thanks
John.
Re: Unreported Exception
Posted by Gaurav Agarwal <ga...@yahoo.com>.
Hi John,
I had a similar problem on Hadoop 0.12.0, where my jobs were getting hung in
the reduce copy phase. I suspect it is a bug in 0.12.0 (I assume it is not
stable yet), for which Devraj Das has released a patch on the dev forum.
I moved back to 0.11.2 to solve the problem.
thanks,
gaurav
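
For anyone comparing notes on the same symptom, here is a minimal sketch for tallying the "Map output lost" messages from a saved copy of the task page or a TaskTracker log. It only relies on the message format quoted in the report above. The function name count_lost_map_outputs is hypothetical, and reading the second getMapOutput argument as the reduce partition number is an assumption, not something confirmed in this thread.

```python
import re
from collections import Counter

# Matches the message format quoted in the report, e.g.
# "Map output lost, rescheduling: getMapOutput(task_0003_m_000009_0,2) failed :"
LOST_OUTPUT = re.compile(
    r"Map output lost, rescheduling: getMapOutput\((task_\w+),(\d+)\) failed"
)


def count_lost_map_outputs(text):
    """Count failed fetches per (map task id, second getMapOutput argument).

    The second argument is assumed to be the reduce partition number.
    """
    counts = Counter()
    for match in LOST_OUTPUT.finditer(text):
        counts[(match.group(1), int(match.group(2)))] += 1
    return counts


if __name__ == "__main__":
    # Two copies of the message, as in the original report.
    sample = (
        "Map output lost, rescheduling: "
        "getMapOutput(task_0003_m_000009_0,2) failed :\n"
    ) * 2
    for (task, partition), n in sorted(count_lost_map_outputs(sample).items()):
        print(f"{task} -> partition {partition}: {n} failed fetch(es)")
```

If one map task accounts for most of the failures, that would point at the same task whose file.out is missing in the trace.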