You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mesos.apache.org by "Benjamin Mahler (JIRA)" <ji...@apache.org> on 2013/02/08 00:35:12 UTC

[jira] [Resolved] (MESOS-290) Jobtracker can't get TaskTrackerInfo when the JobTracker log file is deleted

     [ https://issues.apache.org/jira/browse/MESOS-290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Benjamin Mahler resolved MESOS-290.
-----------------------------------

    Resolution: Won't Fix

This should no longer be an issue as we've re-written the hadoop port:
https://reviews.apache.org/r/9174

Feel free to re-open.
                
> Jobtracker can't get TaskTrackerInfo when the JobTracker log file is deleted
> ----------------------------------------------------------------------------
>
>                 Key: MESOS-290
>                 URL: https://issues.apache.org/jira/browse/MESOS-290
>             Project: Mesos
>          Issue Type: Bug
>          Components: framework, java-api
>    Affects Versions: 0.9.0
>         Environment: SUSE Linux Enterprise Server 11
>            Reporter: Qinghe Jin
>            Priority: Minor
>
> For some reason, the JobTracker log file is expanding over 20G and running out of my disk partion. I delete the jobtracker log file in logs/ and restart the hadoop system, then can't get my mapreduce work. The JobTracker is suffering from IOExceptions, the stack looks like:
> 2012-10-10 09:19:31,838 INFO org.apache.hadoop.mapred.JobTracker: Adding tracker tracker_blade17:localhost.localdomain/127.0.0.1:44216 to host blade17 
> 2012-10-10 09:19:31,839 INFO org.apache.hadoop.mapred.JobTracker: Lost tracker 'tracker_blade19:localhost.localdomain/127.0.0.1:40465'
> 2012-10-10 09:19:31,839 INFO org.apache.hadoop.ipc.Server: IPC Server handler 6 on 9001, call heartbeat(org.apache.hadoop.mapred.TaskTrackerStatus@7be536d6, true, true, true, -1) from 10.10.129.17:57073: error: java.io.IOException: java.lang.RuntimeException: Expecting TaskTrackerInfo for host blade17
> java.io.IOException: java.lang.RuntimeException: Expecting TaskTrackerInfo for host blade17   at org.apache.hadoop.mapred.FrameworkScheduler.assignTasks(FrameworkScheduler.java:518)
>   at org.apache.hadoop.mapred.MesosScheduler.assignTasks(MesosScheduler.java:76)
>   at org.apache.hadoop.mapred.JobTracker.heartbeat(JobTracker.java:3398)
>   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>   at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
>   at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
>   at java.lang.reflect.Method.invoke(Method.java:597)  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:563)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1388)
>   at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:1384)
>   at java.security.AccessController.doPrivileged(Native Method)
>   at javax.security.auth.Subject.doAs(Subject.java:396)
>   at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1059)
>   at org.apache.hadoop.ipc.Server$Handler.run(Server.java:1382)
> 2012-10-10 09:19:31,839 INFO org.apache.hadoop.mapred.JobTracker: Adding tracker tracker_blade19:localhost.localdomain/127.0.0.1:40465 to host blade19
> 2012-10-10 09:19:31,839 INFO org.apache.hadoop.ipc.Server: IPC Server handler 7 on 9001, call heartbeat(org.apache.hadoop.mapred.TaskTrackerStatus@58651e95, true, true, true, -1) from 10.10.129.19:46705: error: java.io.IOException: java.lang.RuntimeException: Expecting TaskTrackerInfo for host blade19
> On the tasktracker side, it sends status to the jobtracker, but with responseid -1,just like below
> 2012-10-10 09:31:24,463 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> 2012-10-10 09:31:24,466 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> 2012-10-10 09:31:24,468 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> 2012-10-10 09:31:24,471 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> 2012-10-10 09:31:24,473 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> 2012-10-10 09:31:24,476 INFO org.apache.hadoop.mapred.TaskTracker: Resending 'status' to 'blade20' with reponseId '-1
> Is there any quick answer for this situation?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira