You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Devaraj K (JIRA)" <ji...@apache.org> on 2011/08/22 17:22:29 UTC

[jira] [Commented] (MAPREDUCE-2866) "Ignoring 'duplicate' heartbeat from tracker_x:localhost/127.0.0.1:35419'; resending the previous 'lost' response" message is coming continuously for some time

    [ https://issues.apache.org/jira/browse/MAPREDUCE-2866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13088755#comment-13088755 ] 

Devaraj K commented on MAPREDUCE-2866:
--------------------------------------

{code:title=Job Tracker Logs|borderStyle=solid}

2011-08-09 23:15:27,216 INFO  ipc.Server (Server.java:run(1100)) - IPC Server handler 5 on 9001: starting
2011-08-09 23:15:27,252 INFO  ipc.Server (RPC.java:call(570)) - Metrics: call.getMethodName() already registered. Re-fetching the handle
2011-08-09 23:15:27,252 INFO  ipc.Server (RPC.java:call(570)) - Metrics: call.getMethodName() already registered. Re-fetching the handle
2011-08-09 23:15:37,458 INFO  net.NetworkTopology (NetworkTopology.java:add(331)) - Adding a new node: /default-rack/10.18.40.233
2011-08-09 23:15:37,495 INFO  mapred.JobTracker (JobTracker.java:lostTaskTracker(3705)) - Lost tracker : tracker_10.18.40.233:localhost/127.0.0.1:35419
2011-08-09 23:15:38,261 INFO  mapred.JobTracker (JobTracker.java:lostTaskTracker(3705)) - Lost tracker : tracker_10.18.40.233:localhost/127.0.0.1:35419
2011-08-09 23:15:40,499 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:41,266 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:43,501 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:44,269 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:46,503 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:47,271 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:15:49,504 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
......................
2011-08-09 23:20:33,576 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:20:36,577 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:20:38,659 INFO  mapred.JobTracker (JobTracker.java:lostTaskTracker(3705)) - Lost tracker : tracker_10.18.40.233:localhost/127.0.0.1:35419
2011-08-09 23:20:39,579 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:20:40,586 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
......................
2011-08-09 23:21:52,616 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:21:55,617 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:21:55,832 INFO  net.NetworkTopology (NetworkTopology.java:add(331)) - Adding a new node: /default-rack/10.18.52.47
......................
2011-08-09 23:27:14,572 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:27:17,573 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:27:20,575 INFO  mapred.JobTracker (JobTracker.java:heartbeat(2583)) - Ignoring 'duplicate' heartbeat from 'tracker_10.18.40.233:localhost/127.0.0.1:35419'; resending the previous 'lost' response
2011-08-09 23:27:35,649 INFO  net.NetworkTopology (NetworkTopology.java:add(331)) - Adding a new node: /default-rack/10.18.52.7
2011-08-09 23:28:22,240 INFO  mapred.JobTracker (JobTracker.java:initJob(3222)) - Initializing job_201108092309_0001{code} 

*This message has logged continuously for 13 mins in the job tracker logs and there is no info logged in the task tracker logs during this period of time.*

{code:title=Task Tracker Logs|borderStyle=solid}
2011-08-09 23:15:37,439 INFO  ipc.Client (Client.java:handleConnectionFailure(507)) - Retrying connect to server: /10.18.52.47:9000. Already tried 9 time(s).
2011-08-09 23:28:43,312 INFO  mapred.TaskTracker (TaskTracker.java:registerTask(1787)) - LaunchTaskAction (registerTask): attempt_201108092309_0001_m_000004_0 task's state:UNASSIGNED
2011-08-09 23:28:43,315 INFO  mapred.TaskTracker (TaskTracker.java:registerTask(1787)) - LaunchTaskAction (registerTask): attempt_201108092309_0001_m_000005_0 task's state:UNASSIGNED{code} 


> "Ignoring 'duplicate' heartbeat from tracker_x:localhost/127.0.0.1:35419'; resending the previous 'lost' response" message is coming continuously for some time
> ---------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-2866
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2866
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: jobtracker, tasktracker
>    Affects Versions: 0.20.2
>            Reporter: Devaraj K
>            Assignee: Devaraj K
>


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira