Posted to mapreduce-issues@hadoop.apache.org by "Karam Singh (Created) (JIRA)" <ji...@apache.org> on 2011/10/20 14:25:10 UTC

[jira] [Created] (MAPREDUCE-3230) Data Local maps information not coming Job client CLI or in JobHistory

Data Local maps information not coming  Job client CLI or in JobHistory 
------------------------------------------------------------------------

                 Key: MAPREDUCE-3230
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3230
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mrv2
    Affects Versions: 0.23.0
            Reporter: Karam Singh


The cluster has 350 NMs.
topology.node.switch.mapping.impl is set, to enable rack-locality.
Ran randomwriter/sort and scan jobs.
Both jobs ran with Launched map tasks=16800.

On the Job client, sort reports:
  Launched map tasks=16800
  Launched reduce tasks=700
  Other local map tasks=10
  Rack-local map tasks=16790

On the Job client, scan reports:
  Launched map tasks=16800
  Other local map tasks=50
  Rack-local map tasks=16750

JobHistory files also do not have information about Data Local Maps.

Last month, when I checked out branch-0.23, data-local map information was being reported, e.g.
For sort, with 349 NMs:
  Launched map tasks=16754 
  Launched reduce tasks=700
  Other local map tasks=48
  Data-local map tasks=16459
  Rack-local map tasks=248

For scan, with 349 NMs:
  Launched map tasks=16752
  Other local map tasks=27
  Data-local map tasks=16532
  Rack-local map tasks=193
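
For anyone comparing runs from the client output alone, here is a minimal sketch of how the counter listings above could be checked for a missing "Data-local map tasks" line. The helper names (parse_counters, locality_summary) are hypothetical and not part of Hadoop; the counter labels and numbers are copied from the sort output in this report.

```python
import re

# Matches lines such as "  Data-local map tasks=16459" (surrounding spaces allowed).
COUNTER_LINE = re.compile(r"^\s*([A-Za-z][A-Za-z \-]*?)\s*=\s*(\d+)\s*$")

# Sort job output as pasted in this report (current run, then last month's run).
CURRENT_SORT = """
  Launched map tasks=16800
  Launched reduce tasks=700
  Other local map tasks=10
  Rack-local map tasks=16790
"""

EARLIER_SORT = """
  Launched map tasks=16754
  Launched reduce tasks=700
  Other local map tasks=48
  Data-local map tasks=16459
  Rack-local map tasks=248
"""


def parse_counters(text):
    """Turn 'Name=value' lines into a dict; lines that do not match are ignored."""
    counters = {}
    for line in text.splitlines():
        match = COUNTER_LINE.match(line)
        if match:
            counters[match.group(1)] = int(match.group(2))
    return counters


def locality_summary(counters):
    """Report whether the data-local counter is present and each locality share."""
    launched = counters.get("Launched map tasks", 0)
    data_local = counters.get("Data-local map tasks")  # None when the counter is absent

    def fraction(value):
        # Share of launched maps; None if the counter is missing or nothing launched.
        if value is None or launched == 0:
            return None
        return value / launched

    return {
        "launched": launched,
        "data_local_reported": data_local is not None,
        "data_local_fraction": fraction(data_local),
        "rack_local_fraction": fraction(counters.get("Rack-local map tasks")),
        "other_local_fraction": fraction(counters.get("Other local map tasks")),
    }


if __name__ == "__main__":
    for label, text in (("current", CURRENT_SORT), ("earlier", EARLIER_SORT)):
        print(label, locality_summary(parse_counters(text)))
```

This only inspects the text printed by the client; it says nothing about whether the scheduler actually placed maps data-locally, which is a separate question from whether the counter is reported.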




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Todd Lipcon (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13131760#comment-13131760 ] 

Todd Lipcon commented on MAPREDUCE-3230:
----------------------------------------

I think data locality is broken in 0.23 in general, not just the counters. I will file another JIRA for this.
                
> No information about Data Local maps from Job client CLI and in JobHistory 
> ---------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-3230
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-3230
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: client, mrv2
>    Affects Versions: 0.23.0
>            Reporter: Karam Singh
>            Priority: Blocker
>             Fix For: 0.23.0
>
>
> Size of cluster is 350 NMs. I have topology.node.switch.mapping.impl set to enable rack-locality. Ran randomwriter/sort and scan jobs. Both jobs completed successfully.
> On the Job client, the sort job reports:
> {code}
> Launched map tasks=16800
> Launched reduce tasks=700
> Other local map tasks=10
> Rack-local map tasks=16790
> {code}
> JobHistory files also don't have information about Data Local Maps.
> Until about a month ago, information about data-local maps was reported, like so:
> For sort with 349 NMs:
> {code}
> Launched map tasks=16754 
> Launched reduce tasks=700
> Other local map tasks=48
> Data-local map tasks=16459
> Rack-local map tasks=248
> {code}


        

[jira] [Updated] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Vinod Kumar Vavilapalli (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3230:
-----------------------------------------------

    Description: 
Size of cluster is 350 NMs. I have topology.node.switch.mapping.impl set to enable rack-locality. Ran randomwriter/sort and scan jobs. Both jobs completed successfully.

On the Job client, the sort job reports:
{code}
Launched map tasks=16800
Launched reduce tasks=700
Other local map tasks=10
Rack-local map tasks=16790
{code}

JobHistory files also don't have information about Data Local Maps.

Until about a month ago, information about data-local maps was reported, like so:

For sort with 349 NMs:
{code}
Launched map tasks=16754 
Launched reduce tasks=700
Other local map tasks=48
Data-local map tasks=16459
Rack-local map tasks=248
{code}

  was:
Size of cluster is 350 NMs.
I have topology.node.switch.mapping.impl is set to enable rack-locality. 
Ran randomwriter/sort and scan jobs.
Both jobs ran 
Launched map tasks=16800

On Job client sort says -:
  Launched map tasks=16800
  Launched reduce tasks=700
  Other local map tasks=10
  Rack-local map tasks=16790

On Job client scan says -:
  Launched map tasks=16800
  Other local map tasks=50
  Rack-local map tasks=16750

JobHistory files also does having information of Data Local Maps

There is information data local maps, last month, when I checked out code branch-0.23
data local maps was coming
e.g.
For sort it was with 349 NMs-:
  Launched map tasks=16754 
  Launched reduce tasks=700
  Other local map tasks=48
  Data-local map tasks=16459
  Rack-local map tasks=248

For scan it was with 349 NMs -:
  Launched map tasks=16752
  Other local map tasks=27
  Data-local map tasks=16532
  Rack-local map tasks=193




        Summary: No information about Data Local maps from Job client CLI and in JobHistory   (was: Data Local maps information not coming  on Job client CLI or in JobHistory )

Thanks for filing this, Karam!
                


        

[jira] [Updated] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Vinod Kumar Vavilapalli (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3230:
-----------------------------------------------

      Component/s: client
    Fix Version/s: 0.23.0
    


        

[jira] [Commented] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Arun C Murthy (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13131851#comment-13131851 ] 

Arun C Murthy commented on MAPREDUCE-3230:
------------------------------------------

Looks like MAPREDUCE-2693 should fix this.
                


        

[jira] [Updated] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Vinod Kumar Vavilapalli (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vinod Kumar Vavilapalli updated MAPREDUCE-3230:
-----------------------------------------------

    Priority: Blocker  (was: Major)
    


        

[jira] [Updated] (MAPREDUCE-3230) Data Local maps information not coming on Job client CLI or in JobHistory

Posted by "Karam Singh (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Karam Singh updated MAPREDUCE-3230:
-----------------------------------

    Summary: Data Local maps information not coming  on Job client CLI or in JobHistory   (was: Data Local maps information not coming  Job client CLI or in JobHistory )
    


        

[jira] [Resolved] (MAPREDUCE-3230) No information about Data Local maps from Job client CLI and in JobHistory

Posted by "Arun C Murthy (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/MAPREDUCE-3230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arun C Murthy resolved MAPREDUCE-3230.
--------------------------------------

    Resolution: Not A Problem

Verified MAPREDUCE-2693 fixed it.
                
