Posted to mapreduce-issues@hadoop.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/05/02 19:04:00 UTC

[jira] [Work logged] (MAPREDUCE-7376) AggregateWordCount fetches wrong results

     [ https://issues.apache.org/jira/browse/MAPREDUCE-7376?focusedWorklogId=765105&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-765105 ]

ASF GitHub Bot logged work on MAPREDUCE-7376:
---------------------------------------------

                Author: ASF GitHub Bot
            Created on: 02/May/22 19:03
            Start Date: 02/May/22 19:03
    Worklog Time Spent: 10m 
      Work Description: ayushtkn opened a new pull request, #4257:
URL: https://github.com/apache/hadoop/pull/4257

   ### Description of PR
   
   Fixes AggregateWordCount so that it counts each word instead of emitting a single `record_count` line.
   
   ### How was this patch tested?
   ```
   hadoop-3.4.0-SNAPSHOT % bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.4.0-SNAPSHOT.jar aggregatewordcount /testData /testOut 1 textinputformat
   
   hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut/part-r-00000
   Bye	1
   Goodbye	1
   Hadoop	2
   Hello	2
   World	2
   ```
   `/testData` had two files:
   `wc01.txt`:
   Hello World Bye World
   
   `wc02.txt`:
   Hello Hadoop Goodbye Hadoop
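
As a local cross-check of the expected counts (not part of the patch, and not a substitute for running the job), the same totals can be derived from the two input files with standard Unix tools. This is only a sketch: the scratch directory and the `workdir` and `expected` variable names are illustrative assumptions, and nothing here exercises Hadoop itself.

```shell
# Recreate the two input files described above in a scratch directory.
workdir="$(mktemp -d)"
printf 'Hello World Bye World\n' > "$workdir/wc01.txt"
printf 'Hello Hadoop Goodbye Hadoop\n' > "$workdir/wc02.txt"

# Put one word per line, count each distinct word, and print "word<TAB>count".
# LC_ALL=C forces byte-order sorting so the word order is reproducible.
expected="$(cat "$workdir"/wc01.txt "$workdir"/wc02.txt \
  | tr -s ' ' '\n' \
  | LC_ALL=C sort \
  | uniq -c \
  | awk '{ printf "%s\t%s\n", $2, $1 }')"

printf '%s\n' "$expected"
```

The printed lines can then be compared by eye, or with `diff`, against the output of `bin/hdfs dfs -cat /testOut/part-r-00000` shown above.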
   
   ### For code changes:
   
   - [ ] Does the title of this PR start with the corresponding JIRA issue ID (e.g. 'HADOOP-17799. Your PR title ...')?




Issue Time Tracking
-------------------

            Worklog Id:     (was: 765105)
    Remaining Estimate: 0h
            Time Spent: 10m

> AggregateWordCount fetches wrong results
> ----------------------------------------
>
>                 Key: MAPREDUCE-7376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Ayush Saxena
>            Assignee: Ayush Saxena
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Instead of counting the words, AggregateWordCount produces a single line of output that counts the number of rows.
> The wrong result looks like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000    
> record_count 2
> {noformat}
> The correct result should look like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000
> Bye	1
> Goodbye	1
> Hadoop	2
> Hello	2
> World	2
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
