You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/05/02 19:04:00 UTC
[jira] [Work logged] (MAPREDUCE-7376) AggregateWordCount fetches wrong results
[ https://issues.apache.org/jira/browse/MAPREDUCE-7376?focusedWorklogId=765105&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-765105 ]
ASF GitHub Bot logged work on MAPREDUCE-7376:
---------------------------------------------
Author: ASF GitHub Bot
Created on: 02/May/22 19:03
Start Date: 02/May/22 19:03
Worklog Time Spent: 10m
Work Description: ayushtkn opened a new pull request, #4257:
URL: https://github.com/apache/hadoop/pull/4257
### Description of PR
Fixes AggregateWordCount
### How was this patch tested?
```
hadoop-3.4.0-SNAPSHOT % bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.4.0-SNAPSHOT.jar aggregatewordcount /testData /testOut 1 textinputformat
hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut/part-r-00000
Bye 1
Goodbye 1
Hadoop 2
Hello 2
World 2
```
``/testData`` had two files:
`wc01.txt:`
Hello World Bye World
`wc02.txt:`
Hello Hadoop Goodbye Hadoop
### For code changes:
- [ ] Does the title or this PR starts with the corresponding JIRA issue id (e.g. 'HADOOP-17799. Your PR title ...')?
Issue Time Tracking
-------------------
Worklog Id: (was: 765105)
Remaining Estimate: 0h
Time Spent: 10m
> AggregateWordCount fetches wrong results
> ----------------------------------------
>
> Key: MAPREDUCE-7376
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-7376
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Ayush Saxena
> Assignee: Ayush Saxena
> Priority: Major
> Time Spent: 10m
> Remaining Estimate: 0h
>
> AggregateWordCount rather than counting the words, gives a single line output counting the number of rows
> Wrong Result Looks Like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000
> record_count 2
> {noformat}
> Correct Should Look Like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000
> Bye 1
> Goodbye 1
> Hadoop 2
> Hello 2
> World 2
> {noformat}
--
This message was sent by Atlassian Jira
(v8.20.7#820007)
---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-help@hadoop.apache.org