You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Akira AJISAKA (JIRA)" <ji...@apache.org> on 2016/06/12 17:39:21 UTC
[jira] [Comment Edited] (MAPREDUCE-6643)
org.apache.hadoop.mapred.TestTextInputFormat.testSplitableCodecs failed
[ https://issues.apache.org/jira/browse/MAPREDUCE-6643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15326554#comment-15326554 ]
Akira AJISAKA edited comment on MAPREDUCE-6643 at 6/12/16 5:38 PM:
-------------------------------------------------------------------
Thanks [~lewuathe] for updating the patch! Mostly looks good to me. Some comments:
{code}
if (start == 0) {
adjStart = Math.max(0L, start - FIRST_BZIP2_BLOCK_MARKER_POSITION);
{code}
1. It seems that adjStart is always zero. If that is true, we should set zero instead of using max function.
{code}
// Other than the fist of file, the marker size is 6 bytes.
{code}
2. fist -> first
3. Would you fix the checkstyle warning?
{code}
// corner case when we have byte alignment and position of stream are same
int[] lengths = {471507, 473608};
int[] numSplits = {218, 110};
for (int i = 0; i < 2; i++) {
verifyPartitions(lengths[i], numSplits[i], file, codec, conf);
}
{code}
4. (nit) The following code is simpler for me:
{code}
// corner case when we have byte alignment and position of stream are same
verifyPartitions(471507, 218, file, codec, conf);
verifyPartitions(473608, 110, file, codec, conf);
{code}
was (Author: ajisakaa):
Thanks [~lewuathe] for updating the patch! Mostly looks good to me. Some comments:
{code}
if (start == 0) {
adjStart = Math.max(0L, start - FIRST_BZIP2_BLOCK_MARKER_POSITION);
{code}
1. It seems that adjStart is always zero. If that is true, we should set zero instead of using max function.
{code}
// Other than the fist of file, the marker size is 6 bytes.
{code}
2. fist -> first
3. Would you fix the checkstyle warning?
{code}
// corner case when we have byte alignment and position of stream are same
int[] lengths = {471507, 473608};
int[] numSplits = {218, 110};
for (int i = 0; i < 2; i++) {
verifyPartitions(lengths[i], numSplits[i], file, codec, conf);
}
{code}
4. (nit) The following code is simpler for me:
{code}
// corner case when we have byte alignment and position of stream are same
verifyPartition(471507, 218, file, codec, conf);
verifyPartition(473608, 110, file, codec, conf);
{code}
> org.apache.hadoop.mapred.TestTextInputFormat.testSplitableCodecs failed
> -----------------------------------------------------------------------
>
> Key: MAPREDUCE-6643
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-6643
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Reporter: Haibo Chen
> Assignee: Kai Sasaki
> Attachments: MAPREDUCE-6643.01.patch, MAPREDUCE-6643.02.patch, MAPREDUCE-6643.03.patch
>
>
> Unit test TestTextInputFormat.testSplitableCodecs() failed when the seed is 1313094493.
> Stacktrace
> java.lang.AssertionError: Key in multiple partitions.
> at org.junit.Assert.fail(Assert.java:88)
> at org.junit.Assert.assertTrue(Assert.java:41)
> at org.junit.Assert.assertFalse(Assert.java:64)
> at org.apache.hadoop.mapred.TestTextInputFormat.testSplitableCodecs(TestTextInputFormat.java:223)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-issues-unsubscribe@hadoop.apache.org
For additional commands, e-mail: mapreduce-issues-help@hadoop.apache.org