You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Hive QA (JIRA)" <ji...@apache.org> on 2014/07/23 18:03:41 UTC
[jira] [Commented] (HIVE-4788) RCFile and bzip2 compression not
working
[ https://issues.apache.org/jira/browse/HIVE-4788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14071891#comment-14071891 ]
Hive QA commented on HIVE-4788:
-------------------------------
{color:red}Overall{color}: -1 at least one tests failed
Here are the results of testing the latest attachment:
https://issues.apache.org/jira/secure/attachment/12657276/HIVE-4788.1.patch.txt
{color:red}ERROR:{color} -1 due to 4 failed/errored test(s), 5737 tests executed
*Failed tests:*
{noformat}
org.apache.hadoop.hive.cli.TestMinimrCliDriver.testCliDriver_ql_rewrite_gbtoidx
org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_authorization_fail_8
org.apache.hive.jdbc.TestJdbcDriver2.testParentReferences
org.apache.hive.jdbc.miniHS2.TestHiveServer2.testConnection
{noformat}
Test results: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/20/testReport
Console output: http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-TRUNK-Build/20/console
Test logs: http://ec2-174-129-184-35.compute-1.amazonaws.com/logs/PreCommit-HIVE-TRUNK-Build-20/
Messages:
{noformat}
Executing org.apache.hive.ptest.execution.PrepPhase
Executing org.apache.hive.ptest.execution.ExecutionPhase
Executing org.apache.hive.ptest.execution.ReportingPhase
Tests exited with: TestsFailedException: 4 tests failed
{noformat}
This message is automatically generated.
ATTACHMENT ID: 12657276
> RCFile and bzip2 compression not working
> ----------------------------------------
>
> Key: HIVE-4788
> URL: https://issues.apache.org/jira/browse/HIVE-4788
> Project: Hive
> Issue Type: Bug
> Components: Compression
> Affects Versions: 0.10.0
> Environment: CDH4.2
> Reporter: Johndee Burks
> Assignee: Navis
> Priority: Minor
> Attachments: HIVE-4788.1.patch.txt
>
>
> The issue is that Bzip2 compressed rcfile data is encountering an error when being queried even the most simple query "select *". The issue is easily reproducible using the following.
> Create a table and load the sample data below.
> DDL: create table source_data (a string, b string) row format delimited fields terminated by ',';
> Sample data:
> apple,sauce
> Test:
> Do the following and you should receive the error listed below for the rcfile table with bz2 compression.
> create table rc_nobz2 (a string, b string) stored as rcfile;
> insert into table rc_nobz2 select * from source_txt;
> SET io.seqfile.compression.type=BLOCK;
> SET hive.exec.compress.output=true;
> SET mapred.compress.map.output=true;
> SET mapred.output.compress=true;
> SET mapred.output.compression.codec=org.apache.hadoop.io.compress.BZip2Codec;
> create table rc_bz2 (a string, b string) stored as rcfile;
> insert into table rc_bz2 select * from source_txt;
> hive> select * from rc_bz2;
> Failed with exception java.io.IOException:java.io.IOException: Stream is not BZip2 formatted: expected 'h' as first byte but got ''
> hive> select * from rc_nobz2;
> apple sauce
--
This message was sent by Atlassian JIRA
(v6.2#6252)