Posted to dev@hive.apache.org by "Edward Capriolo (JIRA)" <ji...@apache.org> on 2012/06/28 06:56:44 UTC

[jira] [Updated] (HIVE-3206) FileUtils.tar assumes wrong directory in some cases

     [ https://issues.apache.org/jira/browse/HIVE-3206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Edward Capriolo updated HIVE-3206:
----------------------------------

    Summary: FileUtils.tar assumes wrong directory in some cases (was: Bucket mapjoin in trunk is not working)
    
> FileUtils.tar assumes wrong directory in some cases
> ---------------------------------------------------
>
>                 Key: HIVE-3206
>                 URL: https://issues.apache.org/jira/browse/HIVE-3206
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.10.0
>            Reporter: Navis
>            Assignee: Navis
>         Attachments: hive-3206.1.patch.txt
>
>
> Bucket mapjoin throws an exception while archiving the stored hashtables.
> {noformat}
> hive> set hive.optimize.bucketmapjoin = true;
> hive> select /*+mapjoin(a)*/ a.key, a.value, b.value 
>     > from srcbucket_mapjoin_part a join srcbucket_mapjoin_part_2 b 
>     > on a.key=b.key;
> Total MapReduce jobs = 1
> 12/06/28 12:36:18 WARN conf.HiveConf: DEPRECATED: Ignoring hive-default.xml found on the CLASSPATH at /home/navis/hive/conf/hive-default.xml
> Execution log at: /tmp/navis/navis_20120628123636_5298a863-605c-4b98-bbb3-0a132c85c5a3.log
> 2012-06-28 12:36:18	Starting to launch local task to process map join;	maximum memory = 932118528
> 2012-06-28 12:36:18	Processing rows:	153	Hashtable size:	153	Memory usage:	1771376	rate:	0.002
> 2012-06-28 12:36:18	Dump the hashtable into file: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket22.txt.hashtable
> 2012-06-28 12:36:18	Upload 1 File to: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket22.txt.hashtable File size: 9644
> 2012-06-28 12:36:19	Processing rows:	309	Hashtable size:	156	Memory usage:	1844568	rate:	0.002
> 2012-06-28 12:36:19	Dump the hashtable into file: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket23.txt.hashtable
> 2012-06-28 12:36:19	Upload 1 File to: file:/tmp/navis/hive_2012-06-28_12-36-17_003_3016196240171705142/-local-10002/HashTable-Stage-1/MapJoin-a-00-srcbucket23.txt.hashtable File size: 10023
> 2012-06-28 12:36:19	End of local task; Time Taken: 0.773 sec.
> Execution completed successfully
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Mapred Local Task Succeeded . Convert the Join into MapJoin
> Launching Job 1 out of 1
> Number of reduce tasks is set to 0 since there's no reduce operator
> java.io.IOException: This archives contains unclosed entries.
> 	at org.apache.commons.compress.archivers.tar.TarArchiveOutputStream.finish(TarArchiveOutputStream.java:214)
> 	at org.apache.hadoop.hive.common.FileUtils.tar(FileUtils.java:276)
> 	at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:391)
> 	at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:137)
> 	at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134)
> 	at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
> 	at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1324)
> 	at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1110)
> 	at org.apache.hadoop.hive.ql.Driver.run(Driver.java:944)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:258)
> 	at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:215)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:406)
> 	at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:744)
> 	at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:607)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> 	at java.lang.reflect.Method.invoke(Method.java:597)
> 	at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> Job Submission failed with exception 'java.io.IOException(This archives contains unclosed entries.)'
> java.lang.IllegalArgumentException: Can not create a Path from an empty string
> 	at org.apache.hadoop.fs.Path.checkPathArg(Path.java:82)
> 	at org.apache.hadoop.fs.Path.<init>(Path.java:90)
> 	at org.apache.hadoop.hive.ql.exec.Utilities.getHiveJobID(Utilities.java:380)
> 	at org.apache.hadoop.hive.ql.exec.Utilities.clearMapRedWork(Utilities.java:193)
> 	at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:460)
> 	at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:137)
> 	at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134)
> 	at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
> 	at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1324)
> 	at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1110)
> 	at org.apache.hadoop.hive.ql.Driver.run(Driver.java:944)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:258)
> 	at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:215)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:406)
> 	at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:744)
> 	at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:607)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> 	at java.lang.reflect.Method.invoke(Method.java:597)
> 	at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask
> {noformat}
> This seems to be a regression from HIVE-3128.
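
For readers following along, here is a minimal sketch of the kind of directory mix-up the new summary hints at. It is an illustration under stated assumptions, not the code in FileUtils.tar or in the attached patch: the class TarPathSketch and the helper resolve() are hypothetical names, and the sketch uses only the JDK. The idea is that a bare file name such as a dumped hashtable resolves against the JVM working directory unless it is explicitly joined to the directory that holds it. One plausible (unconfirmed) reading of the trace is that a file could not be read from the assumed directory after its tar entry had been opened, so finish() found an entry still unclosed.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

class TarPathSketch {
    // Hypothetical helper: join a bare file name to the directory that
    // actually contains it, instead of relying on the working directory.
    static File resolve(String parentDir, String name) {
        return new File(parentDir, name);
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for the local HashTable-Stage-1 directory from the log.
        Path dir = Files.createTempDirectory("HashTable-Stage-1");
        String name = "MapJoin-a-00-srcbucket22.txt.hashtable";
        Files.createFile(dir.resolve(name));

        // Resolved against the JVM working directory (the assumed mistake).
        File relativeToCwd = new File(name);
        // Resolved against the directory that holds the dumped hashtable.
        File relativeToParent = resolve(dir.toString(), name);

        System.out.println("cwd-relative exists:    " + relativeToCwd.exists());
        System.out.println("parent-relative exists: " + relativeToParent.exists());
    }
}
```

If this reading is right, opening every input through something like resolve() before adding it to the archive would avoid the failed read; whether the actual fix takes that shape is for the patch to say.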
