You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2013/01/09 11:25:22 UTC
[jira] [Commented] (HIVE-2778) Fail on table sampling

    [ https://issues.apache.org/jira/browse/HIVE-2778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13547909#comment-13547909 ] 

Hudson commented on HIVE-2778:
------------------------------

Integrated in Hive-trunk-hadoop2 #54 (See [https://builds.apache.org/job/Hive-trunk-hadoop2/54/])
    HIVE-2778 [jira] Fail on table sampling
(Navis Ryu via Carl Steinbach)

Summary:
HIVE-2778 fix NPE on table sampling

Trying table sampling on any non-empty table throws NPE. This does not occur by
test on mini-MR.  <div class="preformatted panel" style="border-width:
1px;"><div class="preformattedContent panelContent"> <pre>select count(*) from
emp tablesample (0.1 percent);      Total MapReduce jobs = 1 Launching Job 1 out
of 1 Number of reduce tasks determined at compile time: 1 In order to change the
average load for a reducer (in bytes):   set
hive.exec.reducers.bytes.per.reducer=<number> In order to limit the maximum
number of reducers:   set hive.exec.reducers.max=<number> In order to set a
constant number of reducers:   set mapred.reduce.tasks=<number>
java.lang.NullPointerException 	at
org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampleSplits(CombineHiveInputFormat.java:450)
	at
org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:403)
	at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:971) 	at
org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:963) 	at
org.apache.hadoop.mapred.JobClient.access$500(JobClient.java:170) 	at
org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:880) 	at
org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:833) 	at
java.security.AccessController.doPrivileged(Native Method) 	at
javax.security.auth.Subject.doAs(Subject.java:396) 	at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
	at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:833) 	at
org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:807) 	at
org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:432) 	at
org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:136) 	at
org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134) 	at
org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57) 	at
org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1332) 	at
org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1123) 	at
org.apache.hadoop.hive.ql.Driver.run(Driver.java:931) 	at
org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:255) 	at
org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:212) 	at
org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:403) 	at
org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:671) 	at
org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:554) 	at
sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 	at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
	at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
	at java.lang.reflect.Method.invoke(Method.java:597) 	at
org.apache.hadoop.util.RunJar.main(RunJar.java:186) Job Submission failed with
exception 'java.lang.NullPointerException(null)' FAILED: Execution Error, return
code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask  </pre> </div></div>

Test Plan: EMPTY

Reviewers: JIRA, cwsteinbach

Reviewed By: cwsteinbach

Differential Revision: https://reviews.facebook.net/D1593 (Revision 1301310)

     Result = ABORTED
cws : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1301310
Files : 
* /hive/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/CombineHiveInputFormat.java

                
> Fail on table sampling 
> -----------------------
>
>                 Key: HIVE-2778
>                 URL: https://issues.apache.org/jira/browse/HIVE-2778
>             Project: Hive
>          Issue Type: Bug
>          Components: Query Processor
>    Affects Versions: 0.9.0
>         Environment: Reproduced only on hadoop-0.20.2-CDH3u1, work fine on hadoop-0.20.2
>            Reporter: Navis
>            Assignee: Navis
>             Fix For: 0.9.0
>
>         Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2778.D1593.1.patch, ASF.LICENSE.NOT.GRANTED--HIVE-2778.D1593.2.patch, HIVE-2778.D1593.2.patch
>
>
> Trying table sampling on any non-empty table throws NPE. This does not occur by test on mini-MR.
> {noformat}
> select count(*) from emp tablesample (0.1 percent);     
> Total MapReduce jobs = 1
> Launching Job 1 out of 1
> Number of reduce tasks determined at compile time: 1
> In order to change the average load for a reducer (in bytes):
>   set hive.exec.reducers.bytes.per.reducer=<number>
> In order to limit the maximum number of reducers:
>   set hive.exec.reducers.max=<number>
> In order to set a constant number of reducers:
>   set mapred.reduce.tasks=<number>
> java.lang.NullPointerException
> 	at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampleSplits(CombineHiveInputFormat.java:450)
> 	at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:403)
> 	at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:971)
> 	at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:963)
> 	at org.apache.hadoop.mapred.JobClient.access$500(JobClient.java:170)
> 	at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:880)
> 	at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:833)
> 	at java.security.AccessController.doPrivileged(Native Method)
> 	at javax.security.auth.Subject.doAs(Subject.java:396)
> 	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
> 	at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:833)
> 	at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:807)
> 	at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:432)
> 	at org.apache.hadoop.hive.ql.exec.MapRedTask.execute(MapRedTask.java:136)
> 	at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:134)
> 	at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:57)
> 	at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:1332)
> 	at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:1123)
> 	at org.apache.hadoop.hive.ql.Driver.run(Driver.java:931)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLocalCmd(CliDriver.java:255)
> 	at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:212)
> 	at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:403)
> 	at org.apache.hadoop.hive.cli.CliDriver.run(CliDriver.java:671)
> 	at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:554)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
> 	at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
> 	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
> 	at java.lang.reflect.Method.invoke(Method.java:597)
> 	at org.apache.hadoop.util.RunJar.main(RunJar.java:186)
> Job Submission failed with exception 'java.lang.NullPointerException(null)'
> FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MapRedTask
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira