Posted to dev@hive.apache.org by "Ashutosh Chauhan (JIRA)" <ji...@apache.org> on 2013/01/13 20:14:12 UTC
[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler
[ https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ashutosh Chauhan updated HIVE-2780:
-----------------------------------
Status: Open (was: Patch Available)
My manually conflict-resolved patch caused a failure in split_sample.q:
{code}
[junit] java.lang.NullPointerException
[junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat$DefaultPercentSampler.sampling(CombineHiveInputFormat.java:596)
[junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampling(CombineHiveInputFormat.java:496)
[junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampleSplits(CombineHiveInputFormat.java:477)
[junit] at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:403)
[junit] at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
[junit] at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
[junit] at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
[junit] at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:448)
[junit] at org.apache.hadoop.hive.ql.exec.ExecDriver.main(ExecDriver.java:682)
[junit] at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
[junit] at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
[junit] at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
[junit] at java.lang.reflect.Method.invoke(Method.java:597)
[junit] at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
[junit] Job Submission failed with exception 'java.lang.NullPointerException(null)'
{code}
Either my resolution wasn't correct or trunk has moved significantly. Navis, if you rebase the patch, I will take a look at this one quickly so that it doesn't go stale again.
> Implement more restrictive table sampler
> ----------------------------------------
>
> Key: HIVE-2780
> URL: https://issues.apache.org/jira/browse/HIVE-2780
> Project: Hive
> Issue Type: Improvement
> Reporter: Navis
> Assignee: Navis
> Priority: Trivial
> Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch
>
>
> Current table sampling scans the whole block, so more rows are included than expected, especially for small tables.
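
To illustrate the description above, here is a minimal sketch of why sampling at whole-block granularity can overshoot on small tables. It is a simplified model, not Hive's actual code: the class BlockSamplingSketch and the methods sampleWholeBlocks and effectivePercent are hypothetical names, and the selection rule (take whole blocks in order until the requested share of bytes is covered) is an assumption made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

public class BlockSamplingSketch {

    // Hypothetical model: pick whole blocks, in order, until the requested
    // percentage of the table's bytes is covered. A block is never cut short.
    static List<Long> sampleWholeBlocks(List<Long> blockSizes, double percent) {
        long total = 0;
        for (long size : blockSizes) {
            total += size;
        }
        long target = (long) Math.ceil(total * percent / 100.0);

        List<Long> picked = new ArrayList<>();
        long covered = 0;
        for (long size : blockSizes) {
            if (covered >= target) {
                break;
            }
            picked.add(size);
            covered += size;
        }
        return picked;
    }

    // Share of the table (in percent) that is actually scanned under this model.
    static double effectivePercent(List<Long> blockSizes, double percent) {
        long total = 0;
        for (long size : blockSizes) {
            total += size;
        }
        long covered = 0;
        for (long size : sampleWholeBlocks(blockSizes, percent)) {
            covered += size;
        }
        return total == 0 ? 0.0 : 100.0 * covered / total;
    }

    public static void main(String[] args) {
        // A small table that fits in a single block (sizes are made up).
        List<Long> smallTable = Arrays.asList(1000L);
        // A larger table made of 100 equally sized blocks.
        List<Long> largeTable = Collections.nCopies(100, 1000L);

        System.out.println("small table, 1% requested: "
                + effectivePercent(smallTable, 1.0) + "% scanned");
        System.out.println("large table, 1% requested: "
                + effectivePercent(largeTable, 1.0) + "% scanned");
    }
}
```

Under this model a table that fits in one block is always scanned in full, whatever percentage is requested, while a table with many blocks comes close to the requested share. A more restrictive sampler would need to stop inside a block rather than at block boundaries.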
--
This message is automatically generated by JIRA.