Posted to dev@hive.apache.org by "Ashutosh Chauhan (JIRA)" <ji...@apache.org> on 2013/01/13 20:14:12 UTC

[jira] [Updated] (HIVE-2780) Implement more restrictive table sampler

     [ https://issues.apache.org/jira/browse/HIVE-2780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ashutosh Chauhan updated HIVE-2780:
-----------------------------------

    Status: Open  (was: Patch Available)

After I resolved the conflicts manually, the patch fails in split_sample.q with the following exception:
{code}
    [junit] java.lang.NullPointerException
    [junit]     at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat$DefaultPercentSampler.sampling(CombineHiveInputFormat.java:596)
    [junit]     at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampling(CombineHiveInputFormat.java:496)
    [junit]     at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.sampleSplits(CombineHiveInputFormat.java:477)
    [junit]     at org.apache.hadoop.hive.ql.io.CombineHiveInputFormat.getSplits(CombineHiveInputFormat.java:403)
    [junit]     at org.apache.hadoop.mapred.JobClient.writeOldSplits(JobClient.java:810)
    [junit]     at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:781)
    [junit]     at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:730)
    [junit]     at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:448)
    [junit]     at org.apache.hadoop.hive.ql.exec.ExecDriver.main(ExecDriver.java:682)
    [junit]     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    [junit]     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
    [junit]     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    [junit]     at java.lang.reflect.Method.invoke(Method.java:597)
    [junit]     at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
    [junit] Job Submission failed with exception 'java.lang.NullPointerException(null)'
{code}
Either my resolution wasn't correct or trunk has moved significantly. Navis, if you rebase the patch, I will review it promptly so that it doesn't go stale again.
                
> Implement more restrictive table sampler
> ----------------------------------------
>
>                 Key: HIVE-2780
>                 URL: https://issues.apache.org/jira/browse/HIVE-2780
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Navis
>            Assignee: Navis
>            Priority: Trivial
>         Attachments: ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.1.patch, ASF.LICENSE.NOT.GRANTED--HIVE-2780.D1623.2.patch
>
>
> Current table sampling scans whole blocks, so more rows are included than expected, especially for small tables.
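
To make the effect in the description concrete, here is a minimal sketch of why whole-block sampling over-selects on small tables. It is not Hive's CombineHiveInputFormat logic; the class BlockSamplingSketch and both method names are hypothetical, and the model assumes a sampler that keeps taking entire blocks until the requested percentage of rows is covered.

```java
// Illustrative model only; all names here are hypothetical, not Hive APIs.
final class BlockSamplingSketch {

    // Number of rows the requested percentage corresponds to, rounded up.
    static long targetRows(long[] rowsPerBlock, double percent) {
        long total = 0;
        for (long rows : rowsPerBlock) {
            total += rows;
        }
        return (long) Math.ceil(total * percent / 100.0);
    }

    // Whole-block sampling: take blocks until the target is covered.
    // Every row of a chosen block is scanned, so the result can overshoot.
    static long rowsScannedWholeBlocks(long[] rowsPerBlock, double percent) {
        long target = targetRows(rowsPerBlock, percent);
        long scanned = 0;
        for (long rows : rowsPerBlock) {
            if (scanned >= target) {
                break;
            }
            scanned += rows;
        }
        return scanned;
    }

    // A more restrictive sampler (assumed behavior): stop reading once the
    // target row count is reached, even in the middle of a block.
    static long rowsScannedWithRowCap(long[] rowsPerBlock, double percent) {
        long target = targetRows(rowsPerBlock, percent);
        long scanned = 0;
        for (long rows : rowsPerBlock) {
            long remaining = target - scanned;
            if (remaining <= 0) {
                break;
            }
            scanned += Math.min(rows, remaining);
        }
        return scanned;
    }

    public static void main(String[] args) {
        long[] smallTable = {1000}; // a single block holding 1000 rows
        System.out.println(rowsScannedWholeBlocks(smallTable, 10.0));
        System.out.println(rowsScannedWithRowCap(smallTable, 10.0));
    }
}
```

In this model a 10 percent sample of a table that fits in one block reads the entire block, while the row-capped variant stops at the target row count. For tables with many blocks the two approaches converge.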

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira