You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2015/08/04 03:20:04 UTC

[jira] [Updated] (HIVE-11415) Add early termination for recursion in vectorization for deep filter queries

     [ https://issues.apache.org/jira/browse/HIVE-11415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt McCline updated HIVE-11415:
--------------------------------
    Attachment: HIVE-11415.01.patch

Vectorized support for Multi-OR and Multi-AND.

Specifically, the FilterExprOrExpr and FilterExprAndExpr.

> Add early termination for recursion in vectorization for deep filter queries
> ----------------------------------------------------------------------------
>
>                 Key: HIVE-11415
>                 URL: https://issues.apache.org/jira/browse/HIVE-11415
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Prasanth Jayachandran
>            Assignee: Matt McCline
>
> Queries with deep filters (left deep) throws StackOverflowException in vectorization
> {code}
> Exception in thread "main" java.lang.StackOverflowError
> 	at java.lang.Class.getAnnotation(Class.java:3415)
> 	at org.apache.hive.common.util.AnnotationUtils.getAnnotation(AnnotationUtils.java:29)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorExpressionDescriptor.getVectorExpressionClass(VectorExpressionDescriptor.java:332)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpressionForUdf(VectorizationContext.java:988)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getGenericUdfVectorExpression(VectorizationContext.java:1164)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpression(VectorizationContext.java:439)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.createVectorExpression(VectorizationContext.java:1014)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpressionForUdf(VectorizationContext.java:996)
> 	at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getGenericUdfVectorExpression(VectorizationContext.java:1164)
> {code}
> Sample query:
> {code}
> explain select count(*) from over1k where (
> (t=1 and si=2)
> or (t=2 and si=3)
> or (t=3 and si=4) 
> or (t=4 and si=5) 
> or (t=5 and si=6) 
> or (t=6 and si=7) 
> or (t=7 and si=8)
> ...
> ..
> {code}
> repeat the filter for few thousand times for reproduction of the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)