You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Matt McCline (JIRA)" <ji...@apache.org> on 2015/08/04 03:20:04 UTC
[jira] [Updated] (HIVE-11415) Add early termination for recursion
in vectorization for deep filter queries
[ https://issues.apache.org/jira/browse/HIVE-11415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Matt McCline updated HIVE-11415:
--------------------------------
Attachment: HIVE-11415.01.patch
Vectorized support for Multi-OR and Multi-AND.
Specifically, the FilterExprOrExpr and FilterExprAndExpr.
> Add early termination for recursion in vectorization for deep filter queries
> ----------------------------------------------------------------------------
>
> Key: HIVE-11415
> URL: https://issues.apache.org/jira/browse/HIVE-11415
> Project: Hive
> Issue Type: Bug
> Reporter: Prasanth Jayachandran
> Assignee: Matt McCline
>
> Queries with deep filters (left deep) throws StackOverflowException in vectorization
> {code}
> Exception in thread "main" java.lang.StackOverflowError
> at java.lang.Class.getAnnotation(Class.java:3415)
> at org.apache.hive.common.util.AnnotationUtils.getAnnotation(AnnotationUtils.java:29)
> at org.apache.hadoop.hive.ql.exec.vector.VectorExpressionDescriptor.getVectorExpressionClass(VectorExpressionDescriptor.java:332)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpressionForUdf(VectorizationContext.java:988)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getGenericUdfVectorExpression(VectorizationContext.java:1164)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpression(VectorizationContext.java:439)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.createVectorExpression(VectorizationContext.java:1014)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getVectorExpressionForUdf(VectorizationContext.java:996)
> at org.apache.hadoop.hive.ql.exec.vector.VectorizationContext.getGenericUdfVectorExpression(VectorizationContext.java:1164)
> {code}
> Sample query:
> {code}
> explain select count(*) from over1k where (
> (t=1 and si=2)
> or (t=2 and si=3)
> or (t=3 and si=4)
> or (t=4 and si=5)
> or (t=5 and si=6)
> or (t=6 and si=7)
> or (t=7 and si=8)
> ...
> ..
> {code}
> repeat the filter for few thousand times for reproduction of the issue.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)