You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@asterixdb.apache.org by "Taewoo Kim (JIRA)" <ji...@apache.org> on 2016/02/06 01:43:39 UTC

[jira] [Resolved] (ASTERIXDB-1250) NPE in BTree self index join

     [ https://issues.apache.org/jira/browse/ASTERIXDB-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Taewoo Kim resolved ASTERIXDB-1250.
-----------------------------------
    Resolution: Fixed

> NPE in BTree self index join
> ----------------------------
>
>                 Key: ASTERIXDB-1250
>                 URL: https://issues.apache.org/jira/browse/ASTERIXDB-1250
>             Project: Apache AsterixDB
>          Issue Type: Bug
>            Reporter: Yingyi Bu
>            Assignee: Taewoo Kim
>
> DDLs:
> {noformat}
> drop dataverse test if exists;
> create dataverse test;
> use dataverse test;
> create type TwitterUserType as closed {
>     screen-name: string,
>     lang: string,
>     friends-count: int64,
>     statuses-count: int64,
>     name: string,
>     followers-count: int64
> }
> create type TweetMessageType as closed {
>     tweetid: int64,
>         user: TwitterUserType,
>         sender-location: point,
>     send-time: datetime,
>         referred-topics: {{ string }},
>     message-text: string,
>     countA: int64,
>     countB: int64
> }
> create dataset TweetMessages(TweetMessageType)
> primary key tweetid;
> create index twmSndLocIx on TweetMessages(sender-location) type rtree;
> create index msgCountAIx on TweetMessages(countA) type btree;
> create index msgCountBIx on TweetMessages(countB) type btree;
> create index msgTextIx on TweetMessages(message-text) type keyword;
> {noformat}
> Query:
> {noformat}
> for $t1 in dataset('TweetMessages')
> for $t2 in dataset('TweetMessages')
> let $c := $t1.countA + 20 
> where $c /* +indexnl */= $t2.countB
> return {"tweetid2": $t2.tweetid, "count2":$t2.countB};
> {noformat}
> Exception:
> {noformat}
> java.lang.NullPointerException
> 	at org.apache.asterix.om.util.NonTaggedFormatUtil.isOptional(NonTaggedFormatUtil.java:96)
> 	at org.apache.asterix.metadata.entities.Index.getNonNullableType(Index.java:137)
> 	at org.apache.asterix.optimizer.rules.am.AbstractIntroduceAccessMethodRule.isMatched(AbstractIntroduceAccessMethodRule.java:325)
> 	at org.apache.asterix.optimizer.rules.am.AbstractIntroduceAccessMethodRule.pruneIndexCandidates(AbstractIntroduceAccessMethodRule.java:277)
> 	at org.apache.asterix.optimizer.rules.am.AbstractIntroduceAccessMethodRule.pruneIndexCandidates(AbstractIntroduceAccessMethodRule.java:119)
> 	at org.apache.asterix.optimizer.rules.am.IntroduceJoinAccessMethodRule.rewritePost(IntroduceJoinAccessMethodRule.java:141)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.AbstractRuleController.rewriteOperatorRef(AbstractRuleController.java:125)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.AbstractRuleController.rewriteOperatorRef(AbstractRuleController.java:99)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.AbstractRuleController.rewriteOperatorRef(AbstractRuleController.java:99)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.AbstractRuleController.rewriteOperatorRef(AbstractRuleController.java:99)
> 	at org.apache.hyracks.algebricks.compiler.rewriter.rulecontrollers.SequentialFixpointRuleController.rewriteWithRuleCollection(SequentialFixpointRuleController.java:53)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.HeuristicOptimizer.runOptimizationSets(HeuristicOptimizer.java:95)
> 	at org.apache.hyracks.algebricks.core.rewriter.base.HeuristicOptimizer.optimize(HeuristicOptimizer.java:82)
> 	at org.apache.hyracks.algebricks.compiler.api.HeuristicCompilerFactoryBuilder$1$1.optimize(HeuristicCompilerFactoryBuilder.java:87)
> 	at org.apache.asterix.api.common.APIFramework.compileQuery(APIFramework.java:289)
> 	at org.apache.asterix.aql.translator.QueryTranslator.rewriteCompileQuery(QueryTranslator.java:1894)
> 	at org.apache.asterix.aql.translator.QueryTranslator.handleQuery(QueryTranslator.java:2469)
> 	at org.apache.asterix.aql.translator.QueryTranslator.compileAndExecute(QueryTranslator.java:383)
> 	at org.apache.asterix.api.http.servlet.APIServlet.doPost(APIServlet.java:148)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:754)
> 	at javax.servlet.http.HttpServlet.service(HttpServlet.java:847)
> 	at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:546)
> 	at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:483)
> 	at org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandler.java:231)
> 	at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:970)
> 	at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:411)
> 	at org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandler.java:192)
> 	at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:904)
> 	at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:117)
> 	at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:110)
> 	at org.eclipse.jetty.server.Server.handle(Server.java:347)
> 	at org.eclipse.jetty.server.HttpConnection.handleRequest(HttpConnection.java:439)
> 	at org.eclipse.jetty.server.HttpConnection$RequestHandler.content(HttpConnection.java:924)
> 	at org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:781)
> 	at org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:220)
> 	at org.eclipse.jetty.server.AsyncHttpConnection.handle(AsyncHttpConnection.java:43)
> 	at org.eclipse.jetty.io.nio.SelectChannelEndPoint.handle(SelectChannelEndPoint.java:545)
> 	at org.eclipse.jetty.io.nio.SelectChannelEndPoint$1.run(SelectChannelEndPoint.java:43)
> 	at org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool.java:529)
> 	at java.lang.Thread.run(Thread.java:745)
> {noformat}
> The query works fine without a indexnl hint and uses hash join:
> {noformat}
> for $t1 in dataset('TweetMessages')
> for $t2 in dataset('TweetMessages')
> let $c := $t1.countA + 20 
> where $c = $t2.countB
> return {"tweetid2": $t2.tweetid, "count2":$t2.countB};
> {noformat}
> It seems to me when we have the "indexnl" hint, using hash join is acceptable but NPE is not acceptable.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)