You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Robert Hou (JIRA)" <ji...@apache.org> on 2017/10/24 07:28:00 UTC

[jira] [Resolved] (DRILL-5901) Drill test framework can have successful run even if a random failure occurs

     [ https://issues.apache.org/jira/browse/DRILL-5901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Hou resolved DRILL-5901.
-------------------------------
    Resolution: Not A Bug

This is a bug in the Drill Test Framework, not in Drill itself.

> Drill test framework can have successful run even if a random failure occurs
> ----------------------------------------------------------------------------
>
>                 Key: DRILL-5901
>                 URL: https://issues.apache.org/jira/browse/DRILL-5901
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Tools, Build & Test
>    Affects Versions: 1.11.0
>            Reporter: Robert Hou
>
> From Jenkins:
> http://10.10.104.91:8080/view/Nightly/job/TPCH-SF100-baseline/574/console
> Random Failures:
> /root/drillAutomation/framework-master/framework/resources/Advanced/tpch/tpch_sf1/original/parquet/query17.sql
> Query: 
> SELECT
>   SUM(L.L_EXTENDEDPRICE) / 7.0 AS AVG_YEARLY
> FROM
>   lineitem L,
>   part P
> WHERE
>   P.P_PARTKEY = L.L_PARTKEY
>   AND P.P_BRAND = 'BRAND#13'
>   AND P.P_CONTAINER = 'JUMBO CAN'
>   AND L.L_QUANTITY < (
>     SELECT
>       0.2 * AVG(L2.L_QUANTITY)
>     FROM
>       lineitem L2
>     WHERE
>       L2.L_PARTKEY = P.P_PARTKEY
>   )
> Failed with exception
> java.sql.SQLException: SYSTEM ERROR: IllegalStateException: Memory was leaked by query. Memory leaked: (2097152)
> Allocator(op:8:2:6:ParquetRowGroupScan) 1000000/0/7675904/10000000000 (res/actual/peak/limit)
> Fragment 8:2
> [Error Id: f21a2560-7259-4e13-88c2-9bac29e2930a on atsqa6c88.qa.lab:31010]
>   (java.lang.IllegalStateException) Memory was leaked by query. Memory leaked: (2097152)
> Allocator(op:8:2:6:ParquetRowGroupScan) 1000000/0/7675904/10000000000 (res/actual/peak/limit)
>     org.apache.drill.exec.memory.BaseAllocator.close():519
>     org.apache.drill.exec.ops.AbstractOperatorExecContext.close():86
>     org.apache.drill.exec.ops.OperatorContextImpl.close():108
>     org.apache.drill.exec.ops.FragmentContext.suppressingClose():435
>     org.apache.drill.exec.ops.FragmentContext.close():424
>     org.apache.drill.exec.work.fragment.FragmentExecutor.closeOutResources():324
>     org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup():155
>     org.apache.drill.exec.work.fragment.FragmentExecutor.run():267
>     org.apache.drill.common.SelfCleaningRunnable.run():38
>     java.util.concurrent.ThreadPoolExecutor.runWorker():1145
>     java.util.concurrent.ThreadPoolExecutor$Worker.run():615
>     java.lang.Thread.run():744
> 	at org.apache.drill.jdbc.impl.DrillCursor.nextRowInternally(DrillCursor.java:489)
> 	at org.apache.drill.jdbc.impl.DrillCursor.loadInitialSchema(DrillCursor.java:561)
> 	at org.apache.drill.jdbc.impl.DrillResultSetImpl.execute(DrillResultSetImpl.java:1895)
> 	at org.apache.drill.jdbc.impl.DrillResultSetImpl.execute(DrillResultSetImpl.java:61)
> 	at oadd.org.apache.calcite.avatica.AvaticaConnection$1.execute(AvaticaConnection.java:473)
> 	at org.apache.drill.jdbc.impl.DrillMetaImpl.prepareAndExecute(DrillMetaImpl.java:1100)
> 	at oadd.org.apache.calcite.avatica.AvaticaConnection.prepareAndExecuteInternal(AvaticaConnection.java:477)
> 	at org.apache.drill.jdbc.impl.DrillConnectionImpl.prepareAndExecuteInternal(DrillConnectionImpl.java:181)
> 	at oadd.org.apache.calcite.avatica.AvaticaStatement.executeInternal(AvaticaStatement.java:110)
> 	at oadd.org.apache.calcite.avatica.AvaticaStatement.executeQuery(AvaticaStatement.java:130)
> 	at org.apache.drill.jdbc.impl.DrillStatementImpl.executeQuery(DrillStatementImpl.java:112)
> 	at org.apache.drill.test.framework.DrillTestJdbc.executeQuery(DrillTestJdbc.java:206)
> 	at org.apache.drill.test.framework.DrillTestJdbc.run(DrillTestJdbc.java:115)
> 	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
> 	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> 	at java.lang.Thread.run(Thread.java:744)
> Caused by: oadd.org.apache.drill.common.exceptions.UserRemoteException: SYSTEM ERROR: IllegalStateException: Memory was leaked by query. Memory leaked: (2097152)
> Allocator(op:8:2:6:ParquetRowGroupScan) 1000000/0/7675904/10000000000 (res/actual/peak/limit)
> Fragment 8:2
> [Error Id: f21a2560-7259-4e13-88c2-9bac29e2930a on atsqa6c88.qa.lab:31010]
>   (java.lang.IllegalStateException) Memory was leaked by query. Memory leaked: (2097152)
> Allocator(op:8:2:6:ParquetRowGroupScan) 1000000/0/7675904/10000000000 (res/actual/peak/limit)
>     org.apache.drill.exec.memory.BaseAllocator.close():519
>     org.apache.drill.exec.ops.AbstractOperatorExecContext.close():86
>     org.apache.drill.exec.ops.OperatorContextImpl.close():108
>     org.apache.drill.exec.ops.FragmentContext.suppressingClose():435
>     org.apache.drill.exec.ops.FragmentContext.close():424
>     org.apache.drill.exec.work.fragment.FragmentExecutor.closeOutResources():324
>     org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup():155
>     org.apache.drill.exec.work.fragment.FragmentExecutor.run():267
>     org.apache.drill.common.SelfCleaningRunnable.run():38
>     java.util.concurrent.ThreadPoolExecutor.runWorker():1145
>     java.util.concurrent.ThreadPoolExecutor$Worker.run():615
>     java.lang.Thread.run():744
> 	at oadd.org.apache.drill.exec.rpc.user.QueryResultHandler.resultArrived(QueryResultHandler.java:123)
> 	at oadd.org.apache.drill.exec.rpc.user.UserClient.handle(UserClient.java:465)
> 	at oadd.org.apache.drill.exec.rpc.user.UserClient.handle(UserClient.java:102)
> 	at oadd.org.apache.drill.exec.rpc.RpcBus$InboundHandler.decode(RpcBus.java:274)
> 	at oadd.org.apache.drill.exec.rpc.RpcBus$InboundHandler.decode(RpcBus.java:244)
> 	at oadd.io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:88)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:335)
> 	at oadd.io.netty.handler.timeout.IdleStateHandler.channelRead(IdleStateHandler.java:287)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:335)
> 	at oadd.io.netty.handler.codec.MessageToMessageDecoder.channelRead(MessageToMessageDecoder.java:102)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:335)
> 	at oadd.io.netty.handler.codec.ByteToMessageDecoder.fireChannelRead(ByteToMessageDecoder.java:312)
> 	at oadd.io.netty.handler.codec.ByteToMessageDecoder.channelRead(ByteToMessageDecoder.java:286)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:335)
> 	at oadd.io.netty.channel.ChannelInboundHandlerAdapter.channelRead(ChannelInboundHandlerAdapter.java:86)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.fireChannelRead(AbstractChannelHandlerContext.java:335)
> 	at oadd.io.netty.channel.DefaultChannelPipeline$HeadContext.channelRead(DefaultChannelPipeline.java:1294)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:356)
> 	at oadd.io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:342)
> 	at oadd.io.netty.channel.DefaultChannelPipeline.fireChannelRead(DefaultChannelPipeline.java:911)
> 	at oadd.io.netty.channel.nio.AbstractNioByteChannel$NioByteUnsafe.read(AbstractNioByteChannel.java:131)
> 	at oadd.io.netty.channel.nio.NioEventLoop.processSelectedKey(NioEventLoop.java:645)
> 	at oadd.io.netty.channel.nio.NioEventLoop.processSelectedKeysOptimized(NioEventLoop.java:580)
> 	at oadd.io.netty.channel.nio.NioEventLoop.processSelectedKeys(NioEventLoop.java:497)
> 	at oadd.io.netty.channel.nio.NioEventLoop.run(NioEventLoop.java:459)
> 	at oadd.io.netty.util.concurrent.SingleThreadEventExecutor$2.run(SingleThreadEventExecutor.java:131)
> 	... 1 more
> ----------------------------------------------------------------------------------------------------------------
> Summary
> ----------------------------------------------------------------------------------------------------------------
> Random Failures:
> ----------------------------------------------------------------------------------------------------------------
> /root/drillAutomation/framework-master/framework/resources/Advanced/tpch/tpch_sf1/original/parquet/query17.sql
> ----------------------------------------------------------------------------------------------------------------
> Passing tests: 365
> Execution Failures: 0
> Data Verification Failures: 0
> Plan Verification Failures: 0
> Timeouts: 0
> Canceled: 0
> Random Failures: 1
> > TEARING DOWN..
> version	commit_id	commit_message	commit_time	build_email	build_time
> 1.12.0-SNAPSHOT	b0c4e0486d6d4620b04a1bb8198e959d433b4840	DRILL-5876: Use openssl profile to include netty-tcnative dependency with the platform specific classifier	20.10.2017 @ 16:52:35 PDT	Unknown	20.10.2017 @ 16:55:44 PDT
> hostname	user_port	control_port	data_port	current	version
> atsqa6c88.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c82.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c86.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c87.qa.lab	31010	31011	31012	true	1.12.0-SNAPSHOT
> atsqa6c85.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c83.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c81.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> atsqa6c80.qa.lab	31010	31011	31012	false	1.12.0-SNAPSHOT
> ok	summary
> true	ALL updated.
> Session: 0x15d324dfe10ee1e closed
> EventThread shut down
> Session: 0x15d324dfe10ee1f closed
> EventThread shut down
> Running command: /root/drillAutomation/framework-master/framework/scripts/restart_drill_script
> ----------------------------------------------------------------------------------------------------------------
> FINISHED AT Mon Oct 23 10:39:07 PDT 2017
> TOTAL DURATION: 36.21 min
> ----------------------------------------------------------------------------------------------------------------
> ----------------------------------------------------------------------------------------------------------------
> [PostBuildScript] - Execution post build scripts.
> [TPCH-SF100-baseline] $ /bin/sh -xe /tmp/hudson5203439872152590056.sh
> + TAG=jenkins-TPCH-SF100-baseline-574
> + mkdir /home/MAPRTECH/drill/logs/archive/jenkins-TPCH-SF100-baseline-574
> + clush -a 'mkdir /home/MAPRTECH/drill/logs/archive/jenkins-TPCH-SF100-baseline-574/$HOSTNAME'
> + clush -a 'mv /var/log/drill/* /home/MAPRTECH/drill/logs/archive/jenkins-TPCH-SF100-baseline-574/$HOSTNAME'
> Warning: you have no plugins providing access control for builds, so falling back to legacy behavior of permitting any downstream builds to be triggered
> Triggering a new build of Advanced-Baseline
> Triggering a new build of md-1115
> Finished: SUCCESS



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)