Posted to issues@drill.apache.org by "Robert Hou (JIRA)" <ji...@apache.org> on 2018/07/20 00:20:00 UTC

[jira] [Updated] (DRILL-6623) Drill encounters exception IndexOutOfBoundsException: writerIndex: -8373248 (expected: readerIndex(0) <= writerIndex <= capacity(32768))

     [ https://issues.apache.org/jira/browse/DRILL-6623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Hou updated DRILL-6623:
------------------------------
    Attachment: 24aedae9-d1f3-8e12-2e1f-0479915c61b1.sys.drill

> Drill encounters exception IndexOutOfBoundsException: writerIndex: -8373248 (expected: readerIndex(0) <= writerIndex <= capacity(32768))
> ----------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: DRILL-6623
>                 URL: https://issues.apache.org/jira/browse/DRILL-6623
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Relational Operators
>    Affects Versions: 1.14.0
>            Reporter: Robert Hou
>            Assignee: Karthikeyan Manivannan
>            Priority: Major
>         Attachments: 24aedae9-d1f3-8e12-2e1f-0479915c61b1.sys.drill, drillbit.log.61b1
>
>
> This is the query:
> {noformat}
> alter session set `planner.width.max_per_node` = 1;
> alter session set `planner.width.max_per_query` = 1;
> select * from (
>   select
>     split_part(CharacterValuea, '8', 1) CharacterValuea,
>     split_part(CharacterValueb, '8', 1) CharacterValueb,
>     split_part(CharacterValuec, '8', 2) CharacterValuec,
>     split_part(CharacterValued, '8', 3) CharacterValued,
>     split_part(CharacterValuee, 'b', 1) CharacterValuee
>   from (select * from dfs.`/drill/testdata/batch_memory/character5_1MB_1GB.parquet` order by CharacterValuea) d
>   where d.CharacterValuea = '1234567890123110');
> {noformat}
> The query works with a smaller table.
> This is the stack trace:
> {noformat}
> 2018-07-19 16:59:48,803 [24aedae9-d1f3-8e12-2e1f-0479915c61b1:frag:0:0] ERROR o.a.d.e.w.fragment.FragmentExecutor - SYSTEM ERROR: IndexOutOfBoundsException: writerIndex: -8373248 (expected: readerIndex(0) <= writerIndex <= capacity(32768))
> Fragment 0:0
> [Error Id: edc75560-41ca-4fdd-907f-060be1795786 on qa-node186.qa.lab:31010]
> org.apache.drill.common.exceptions.UserException: SYSTEM ERROR: IndexOutOfBoundsException: writerIndex: -8373248 (expected: readerIndex(0) <= writerIndex <= capacity(32768))
> Fragment 0:0
> [Error Id: edc75560-41ca-4fdd-907f-060be1795786 on qa-node186.qa.lab:31010]
> 	at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:633) ~[drill-common-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:361) [drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor.cleanup(FragmentExecutor.java:216) [drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:327) [drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [na:1.8.0_161]
> 	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [na:1.8.0_161]
> 	at java.lang.Thread.run(Thread.java:748) [na:1.8.0_161]
> Caused by: java.lang.IndexOutOfBoundsException: writerIndex: -8373248 (expected: readerIndex(0) <= writerIndex <= capacity(32768))
> 	at io.netty.buffer.AbstractByteBuf.writerIndex(AbstractByteBuf.java:104) ~[netty-buffer-4.0.48.Final.jar:4.0.48.Final]
> 	at org.apache.drill.exec.vector.VarCharVector$Mutator.setValueCount(VarCharVector.java:810) ~[vector-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.vector.NullableVarCharVector$Mutator.setValueCount(NullableVarCharVector.java:641) ~[vector-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.setValueCount(ProjectRecordBatch.java:329) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.doWork(ProjectRecordBatch.java:242) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractUnaryRecordBatch.innerNext(AbstractUnaryRecordBatch.java:117) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:142) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:172) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:119) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:109) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractUnaryRecordBatch.innerNext(AbstractUnaryRecordBatch.java:63) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.innerNext(ProjectRecordBatch.java:142) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.record.AbstractRecordBatch.next(AbstractRecordBatch.java:172) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:103) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.innerNext(ScreenCreator.java:83) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:93) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:294) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:281) ~[drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	at java.security.AccessController.doPrivileged(Native Method) ~[na:1.8.0_161]
> 	at javax.security.auth.Subject.doAs(Subject.java:422) ~[na:1.8.0_161]
> 	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1595) ~[hadoop-common-2.7.0-mapr-1707.jar:na]
> 	at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:281) [drill-java-exec-1.14.0-SNAPSHOT.jar:1.14.0-SNAPSHOT]
> 	... 4 common frames omitted
> {noformat}
> This is the explain plan:
> {noformat}
> | 00-00    Screen : rowType = RecordType(ANY CharacterValuea, ANY CharacterValueb, ANY CharacterValuec, ANY CharacterValued, ANY CharacterValuee): rowcount = 9216000.0, cumulative cost = {5.815296E8 rows, 8.786270178575306E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3374
> 00-01      ProjectAllowDup(CharacterValuea=[$0], CharacterValueb=[$1], CharacterValuec=[$2], CharacterValued=[$3], CharacterValuee=[$4]) : rowType = RecordType(ANY CharacterValuea, ANY CharacterValueb, ANY CharacterValuec, ANY CharacterValued, ANY CharacterValuee): rowcount = 9216000.0, cumulative cost = {5.80608E8 rows, 8.785348578575306E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3373
> 00-02        Project(CharacterValuea=[SPLIT_PART(ITEM($0, 'CharacterValuea'), '8', 1)], CharacterValueb=[SPLIT_PART(ITEM($0, 'CharacterValueb'), '8', 1)], CharacterValuec=[SPLIT_PART(ITEM($0, 'CharacterValuec'), '8', 2)], CharacterValued=[SPLIT_PART(ITEM($0, 'CharacterValued'), '8', 3)], CharacterValuee=[SPLIT_PART(ITEM($0, 'CharacterValuee'), 'b', 1)]) : rowType = RecordType(ANY CharacterValuea, ANY CharacterValueb, ANY CharacterValuec, ANY CharacterValued, ANY CharacterValuee): rowcount = 9216000.0, cumulative cost = {5.71392E8 rows, 8.739268578575306E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3372
> 00-03          SelectionVectorRemover : rowType = RecordType(DYNAMIC_STAR T3¦¦**): rowcount = 9216000.0, cumulative cost = {5.62176E8 rows, 8.554948578575305E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3371
> 00-04            Filter(condition=[=(ITEM($0, 'CharacterValuea'), '1234567890123110')]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**): rowcount = 9216000.0, cumulative cost = {5.5296E8 rows, 8.545732578575305E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3370
> 00-05              Project(T3¦¦**=[$0]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**): rowcount = 6.144E7, cumulative cost = {4.9152E8 rows, 8.263108578575305E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3369
> 00-06                SingleMergeExchange(sort0=[1]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {4.3008E8 rows, 8.201668578575305E9 cpu, 0.0 io, 1.00663296E12 network, 9.8304E8 memory}, id = 3368
> 01-01                  OrderedMuxExchange(sort0=[1]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {3.6864E8 rows, 7.710148578575305E9 cpu, 0.0 io, 5.0331648E11 network, 9.8304E8 memory}, id = 3367
> 02-01                    SelectionVectorRemover : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {3.072E8 rows, 7.648708578575305E9 cpu, 0.0 io, 5.0331648E11 network, 9.8304E8 memory}, id = 3366
> 02-02                      Sort(sort0=[$1], dir0=[ASC]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {2.4576E8 rows, 7.587268578575305E9 cpu, 0.0 io, 5.0331648E11 network, 9.8304E8 memory}, id = 3365
> 02-03                        HashToRandomExchange(dist0=[[$1]]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {1.8432E8 rows, 1.2288E9 cpu, 0.0 io, 5.0331648E11 network, 0.0 memory}, id = 3364
> 03-01                          Project(T3¦¦**=[$0], CharacterValuea=[$1]) : rowType = RecordType(DYNAMIC_STAR T3¦¦**, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {1.2288E8 rows, 2.4576E8 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 3363
> 03-02                            Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath [path=maprfs:///drill/testdata/batch_memory/character5_1MB_1GB.parquet]], selectionRoot=maprfs:/drill/testdata/batch_memory/character5_1MB_1GB.parquet, numFiles=1, numRowGroups=25, usedMetadataFile=false, columns=[`**`]]]) : rowType = RecordType(DYNAMIC_STAR **, ANY CharacterValuea): rowcount = 6.144E7, cumulative cost = {6.144E7 rows, 1.2288E8 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 3362
> {noformat}
> The table can be found in /home/MAPRTECH/qa/rhou/drill6623.
> I have attached the profile and the drillbit.log.
> This was encountered on Apache Drill built from the latest code as of July 19.  This is the commit id:
> | 1.14.0-SNAPSHOT  | 85344abd1ddb73448bdf67cdc6883cb98795a910  | DRILL-6614: Allow usage of MapRDBFormatPlugin for HiveStoragePlugin  | 19.07.2018 @ 10:39:36 PDT  | rhou@mapr.com  | 19.07.2018 @ 15:44:52 PDT  |



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)