You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Chun Chang (JIRA)" <ji...@apache.org> on 2015/05/08 00:40:05 UTC

[jira] [Created] (DRILL-2986) IOBException query multiple files that contain schema changes between files

Chun Chang created DRILL-2986:
---------------------------------

             Summary: IOBException query multiple files that contain schema changes between files
                 Key: DRILL-2986
                 URL: https://issues.apache.org/jira/browse/DRILL-2986
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Data Types
    Affects Versions: 1.0.0
            Reporter: Chun Chang
            Assignee: Daniel Barclay (Drill)


{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from sys.version;
+------------+----------------+-------------+-------------+------------+
| commit_id  | commit_message | commit_time | build_email | build_time |
+------------+----------------+-------------+-------------+------------+
| 31e51832db216ca16525af83abd445b812c569c4 | DRILL-2963: Fix NestedLoopJoinBatch when left batch is empty | 06.05.2015 @ 14:21:57 EDT | Unknown     | 06.05.2015 @ 18:04:15 EDT |
+------------+----------------+-------------+-------------+------------+
{code}

The following query (Advanced/Passing/complextype/json/complex313.q) read from four files in a dir. Between files, there is schema changes.

{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`;
+------------+------------+------------+------------+------------+
|    dir0    |    dir1    |    dir2    |     id     |    oooa    |
+------------+------------+------------+------------+------------+
| testdata   | complex_type | json       | 2          | {"oa":{"oab":{"oabc":[{"rowId":2},{"rowValue1":2,"rowValue2":2}]}}} |
| testdata   | complex_type | json       | 2          | null       |
java.lang.RuntimeException: java.sql.SQLException: Failure while executing query.
	at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
	at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
	at sqlline.SqlLine.print(SqlLine.java:1809)
	at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
	at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
	at sqlline.SqlLine.dispatch(SqlLine.java:889)
	at sqlline.SqlLine.begin(SqlLine.java:763)
	at sqlline.SqlLine.start(SqlLine.java:498)
	at sqlline.SqlLine.main(SqlLine.java:460)
{code}

drill log:

{code}
2015-05-07 15:01:11,570 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO  o.a.d.e.s.schedule.BlockMapBuilder - Get block maps: Executed 4 out of 4 using 4 threads. Time: 2ms total, 1.126568ms avg, 1ms max.
2015-05-07 15:01:11,577 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO  o.a.drill.exec.work.foreman.Foreman - State change requested.  PENDING --> RUNNING
2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from AWAITING_ALLOCATION --> RUNNING for
2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.e.w.f.AbstractStatusReporter - State changed for 2ab41f58-4a68-9384-2310-1853a36405a1:0:0. New state: RUNNING
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.c.e.DrillRuntimeException - User Error Occurred
org.apache.drill.common.exceptions.UserException: DATA_READ ERROR: index: 0, length: 4 (expected: range(0, 0))

Line  11
Column  48
Field  rowValue1

[Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 ]
	at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:465) ~[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:512) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeDataSwitch(JsonReader.java:242) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeToVector(JsonReader.java:180) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.write(JsonReader.java:146) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.store.easy.json.JSONRecordReader.next(JSONRecordReader.java:194) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:175) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:83) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.innerNext(ScreenCreator.java:80) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:73) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:199) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:193) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at java.security.AccessController.doPrivileged(Native Method) [na:1.7.0_45]
	at javax.security.auth.Subject.doAs(Subject.java:415) [na:1.7.0_45]
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1469) [hadoop-common-2.4.1-mapr-1408.jar:na]
	at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:193) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_45]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_45]
	at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
Caused by: java.lang.IndexOutOfBoundsException: index: 0, length: 4 (expected: range(0, 0))
	at io.netty.buffer.DrillBuf.checkIndexD(DrillBuf.java:189) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
	at io.netty.buffer.DrillBuf.chk(DrillBuf.java:211) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
	at io.netty.buffer.DrillBuf.getInt(DrillBuf.java:491) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
	at org.apache.drill.exec.vector.UInt4Vector$Accessor.get(UInt4Vector.java:300) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.EmptyValuePopulator.populate(EmptyValuePopulator.java:46) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.RepeatedMapVector$Mutator.setValueCount(RepeatedMapVector.java:534) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.impl.RepeatedMapWriter.start(RepeatedMapWriter.java:169) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:278) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	... 25 common frames omitted
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from RUNNING --> FAILED for
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from FAILED --> FINISHED for
2015-05-07 15:01:11,636 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.work.foreman.Foreman - State change requested.  RUNNING --> FAILED
org.apache.drill.common.exceptions.UserRemoteException: DATA_READ ERROR: index: 0, length: 4 (expected: range(0, 0))

File  /drill/testdata/complex_type/json/jira1962a.json
Record  1
Line  11
Column  48
Field  rowValue1
Line  11
Column  48
Field  rowValue1
Fragment 0:0

[Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 on qa-node119.qa.lab:31010]
	at org.apache.drill.exec.work.foreman.QueryManager$1.statusUpdate(QueryManager.java:409) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:389) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:90) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:86) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:266) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_45]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_45]
	at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from FAILED --> CANCELLATION_REQUESTED for
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN  o.a.d.e.w.fragment.FragmentExecutor - Ignoring unexpected state transition FAILED => CANCELLATION_REQUESTED.
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.work.foreman.Foreman - foreman cleaning up.
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO  o.a.drill.exec.work.foreman.Foreman - State change requested.  FAILED --> COMPLETED
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN  o.a.drill.exec.work.foreman.Foreman - Dropping request to move to COMPLETED state as query is already at FAILED state (which is terminal).
{code}

plan:

{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> explain plan for select * from dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`;
+------------+------------+
|    text    |    json    |
+------------+------------+
| 00-00    Screen
00-01      Scan(groupscan=[EasyGroupScan [selectionRoot=/drill, numFiles=4, columns=[`*`], files=[maprfs:/drill/testdata/complex_type/json/jira1894.json, maprfs:/drill/testdata/complex_type/json/jira1893.json, maprfs:/drill/testdata/complex_type/json/jira1962a.json, maprfs:/drill/testdata/complex_type/json/jira1962b.json]]])
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)