You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "PJ Fanning (Jira)" <ji...@apache.org> on 2021/12/07 10:55:00 UTC

[jira] [Created] (DRILL-8070) format-excel assumes that rowIterator returns every row - it doesn't

PJ Fanning created DRILL-8070:
---------------------------------

             Summary: format-excel assumes that rowIterator returns every row - it doesn't
                 Key: DRILL-8070
                 URL: https://issues.apache.org/jira/browse/DRILL-8070
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Data Types
            Reporter: PJ Fanning


In ExcelBatchReader, this code makes the wrong assumption:

```

for (int i = 1; i < rowNumber; i++) {
  currentRow = rowIterator.next();
}

```

 

There are 2 for loops like this.

 

Empty Rows will not necessarily be returned by the iterator. Basically, rows without populated cells could easily be skipped. Think of the Sheet as being represented as a sparse matrix - because it is stored like this.

 

 

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)