You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Charles Givre (Jira)" <ji...@apache.org> on 2023/01/19 03:37:00 UTC

[jira] [Created] (DRILL-8390) Minor Improvements to PDF Reader

Charles Givre created DRILL-8390:
------------------------------------

             Summary: Minor Improvements to PDF Reader
                 Key: DRILL-8390
                 URL: https://issues.apache.org/jira/browse/DRILL-8390
             Project: Apache Drill
          Issue Type: Improvement
          Components: Format - PDF
            Reporter: Charles Givre
            Assignee: Charles Givre


This PR makes some minor improvements to the PDF reader including:
 * Fixes a minor bug where certain configurations the first row of data was skipped
 * Fixes a minor bug where empty tables were causing crashes with the spreadsheet extraction algorithm was used
 * Adds a table_count metadata field
 * Adds a table_index metadata field to reflect the current table.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)