You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "James Turton (Jira)" <ji...@apache.org> on 2023/02/08 08:29:00 UTC

[jira] [Reopened] (DRILL-8390) Minor Improvements to PDF Reader

     [ https://issues.apache.org/jira/browse/DRILL-8390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

James Turton reopened DRILL-8390:
---------------------------------

> Minor Improvements to PDF Reader
> --------------------------------
>
>                 Key: DRILL-8390
>                 URL: https://issues.apache.org/jira/browse/DRILL-8390
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Format - PDF
>            Reporter: Charles Givre
>            Assignee: Charles Givre
>            Priority: Major
>
> This PR makes some minor improvements to the PDF reader including:
>  * Fixes a minor bug where certain configurations the first row of data was skipped
>  * Fixes a minor bug where empty tables were causing crashes with the spreadsheet extraction algorithm was used
>  * Adds a table_count metadata field
>  * Adds a table_index metadata field to reflect the current table.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)