You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "James Turton (Jira)" <ji...@apache.org> on 2023/02/08 08:29:00 UTC
[jira] [Reopened] (DRILL-8390) Minor Improvements to PDF Reader
[ https://issues.apache.org/jira/browse/DRILL-8390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Turton reopened DRILL-8390:
---------------------------------
> Minor Improvements to PDF Reader
> --------------------------------
>
> Key: DRILL-8390
> URL: https://issues.apache.org/jira/browse/DRILL-8390
> Project: Apache Drill
> Issue Type: Improvement
> Components: Format - PDF
> Reporter: Charles Givre
> Assignee: Charles Givre
> Priority: Major
>
> This PR makes some minor improvements to the PDF reader including:
> * Fixes a minor bug where certain configurations the first row of data was skipped
> * Fixes a minor bug where empty tables were causing crashes with the spreadsheet extraction algorithm was used
> * Adds a table_count metadata field
> * Adds a table_index metadata field to reflect the current table.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)