Posted to dev@drill.apache.org by GitBox <gi...@apache.org> on 2023/01/19 03:40:10 UTC

[GitHub] [drill] cgivre opened a new pull request, #2742: DRILL-8390: Minor Improvements to PDF Reader

cgivre opened a new pull request, #2742:
URL: https://github.com/apache/drill/pull/2742

   # [DRILL-8390](https://issues.apache.org/jira/browse/DRILL-8390): Minor Improvements to PDF Reader
   
   
   ## Description
   This PR makes some minor improvements to the PDF reader, including:
   * Fixes a minor bug where, in certain configurations, the first row of data was skipped.
   * Fixes a minor bug where empty tables caused crashes when the spreadsheet extraction algorithm was used.
   * Adds a `_table_count` metadata field.
   * Adds a `_table_index` metadata field to reflect the current table.
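
To illustrate what the two metadata fields described above convey, here is a minimal Python sketch. It is not the Drill implementation: the function name `rows_with_table_metadata` and the sample data are hypothetical, and it assumes that `_table_index` is 1-based and that empty tables still count toward `_table_count`, neither of which is stated in this PR description.

```python
from typing import Any


def rows_with_table_metadata(
    tables: list[list[dict[str, Any]]],
) -> list[dict[str, Any]]:
    """Flatten extracted tables into rows, tagging each row with table metadata."""
    # Assumption: empty tables are still counted in the total.
    table_count = len(tables)
    rows: list[dict[str, Any]] = []
    # Assumption: the table index is 1-based.
    for table_index, table in enumerate(tables, start=1):
        if not table:
            # An empty table yields no rows; skip it instead of failing.
            continue
        for row in table:
            rows.append(
                {**row, "_table_index": table_index, "_table_count": table_count}
            )
    return rows


# Hypothetical document with three tables, the second of which is empty.
tables = [
    [{"name": "a"}, {"name": "b"}],
    [],
    [{"name": "c"}],
]
rows = rows_with_table_metadata(tables)
```

With both fields on every row, a consumer can tell which table a row came from and how many tables the document held without a second pass over the file.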
   
   ## Documentation
   See above.  Updated README.
   
   ## Testing
   Ran existing unit tests.  Manually tested against customer data.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscribe@drill.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [drill] cgivre merged pull request #2742: DRILL-8390: Minor Improvements to PDF Reader

Posted by GitBox <gi...@apache.org>.
cgivre merged PR #2742:
URL: https://github.com/apache/drill/pull/2742

