You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Krishna Dheeraj (JIRA)" <ji...@apache.org> on 2018/09/04 18:45:00 UTC
[jira] [Created] (PDFBOX-4311) Unable to parse some pdf's using
pdfbox.
Krishna Dheeraj created PDFBOX-4311:
---------------------------------------
Summary: Unable to parse some pdf's using pdfbox.
Key: PDFBOX-4311
URL: https://issues.apache.org/jira/browse/PDFBOX-4311
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.9
Environment: Pdfbox -2.0.9
Pdfbox-tools - 2.0.9
Java - 1.7
Scala - 2.10.6
Reporter: Krishna Dheeraj
Attachments: upload_user4024353_claimnr283909709_healthpartners_2018-06-17.pdf
When I tried to convert the PDF file into HTML for parsing the content in the body is empty and there are no errors or exceptions thrown. It is happening for only few files, others are are working as expected. I am attaching the file which we are unable to parse. Let us know know in case of any resolutions are avilable.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@pdfbox.apache.org
For additional commands, e-mail: dev-help@pdfbox.apache.org