You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@pdfbox.apache.org by "Christian Czech (JIRA)" <ji...@apache.org> on 2013/03/28 15:50:14 UTC

[jira] [Created] (PDFBOX-1554) You can extract text with Foxit Reader or Adobe Reader but not with PDFBox

Christian Czech created PDFBOX-1554:
---------------------------------------

             Summary: You can extract text with Foxit Reader or Adobe Reader but not with PDFBox
                 Key: PDFBOX-1554
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1554
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 1.8.0
         Environment: Windows XP
JRE 1.6.0_39-b04
            Reporter: Christian Czech


With Foxit Reader or Adobe Reader it's possibel save the PDF Document as text without password.
With PDFBox you get the error:
WARN  (PDFBoxExtractor.java:113) - [ test_1e6a2e_001_test.pdf ] Could not decypt document, object not found

Code:
PDDocument document = null;
document = PDDocument.load(pdfFile, true);
	
try {
  if (document.isEncrypted()) {
  StandardDecryptionMaterial sdm = new StandardDecryptionMaterial("");
  document.openProtection(sdm);
}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira