Posted to dev@pdfbox.apache.org by "Christian Czech (JIRA)" <ji...@apache.org> on 2013/08/12 18:16:48 UTC

[jira] [Created] (PDFBOX-1692) java.lang.OutOfMemoryError: Java heap space

Christian Czech created PDFBOX-1692:
---------------------------------------

             Summary: java.lang.OutOfMemoryError: Java heap space
                 Key: PDFBOX-1692
                 URL: https://issues.apache.org/jira/browse/PDFBOX-1692
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 1.8.2
         Environment: Windows 7
java version 1.7.0_17 (build 1.7.0_17-b02/64-Bit Server VM build 23.7-01)
pdfbox-app-1.8.2.jar
            Reporter: Christian Czech


Hello,

I have a problem with text extraction:
the JVM runs out of heap memory while the text is being extracted.

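My code:
// Backslashes in a Java string literal must be escaped ("\t" would be a tab).
String pdfFile = "D:\\testfolder\\test1fd9a_test.pdf"; // file size: 168 KB
PDDocument document = PDDocument.load(pdfFile, true); // second argument: force

PDFTextStripper stripper = null;
try {
    stripper = new PDFTextStripper();
    stripper.setSortByPosition(true);
    // outputWriter is a java.io.Writer created elsewhere (not shown)
    stripper.writeText(document, outputWriter);
} catch (IOException e) {
    // The OutOfMemoryError is an Error, so it is not caught here
    e.printStackTrace();
}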

Running this code fails with:
java.lang.OutOfMemoryError: Java heap space
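
To make a heap error like this easier to reproduce, it may help to record the heap limits the JVM is actually running with. The following is only a minimal sketch using the standard java.lang.Runtime API; the class name HeapInfo and its method names are hypothetical and not part of PDFBox.

```java
// Hypothetical helper, not part of PDFBox: reports heap figures of the running JVM.
final class HeapInfo {
    private static final long MIB = 1024L * 1024L;

    /** Maximum heap the JVM will attempt to use (typically the -Xmx limit), in MiB. */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / MIB;
    }

    /** Heap currently in use (allocated minus free), in MiB. */
    static long usedHeapMiB() {
        Runtime rt = Runtime.getRuntime();
        return (rt.totalMemory() - rt.freeMemory()) / MIB;
    }

    /** One-line summary suitable for a log or a bug report. */
    static String describe() {
        return "heap used=" + usedHeapMiB() + " MiB, max=" + maxHeapMiB() + " MiB";
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

Calling HeapInfo.describe() just before stripper.writeText(...) would show how much headroom is left at that point. Starting the JVM with a larger limit through the standard -Xmx option (for example java -Xmx1g ...) is one way to test whether the extraction simply needs more memory, although that may only hide the underlying problem.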

