Posted to dev@pdfbox.apache.org by "Anca Zapuc (Updated) (JIRA)" <ji...@apache.org> on 2012/02/09 23:16:57 UTC
[jira] [Updated] (PDFBOX-1226) Counting pages of a PDF gives OutOfMemoryError

     [ https://issues.apache.org/jira/browse/PDFBOX-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Anca Zapuc updated PDFBOX-1226:
-------------------------------

    Description: 
I have a PDF (397 MB) and I am trying to count its pages.
I am able to open the PDF with Adobe Reader 9, but not with Foxit Reader.
Code:
  PDDocument doc = null;
  File temp = null;
  RandomAccessFile rand = null;
  int nr = 0;
  try {
      // Create the temporary file PDFBox needs when dealing with very large PDFs
      temp = new File("e:/temp.tmp");
      // Use a random access file as scratch storage, needed for very large PDFs
      rand = new RandomAccessFile(temp, "rw");
      doc = PDDocument.load(file, rand);
      nr = doc.getNumberOfPages();
  } catch (Exception e) {
      e.printStackTrace();
  }

I got the following exception:
org.apache.pdfbox.exceptions.WrappedIOException
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:240)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1069)
	at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:1022)
	at PDFBoxExample.getHugeNrOfFiles(PDFBoxExample.java:36)
	at PDFBoxExample.main(PDFBoxExample.java:258)
Caused by: java.lang.OutOfMemoryError: Java heap space
	at java.lang.AbstractStringBuilder.<init>(AbstractStringBuilder.java:45)
	at java.lang.StringBuffer.<init>(StringBuffer.java:79)
	at org.apache.pdfbox.pdfparser.BaseParser.readString(BaseParser.java:1121)
	at org.apache.pdfbox.pdfparser.BaseParser.parseCOSStream(BaseParser.java:402)
	at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:552)
	at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:184)
	... 4 more

I attached the PDF.

    
> Counting pages of a PDF gives OutOfMemoryError
> ----------------------------------------------
>
>                 Key: PDFBOX-1226
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1226
>             Project: PDFBox
>          Issue Type: Bug
>          Components: PDFReader
>    Affects Versions: 1.6.0
>         Environment: Windows 7 / Windows XP
>            Reporter: Anca Zapuc
>         Attachments: Big_no_pages.7z
