Posted to java-user@lucene.apache.org by Patricio Galeas <ga...@informatik.uni-siegen.de> on 2005/10/26 21:12:30 UTC

Java heap space ...after index process

Hello All,
I am trying to index some PDF documents using PDFBox.
It appears to work normally, but when the indexing process ends, I get the
following message:
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space 

Do you have any ideas?
Thanks
Patricio



---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-user-help@lucene.apache.org


Re: Re: Java heap space ...after index process

Posted by Gusenbauer Stefan <gu...@eduhi.at>.
Patricio Galeas wrote:

>Hello Ben,
>It happens when one of the documents [4.95 MB] is indexed.
>I am using the framework for indexing office documents from the book
>"Lucene In Action". I believe the PDDocument objects are closed correctly.
>
>I'll look for more information about increasing the heap size.
>
>PDFBox version = 0.7.2
>
>Thank you for your help
>Patricio
>
I've had this problem too. Here are some possible solutions:
1) implement a memory warning system
http://www.javaspecialists.co.za/archive/Issue092.html
2) increase the maximum heap size, for example with java -Xmx512M
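
Both suggestions can be illustrated with a short sketch. This is not the code
from the linked article; it is a minimal example that assumes the standard
java.lang.management API (Java 5 or later, written here with Java 8 lambdas).
The class name HeapWarning, the method names and the 80% threshold are
hypothetical choices made for illustration.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryMXBean;
import java.lang.management.MemoryNotificationInfo;
import java.lang.management.MemoryPoolMXBean;
import java.lang.management.MemoryType;
import java.lang.management.MemoryUsage;
import javax.management.NotificationEmitter;

// Hypothetical helper, not part of Lucene or PDFBox.
class HeapWarning {

    /** Maximum heap the JVM will try to use, in MiB (this is what -Xmx controls). */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    /**
     * Asks every heap pool that supports usage thresholds to emit a
     * notification once its usage passes the given fraction of its maximum.
     * Returns the number of pools that were configured.
     */
    static int installWarning(double fraction, Runnable onLowMemory) {
        if (fraction <= 0.0 || fraction > 1.0) {
            throw new IllegalArgumentException("fraction must be in (0, 1]");
        }
        // The platform MemoryMXBean also acts as a notification emitter.
        MemoryMXBean memory = ManagementFactory.getMemoryMXBean();
        ((NotificationEmitter) memory).addNotificationListener(
            (notification, handback) -> {
                if (MemoryNotificationInfo.MEMORY_THRESHOLD_EXCEEDED
                        .equals(notification.getType())) {
                    onLowMemory.run();
                }
            }, null, null);

        int configured = 0;
        for (MemoryPoolMXBean pool : ManagementFactory.getMemoryPoolMXBeans()) {
            MemoryUsage usage = pool.getUsage();
            // Skip invalid pools and pools whose maximum is undefined (-1).
            if (usage == null || usage.getMax() <= 0) {
                continue;
            }
            if (pool.getType() == MemoryType.HEAP && pool.isUsageThresholdSupported()) {
                pool.setUsageThreshold((long) (usage.getMax() * fraction));
                configured++;
            }
        }
        return configured;
    }

    public static void main(String[] args) {
        System.out.println("Max heap: " + maxHeapMiB() + " MiB");
        int pools = installWarning(0.8,
            () -> System.err.println("Warning: heap usage above 80%"));
        System.out.println("Heap pools with a usage threshold: " + pools);
    }
}
```

Starting the program with java -Xmx512M HeapWarning should show the larger
limit in the first line of output. An indexer could call installWarning once
at startup and use the callback to log the file currently being processed.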


Re: Java heap space ...after index process

Posted by Patricio Galeas <ga...@informatik.uni-siegen.de>.
Hello Ben,
It happens when one of the documents [4.95 MB] is indexed.
I am using the framework for indexing office documents from the book
"Lucene In Action". I believe the PDDocument objects are closed correctly.

I'll look for more information about increasing the heap size.

PDFBox version = 0.7.2

Thank you for your help
Patricio

-----Original Message-----
From: Ben Litchfield [mailto:ben@csh.rit.edu]
Sent: Wednesday, 26 October 2005 21:17
To: java-user@lucene.apache.org
Subject: Re: Java heap space ...after index process

Is this only after the entire indexing process is finished or do you 
mean it happens on one of the documents you are extracting text from?

Are you closing the PDDocument objects when you are done with them?

What heap size are you using and have you tried increasing it?

What version of PDFBox?

Ben


Re: Java heap space ...after index process

Posted by Ben Litchfield <be...@csh.rit.edu>.
Is this only after the entire indexing process is finished or do you 
mean it happens on one of the documents you are extracting text from?

Are you closing the PDDocument objects when you are done with them?

What heap size are you using and have you tried increasing it?

What version of PDFBox?

Ben
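
On the question of closing documents, the usual pattern is to close each one
in a finally block so that it is released even when text extraction fails. The
sketch below only illustrates that pattern: FakeDocument is a hypothetical
stand-in for PDFBox's PDDocument, since the actual PDFBox 0.7.2 calls are not
shown in this thread.

```java
import java.io.Closeable;
import java.io.IOException;

// Hypothetical example class; it does not use PDFBox itself.
class CloseAfterUse {

    /** Stand-in for a parsed PDF that holds memory until it is closed. */
    static class FakeDocument implements Closeable {
        static int open = 0;          // number of documents not yet closed
        private final String text;

        FakeDocument(String text) {
            this.text = text;
            open++;
        }

        String getText() {
            return text;
        }

        @Override
        public void close() {
            open--;
        }
    }

    /** Extracts the text and always closes the document afterwards. */
    static String extractText(String source) throws IOException {
        FakeDocument doc = null;
        try {
            doc = new FakeDocument(source);
            return doc.getText();
        } finally {
            if (doc != null) {
                doc.close();          // runs on success and on failure
            }
        }
    }

    public static void main(String[] args) throws IOException {
        String[] sources = {"first", "second", "third"};
        for (String source : sources) {
            System.out.println(extractText(source));
        }
        System.out.println("Documents still open: " + FakeDocument.open);
    }
}
```

If the count of open documents grows while many files are indexed, some code
path is skipping close(), and memory use will grow with every file.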
