You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tika User (Jira)" <ji...@apache.org> on 2022/01/10 15:53:00 UTC

[jira] [Created] (TIKA-3642) Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file

Tika User created TIKA-3642:
-------------------------------

             Summary: Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file
                 Key: TIKA-3642
                 URL: https://issues.apache.org/jira/browse/TIKA-3642
             Project: Tika
          Issue Type: Bug
            Reporter: Tika User


When parsing large PDF files(1.65 GB) we are getting out of memory error. The version we are using 2.2.1


java.lang.OutOfMemoryError: Java heap space at org.apache.pdfbox.pdfparser.COSParser.isString



--
This message was sent by Atlassian Jira
(v8.20.1#820001)