Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2014/06/23 17:15:25 UTC
[jira] [Commented] (TIKA-1187) java.lang.OutOfMemoryError: Java heap space
[ https://issues.apache.org/jira/browse/TIKA-1187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14040830#comment-14040830 ]
Tyler Palsulich commented on TIKA-1187:
---------------------------------------
Hi [~Guffi],
Did you ever get this issue resolved? Do you still have the problematic file?
Thanks,
Tyler
> java.lang.OutOfMemoryError: Java heap space
> -------------------------------------------
>
> Key: TIKA-1187
> URL: https://issues.apache.org/jira/browse/TIKA-1187
> Project: Tika
> Issue Type: Bug
> Components: general
> Affects Versions: 1.3
> Environment: Ubuntu
> Reporter: GURFAN
> Priority: Critical
> Original Estimate: 612h
> Remaining Estimate: 612h
>
> Hi,
> While parsing content, we get the exception below from the parse method.
> TIKA JAR: tika-core-1.3.jar
> File size: 1 MB.
> Parser parser = new AutoDetectParser();
> parser.parse(is, handler, metaData, new ParseContext());
> java.lang.OutOfMemoryError: Java heap space
> at java.util.Arrays.copyOf(Arrays.java:2734)
> at java.util.ArrayList.ensureCapacity(ArrayList.java:167)
> at java.util.ArrayList.add(ArrayList.java:351)
> at org.apache.fontbox.ttf.GlyfCompositeDescript.<init>(GlyfCompositeDescript.java:60)
> at org.apache.fontbox.ttf.GlyphData.initData(GlyphData.java:63)
> at org.apache.fontbox.ttf.GlyphTable.initData(GlyphTable.java:71)
> at org.apache.fontbox.ttf.AbstractTTFParser.parseTables(AbstractTTFParser.java:163)
> at org.apache.fontbox.ttf.TTFParser.parseTables(TTFParser.java:61)
> at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:90)
> at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
> at org.apache.fontbox.ttf.AbstractTTFParser.parseTTF(AbstractTTFParser.java:66)
> at org.apache.fontbox.ttf.TTFParser.parseTTF(TTFParser.java:26)
> at org.apache.tika.parser.font.TrueTypeParser.parse(TrueTypeParser.java:65)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242)
> at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
> at com.impetus.vajra.parser.tika.TikaParser.processContent(TikaParser.java:96)
> at com.impetus.vajra.storm.helper.TextAnalyserBoltHelper.execute(TextAnalyserBoltHelper.java:283)
> at com.impetus.vajra.storm.TextAnalyserBolt.execute(TextAnalyserBolt.java:182)
> at backtype.storm.daemon.executor$fn__4050$tuple_action_fn__4052.invoke(executor.clj:566)
> at backtype.storm.daemon.executor$mk_task_receiver$fn__3976.invoke(executor.clj:345)
> at backtype.storm.disruptor$clojure_handler$reify__1606.onEvent(disruptor.clj:43)
> at backtype.storm.utils.DisruptorQueue.consumeBatchToCursor(DisruptorQueue.java:84)
> at backtype.storm.utils.DisruptorQueue.consumeBatchWhenAvailable(DisruptorQueue.java:58)
> at backtype.storm.disruptor$consume_batch_when_available.invoke(disruptor.clj:62)
> at backtype.storm.daemon.executor$fn__4050$fn__4059$fn__4106.invoke(executor.clj:658)
> at backtype.storm.util$async_loop$fn__465.invoke(util.clj:377)
> at clojure.lang.AFn.run(AFn.java:24)
> at java.lang.Thread.run(Thread.java:662)
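
Reading the trace above: the allocation fails inside fontbox's TrueType glyph parsing, reached through Tika's TrueTypeParser, so AutoDetectParser evidently treated the 1 MB input as a TrueType font. Until the problematic file can be examined, one possible caller-side workaround is to peek at the first four bytes of the stream and skip inputs that carry a TrueType sfnt tag. The sketch below is only that: the class FontSniffer and the method looksLikeTrueType are hypothetical names, not part of Tika or fontbox, and it uses nothing beyond the JDK. It assumes the stream supports mark/reset (for example a BufferedInputStream).

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;

// Hypothetical helper, not a Tika or fontbox API.
final class FontSniffer {
    // sfnt version tags used by TrueType fonts.
    private static final int TAG_VERSION_1_0 = 0x00010000; // TrueType 1.0
    private static final int TAG_TRUE = 0x74727565;        // ASCII "true"

    private FontSniffer() {}

    /**
     * Peeks at the first four bytes of the stream and reports whether they
     * match a TrueType sfnt tag. The stream position is restored afterwards.
     */
    static boolean looksLikeTrueType(InputStream in) {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        byte[] head = new byte[4];
        int read = 0;
        try {
            in.mark(4);
            try {
                // read() may return fewer bytes than requested, so loop.
                while (read < 4) {
                    int n = in.read(head, read, 4 - read);
                    if (n < 0) {
                        break; // end of stream before four bytes
                    }
                    read += n;
                }
            } finally {
                in.reset(); // leave the stream where the caller had it
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        if (read < 4) {
            return false;
        }
        int tag = ((head[0] & 0xFF) << 24) | ((head[1] & 0xFF) << 16)
                | ((head[2] & 0xFF) << 8) | (head[3] & 0xFF);
        return tag == TAG_VERSION_1_0 || tag == TAG_TRUE;
    }
}
```

A caller could test `FontSniffer.looksLikeTrueType(is)` before `parser.parse(is, handler, metaData, new ParseContext())` and skip or quarantine the input when it returns true. This does not fix the underlying parser behaviour; it only keeps such files away from it.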
--
This message was sent by Atlassian JIRA
(v6.2#6252)