Posted to solr-user@lucene.apache.org by HanumanthaRao K <hk...@gmail.com> on 2015/10/15 10:37:49 UTC

Request for help: Solr - Indexer causes java.lang.OutOfMemoryError: Java heap space

Dear Solr Team,


This is Hanumantharao. I am getting an error when parsing and indexing
content using Solr. If you know the cause, please help me with it.
------------------------------------------------------------------------

I am using Solr for indexing as well as content search. When I try it with
1 GB of data and 10 GB of RAM for Tomcat and Solr, I get this error:

java.lang.OutOfMemoryError: Requested array size exceeds VM limit
    at java.util.Arrays.copyOf(Unknown Source)
    at java.lang.AbstractStringBuilder.expandCapacity(Unknown Source)
    at java.lang.AbstractStringBuilder.ensureCapacityInternal(Unknown Source)
    at java.lang.AbstractStringBuilder.append(Unknown Source)
    at java.lang.StringBuffer.append(Unknown Source)
    at java.io.StringWriter.write(Unknown Source)
    at org.apache.tika.sax.ToTextContentHandler.characters(ToTextContentHandler.java:93)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.WriteOutContentHandler.characters(WriteOutContentHandler.java:136)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.xpath.MatchingContentHandler.characters(MatchingContentHandler.java:85)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SecureContentHandler.characters(SecureContentHandler.java:270)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.ContentHandlerDecorator.characters(ContentHandlerDecorator.java:146)
    at org.apache.tika.sax.SafeContentHandler.access$001(SafeContentHandler.java:46)
    at org.apache.tika.sax.SafeContentHandler$1.write(SafeContentHandler.java:82)
    at org.apache.tika.sax.SafeContentHandler.filter(SafeContentHandler.java:140)
    at org.apache.tika.sax.SafeContentHandler.characters(SafeContentHandler.java:287)
    at org.apache.tika.sax.XHTMLContentHandler.characters(XHTMLContentHandler.java:278)
    at org.apache.tika.parser.txt.TXTParser.parse(TXTParser.java:88)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:256)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120)
    at SolrHandler.indexFile(SolrHandler.java:218)

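My code is:

    mData = new Metadata();
    // -1 disables the write limit, so all extracted text is buffered in memory
    cHandler = new BodyContentHandler(-1); // or linkedHandler()
    adp = new AutoDetectParser();

    // Parse the stream, then add the whole extracted text as one field
    adp.parse(inputStream, cHandler, mData, new ParseContext());
    sid.addField("content", cHandler.toString());
    solrServer.add(sid);
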
Could you please tell me how to resolve the issue? Even after increasing the
RAM size I still get the error. I am parsing with Tika 1.7.

Thanks,
Hanuman

Re: Request for help: Solr - Indexer causes java.lang.OutOfMemoryError: Java heap space

Posted by Zheng Lin Edwin Yeo <ed...@gmail.com>.
You can increase the maximum Java heap size by setting the -Xmx parameter
when you start Solr, if you are using Solr 5.1 or below (e.g. -Xmx4096M).

For Solr 5.2 and above, set the -m parameter (e.g. -m 4g).
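
To confirm that a heap setting such as the ones above actually reached the JVM that runs the indexing code, a small check along these lines may help. This is only a sketch: the class name HeapCheck and the method maxHeapMiB are hypothetical, and it reports whatever limit the current JVM was started with, not Solr's configuration as such.

```java
public class HeapCheck {
    /** Returns the maximum heap the current JVM will attempt to use, in MiB. */
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        // Run inside the same JVM (same startup options) as the indexer.
        System.out.println("Max heap: " + maxHeapMiB() + " MiB");
    }
}
```

If the printed value is far below what was requested, the option was probably set for a different process than the one doing the Tika parsing.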

Regards,
Edwin


On 15 October 2015 at 21:09, Shawn Heisey <ap...@elyograg.org> wrote:

> On 10/15/2015 2:37 AM, HanumanthaRao K wrote:
> > this is hanumantharao. I am getting the error when parsing and indexing
> > content using solr. if you know please help me on the same.
> > ------------------------------------------------------------------------
> >
> > I am using solr for index as well as content search. when I am trying with
> > 1 gb data and RAM sizes 10 GB for tomcat and solr . I am getting error:
> > java.lang.OutOfMemoryError: Requested array size exceeds VM limit at
>
> Tika, which is what is being used here to extract text out of your
> document, is requiring more memory than you have allocated in order to
> do the extraction on whatever document it is currently indexing.
>
> See this previous message on this mailing list:
>
>
> http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201407.mbox/%3C53D658C7.1000203@raytion.com%3E
>
> Thanks,
> Shawn
>
>

Re: Request for help: Solr - Indexer causes java.lang.OutOfMemoryError: Java heap space

Posted by Shawn Heisey <ap...@elyograg.org>.
On 10/15/2015 2:37 AM, HanumanthaRao K wrote:
> this is hanumantharao. I am getting the error when parsing and indexing
> content using solr. if you know please help me on the same.
> ------------------------------------------------------------------------
> 
> I am using solr for index as well as content search. when I am trying with
> 1 gb data and RAM sizes 10 GB for tomcat and solr . I am getting error:
> java.lang.OutOfMemoryError: Requested array size exceeds VM limit at

Tika, which is what is being used here to extract text out of your
document, is requiring more memory than you have allocated in order to
do the extraction on whatever document it is currently indexing.
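
One possible reason this particular message ("Requested array size exceeds VM limit") can persist even with a larger heap: the stack trace ends in AbstractStringBuilder.expandCapacity, which on Java 7/8-era JDKs grows the buffer to roughly twice its current length. The sketch below only mirrors that arithmetic. It assumes the doubling rule is value.length * 2 + 2, and the names BuilderGrowth and doubledCapacity are hypothetical.

```java
public class BuilderGrowth {
    /** Mirrors the assumed JDK doubling rule: value.length * 2 + 2, in int arithmetic. */
    static int doubledCapacity(int currentCapacity) {
        return currentCapacity * 2 + 2; // can overflow to a negative number
    }

    public static void main(String[] args) {
        // About 1 GB of single-byte text decodes to about 2^30 chars.
        int oneGiChars = 1 << 30;
        int next = doubledCapacity(oneGiChars);
        System.out.println("next capacity overflowed: " + (next < 0));
    }
}
```

If that assumption holds, a single document that extracts to around a billion characters pushes the requested array length past what a Java array can hold, and no -Xmx value changes that limit. Whether this is what happened here depends on whether the 1 GB of data is one file or many.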

See this previous message on this mailing list:

http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201407.mbox/%3C53D658C7.1000203@raytion.com%3E

Thanks,
Shawn
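
A complementary approach to adding memory is to cap how much extracted text is kept per document. The sketch below uses only the JDK and shows a hypothetical BoundedWriter that forwards at most a fixed number of characters and drops the rest. Tika's BodyContentHandler has, to my knowledge, a constructor that accepts a Writer and another that accepts a positive write limit instead of -1. Please check both against the Tika 1.7 Javadoc before relying on them.

```java
import java.io.IOException;
import java.io.StringWriter;
import java.io.Writer;

/** Hypothetical helper: passes through at most maxChars characters, then drops the rest. */
public class BoundedWriter extends Writer {
    private final Writer out;
    private final long maxChars;
    private long written = 0;
    private boolean truncated = false;

    public BoundedWriter(Writer out, long maxChars) {
        this.out = out;
        this.maxChars = maxChars;
    }

    @Override
    public void write(char[] buf, int off, int len) throws IOException {
        long remaining = maxChars - written;
        if (len > remaining) {
            // Not everything fits: remember that, and keep only what does.
            truncated = true;
            len = (int) Math.max(remaining, 0);
        }
        if (len > 0) {
            out.write(buf, off, len);
            written += len;
        }
    }

    @Override
    public void flush() throws IOException {
        out.flush();
    }

    @Override
    public void close() throws IOException {
        out.close();
    }

    /** True if any characters were dropped because the limit was reached. */
    public boolean isTruncated() {
        return truncated;
    }

    public static void main(String[] args) throws IOException {
        StringWriter sink = new StringWriter();
        BoundedWriter w = new BoundedWriter(sink, 10);
        w.write("0123456789ABCDEF");
        System.out.println(sink + " truncated=" + w.isTruncated());
        // prints "0123456789 truncated=true"
    }
}
```

In the original code this would presumably be wired in as something like new BodyContentHandler(new BoundedWriter(new StringWriter(), limit)), with the text read back from the StringWriter. That wiring is an assumption and is not tested here. The trade-off is that very large documents are indexed only partially, so isTruncated() is worth logging.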