You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by da...@correo.aeat.es on 2014/03/11 08:10:14 UTC
DocumentCache Out Of Memory
Hello,
in our project we need to execute some big queries against Solr once a
day, with maybe more than 1000 results, in order to trigger a batch
proccess with the results. In the fl parameter we only are putting the ID
field, because we don't need large text fields.
This is our scenary:
- Our documents are generally very big, but as I have said we only request
for the ID field.
- We have the enableLazyFieldLoading parameter set to true in
solrconfig.xml, so the DocumentCache should load only the ID field that we
are requesting.
- Our DocumetCache is set to 8192 objects.
- This test have been executed in Solr 4.2.1, 4.6.1 and 4.7, in no Solr
Cloud as well as in Solr Cloud mode.
The issue we have got is this:
- When we request for more than 1000 docs. more or less JVM takes a lot of
memory and ends with an OOM.
- Seeing in "real time" as the DocumentCache inserts documents, we have
seen that memory grows when the documents are bigger (and time needed to
load that documents in Cache is larger also), but we don't understand why,
because with enableLazyFieldLoading only the ID should be loaded, so, ¿why
does memory grow in that way?
We know that one solution is increase RAM memory and other is decrease the
size of DocumentCache (we have already done this), but we'd like to know
why this issue with memory.
On the other hand, one good solution for us would be make the queries
without cache. ¿Is there any way to say Solr not to cache some specific
queries? I don't think so, but maybe I am wrong.
Thank very much,
David Dávila Atienza
AEAT - Departamento de Informática Tributaria
Subdirección de Tecnologías de Análisis de la Información e Investigación
del Fraude
Teléfono: 917681160
Extensión: 30160