Posted to dev@lucene.apache.org by "Christian Moen (Commented) (JIRA)" <ji...@apache.org> on 2012/04/10 17:37:19 UTC

[jira] [Commented] (SOLR-3282) Perform Kuromoji/Japanese stability test before 3.6 freeze

    [ https://issues.apache.org/jira/browse/SOLR-3282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13250751#comment-13250751 ] 

Christian Moen commented on SOLR-3282:
--------------------------------------

I'll resolve this issue now.

I've also done additional testing with the VisualVM Visual GC plugin, and I'm seeing that the {{org.apache.lucene.analysis.ja.Token}} objects get collected pretty much as we expect.  In actual deployments, it's probably a good idea to use a larger eden space, either by using the server GC defaults or by tuning the GC settings.

In longer-term tests, it seems that Solr's heap space also fills up with a 512MB heap before a full GC recovers a lot of it.  I suspect this might be caused by searching with highlighting, since the heap seems very stable with indexing only.  (In either case, this doesn't seem to be caused by Kuromoji.)
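
For readers who want to watch the same heap behaviour without VisualVM, the following is a minimal sketch using the standard {{java.lang.management}} API. The class name {{HeapGcSampler}} and its methods are hypothetical and are not part of Solr or Kuromoji. Note that it reports on the JVM it runs in, so observing Solr this way assumes the code runs inside the Solr JVM (or is adapted to read the same MXBeans over remote JMX).

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.lang.management.MemoryUsage;

// Hypothetical helper: samples heap usage and GC counts of the current JVM.
public class HeapGcSampler {

    /** Percentage of the heap limit in use; returns -1 when the limit is undefined. */
    static double usedPercent(long usedBytes, long maxBytes) {
        if (maxBytes <= 0) {
            return -1.0;
        }
        return 100.0 * usedBytes / maxBytes;
    }

    /** Total number of collections across all collectors since JVM start. */
    static long totalGcCount() {
        long total = 0;
        for (GarbageCollectorMXBean gc : ManagementFactory.getGarbageCollectorMXBeans()) {
            long count = gc.getCollectionCount(); // -1 means "undefined" for this collector
            if (count > 0) {
                total += count;
            }
        }
        return total;
    }

    /** One line summarising current heap usage and the GC count so far. */
    static String sample() {
        MemoryUsage heap = ManagementFactory.getMemoryMXBean().getHeapMemoryUsage();
        return String.format("heapUsed=%dMB (%.1f%% of max) gcCount=%d",
                heap.getUsed() >> 20,
                usedPercent(heap.getUsed(), heap.getMax()),
                totalGcCount());
    }

    public static void main(String[] args) throws InterruptedException {
        // Print a sample once per second for five seconds.
        for (int i = 0; i < 5; i++) {
            System.out.println(sample());
            Thread.sleep(1000);
        }
    }
}
```

A heap that climbs steadily and then drops sharply when the GC count increases would match the pattern described above. The attached GC logs and VisualVM screenshots remain the actual evidence for this issue.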

                
> Perform Kuromoji/Japanese stability test before 3.6 freeze
> ----------------------------------------------------------
>
>                 Key: SOLR-3282
>                 URL: https://issues.apache.org/jira/browse/SOLR-3282
>             Project: Solr
>          Issue Type: Task
>          Components: Schema and Analysis
>    Affects Versions: 3.6, 4.0
>            Reporter: Christian Moen
>            Assignee: Christian Moen
>         Attachments: 250k-queries-no-highlight-gc.log, 250k-queries-no-highlight-visualvm.png, 62k-queries-highlight-gc.log, 62k-queries-highlight-visualvm.png, jawiki-index-gc.log, jawiki-index-gcviewer.png, jawiki-index-visualvm.png, long-query-indexing-gc.log, long-search-indexing-visualvm.png
>
>
> Kuromoji might be used by many and also in mission critical systems.  I'd like to run a stability test before we freeze 3.6.
> My thinking is to test the out-of-the-box configuration using fieldtype {{text_ja}} as follows:
> # Index all Japanese Wikipedia documents (approx. 1.4M documents) in a never-ending loop
> # Simultaneously run many tens of thousands of typical Japanese queries against the index at 3-5 queries per second with highlighting turned on
> While Solr is indexing and searching, I'd like to verify that:
> * Indexing and queries are working as expected
> * Memory and heap usage look stable over time
> * Garbage collection is overall low over time -- no Full-GC issues
> I'll post findings and results to this JIRA.
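
The query side of the test plan quoted above (a steady 3-5 queries per second) could be paced along the following lines. This is only a sketch under stated assumptions, not the driver actually used for this issue: the class {{QueryPacer}} and its methods are hypothetical, and the {{Runnable}} passed in stands for whatever code sends one query to Solr.

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical load driver: runs a query task at a fixed rate for a fixed duration.
public class QueryPacer {

    /** Delay between query starts, in milliseconds, for the requested rate. */
    static long periodMillis(double queriesPerSecond) {
        if (queriesPerSecond <= 0) {
            throw new IllegalArgumentException("rate must be positive");
        }
        return Math.round(1000.0 / queriesPerSecond);
    }

    /** Runs the query task at the given rate and returns how many runs were started and finished. */
    static int runPaced(final Runnable query, double queriesPerSecond, long durationMillis)
            throws InterruptedException {
        final AtomicInteger issued = new AtomicInteger();
        ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
        scheduler.scheduleAtFixedRate(new Runnable() {
            public void run() {
                try {
                    query.run();
                } catch (RuntimeException e) {
                    // Swallow failures: an uncaught exception would cancel all later runs.
                }
                issued.incrementAndGet();
            }
        }, 0, periodMillis(queriesPerSecond), TimeUnit.MILLISECONDS);

        Thread.sleep(durationMillis);
        scheduler.shutdownNow();
        scheduler.awaitTermination(5, TimeUnit.SECONDS);
        return issued.get();
    }

    public static void main(String[] args) throws InterruptedException {
        // Placeholder task; a real test would send a highlighted query to Solr here.
        Runnable placeholderQuery = new Runnable() {
            public void run() {
                System.out.println("query sent at " + System.currentTimeMillis());
            }
        };
        int count = runPaced(placeholderQuery, 4.0, 2000);
        System.out.println("issued " + count + " queries");
    }
}
```

A single scheduler thread keeps the rate steady only while each query finishes within one period. If queries take longer than that, the achieved rate drops, so a real driver would likely need several threads.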
