You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Jason Rutherglen (JIRA)" <ji...@apache.org> on 2009/03/27 21:04:50 UTC

[jira] Updated: (LUCENE-1577) Benchmark of different in RAM realtime techniques

     [ https://issues.apache.org/jira/browse/LUCENE-1577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jason Rutherglen updated LUCENE-1577:
-------------------------------------

    Attachment: LUCENE-1577.patch

This patch performs a benchmark of 3 different techniques for RAM based realtime indexing where after an update, the new document is searchable.  It performs multiple rounds of indexing and calculates the percentage difference with fastest of each of the 3 techniques.  The document source is the Wikipedia English XML used by contrib/benchmark.

* RealtimeWriter uses InstantiatedIndex
* LuceneWriter adds documents to an IndexWriter
* LuceneRealtimeWriter creates a RAMDirectory, opens an IndexWriter, adds a document, then closes the writer.

I found it odd that RealtimeWriter is faster than LuceneWriter and so perhaps the benchmark is incorrect somehow.  Otherwise the results look highly promising in that we can implement realtime search with no impact to existing indexing performance.  

Summary of the results:

numRounds:3 docs indexed:50000
lowest of each, percent compared with lowest
RealtimeWriter:7597 dif:0% 
LuceneWriter:12940 dif:70%
LuceneRealtimeWriter:25882 dif:241%


> Benchmark of different in RAM realtime techniques
> -------------------------------------------------
>
>                 Key: LUCENE-1577
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1577
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: contrib/*
>    Affects Versions: 2.4.1
>            Reporter: Jason Rutherglen
>            Priority: Minor
>             Fix For: 2.9
>
>         Attachments: LUCENE-1577.patch
>
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> A place to post code that benchmarks the differences in the speed of indexing and searching using different realtime techniques.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org