You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Ignacio Vera (JIRA)" <ji...@apache.org> on 2018/08/09 08:22:00 UTC
[jira] [Created] (LUCENE-8452) BKD-based shape indexing benchmarks
Ignacio Vera created LUCENE-8452:
------------------------------------
Summary: BKD-based shape indexing benchmarks
Key: LUCENE-8452
URL: https://issues.apache.org/jira/browse/LUCENE-8452
Project: Lucene - Core
Issue Type: Improvement
Components: modules/sandbox
Reporter: Ignacio Vera
Initial benchmarking of the new BKD-based shape indexing suggest that searches can be somewhat under-performing. I open this ticket to share the findings and to open a discussion how to speed up the solution.
The first benchmark is done by using the current benchmark in luceneutils for indexing points and search by bounding box. We would expect {{LatLonShape}} to be slower that {{LatLonPoint}} but still having a good performance. The results of running such benchmark in my computer looks like:
LatLonPoint:
89.717239531 sec to index
INDEX SIZE: 0.5087761553004384 GB
READER MB: 0.6098232269287109
maxDoc=60844404
totHits=221118844
BEST M hits/sec: 72.91056132596746
BEST QPS: 74.19031323419311
LatLonShape:
89.388678805 sec to index
INDEX SIZE: 1.3028179928660393 GB
READER MB: 0.8827085494995117
maxDoc=60844404
totHits=221118844
BEST M hits/sec: 1.0053836784184809
BEST QPS: 1.0230305276205143
A second benchmark has been performed indexing around 10 million 4-side polygons and around 3 million points. Searches are performed using bounding boxes. The results are compared with spatial trees alternatives. Spatial trees use a composite strategy, precision=0.001 degrees and distErrPct=0.25:
s2 (Geo3d):
1191.732124301 sec to index part 0
INDEX SIZE: 3.2086284114047885 GB
READER MB: 19.453557014465332
maxDoc=12949519
totHits=705758537
BEST M hits/sec: 13.311369588840462
BEST QPS: 4.243743434150063
quad (JTS):
3252.62925159 sec to index part 0
INDEX SIZE: 4.5238002222031355 GB
READER MB: 41.15725612640381
maxDoc=12949519
totHits=705758357
BEST M hits/sec: 35.54591930673003
BEST QPS: 11.332252412866938
LatLonShape:
30.32712009 sec to index part 0
INDEX SIZE: 0.5627057952806354 GB
READER MB: 0.29498958587646484
maxDoc=12949519
totHits=705758228
BEST M hits/sec: 3.4130465326433357
BEST QPS: 1.0880999177593018
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org