You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by GitBox <gi...@apache.org> on 2019/04/18 20:28:46 UTC

[GitHub] [lucene-solr] johann-beessip opened a new pull request #651: Performance improvement for update?optimize=true

johann-beessip opened a new pull request #651: Performance improvement for update?optimize=true
URL: https://github.com/apache/lucene-solr/pull/651
 
 
   For our use case we have a small number of documents with a large number of docValues.  During optimize we noticed a lot of time being spent in the FilterFieldInfor constructor, on the filterFields.contains(...) function.  
   
   The small change from an ArrayList to a LinkedHashSet reduced our optimize time from 60min to < 1min.
   
   This was added to branch 7_2 as that's the version we're still using, but the issue is still present in master:
   https://github.com/apache/lucene-solr/blob/master/lucene/core/src/java/org/apache/lucene/codecs/perfield/PerFieldDocValuesFormat.java#L142
   
   
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org