Posted to solr-user@lucene.apache.org by Blackknight <Iv...@gmail.com> on 2018/02/01 13:33:07 UTC

Save the document size into a new field

Hello guys,

I want to add an option to search documents by size, for example to find the
top categories with the biggest documents. I thought about creating a new
update processor which would count the bytes of all fields in the document,
but I don't think it will work well: some fields are stored, some are
indexed, some of them have both flags, and there are copyFields too, which
also need to be counted...
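
To make the idea concrete, here is a rough sketch of the counting step only,
written in plain Python rather than as a real Solr update processor. The
function name estimate_doc_size and the field name doc_size_bytes are
hypothetical, and the document is assumed to be a simple dict of field name
to value (or list of values). It just sums the UTF-8 bytes of the raw input
values, so it knows nothing about stored/indexed flags, copyFields, or
compression:

```python
def estimate_doc_size(doc):
    """Sum the UTF-8 byte length of every field value in a document dict.

    Assumption: `doc` maps field names to a single value or a list of values.
    Field names themselves are not counted.
    """
    total = 0
    for value in doc.values():
        # Treat single-valued and multi-valued fields the same way.
        values = value if isinstance(value, (list, tuple)) else [value]
        for v in values:
            if v is None:
                continue
            if isinstance(v, bytes):
                total += len(v)
            else:
                total += len(str(v).encode("utf-8"))
    return total


# Hypothetical usage: compute the size before adding the size field itself.
doc = {"id": "42", "title": "Solr", "tags": ["a", "bc"]}
doc["doc_size_bytes"] = estimate_doc_size(doc)
```

A number like this only reflects what was sent to the index, not what ends
up on disk.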
 
So I think a field size counter in an update processor will lie about the
doc size. I am not taking into account the compression of the index on
disk, but I want to get real numbers (I can accept an error of about 10%).
 
Does anyone know what I should do?

I read some posts about saving the size (in bytes) of a document, but all of
them were relatively old and had no solution. Maybe Solr has new techniques
for document size counting? :)

Thank you, guys! 


