You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2023/01/09 23:06:08 UTC

[GitHub] [iceberg] RussellSpitzer commented on issue #6549: Collecting Iceberg NDV Statistics for Spark Engine

RussellSpitzer commented on issue #6549:
URL: https://github.com/apache/iceberg/issues/6549#issuecomment-1376460828

   I think while it may be helpful to collect sketches at write time, for older tables and for a POC I think we should start with just an "analyze" like procedure that just uses a specific snapshots and generates a puffin file with all the expected NDV stats for the entire snapshot.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org