Posted to issues@iceberg.apache.org by GitBox <gi...@apache.org> on 2022/02/05 00:53:34 UTC

[GitHub] [iceberg] huaxingao opened a new issue #4046: push down aggregation (min/max/count) to iceberg scan

huaxingao opened a new issue #4046:
URL: https://github.com/apache/iceberg/issues/4046


   Currently, a query such as SELECT max(col), min(col), count(col) FROM table scans all the data files and then computes the aggregates. Ideally, we should avoid the full scan and answer max/min/count from the per-file metrics (statistics) alone.
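
   A minimal sketch of the idea, assuming each data file exposes a record count, a null count, and lower/upper bounds for the column. `FileMetrics` and `aggregate_from_metrics` are hypothetical names used for illustration only; they are not Iceberg classes, and the sketch assumes the bounds are exact (not truncated).

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class FileMetrics:
    """Hypothetical per-file statistics for one column (not an Iceberg class)."""
    record_count: int
    null_count: int
    lower_bound: Optional[int]  # None when the file holds no non-null values
    upper_bound: Optional[int]


def aggregate_from_metrics(
    files: Sequence[FileMetrics],
) -> Tuple[Optional[int], Optional[int], int]:
    """Return (min, max, count of non-null values) without reading any rows."""
    lowers = [f.lower_bound for f in files if f.lower_bound is not None]
    uppers = [f.upper_bound for f in files if f.upper_bound is not None]
    # COUNT(col) ignores nulls, so subtract the null count per file.
    count = sum(f.record_count - f.null_count for f in files)
    return (
        min(lowers) if lowers else None,
        max(uppers) if uppers else None,
        count,
    )


files = [
    FileMetrics(record_count=100, null_count=0, lower_bound=3, upper_bound=40),
    FileMetrics(record_count=50, null_count=5, lower_bound=-7, upper_bound=12),
]
result = aggregate_from_metrics(files)
```

   The global min is the smallest per-file lower bound, the global max is the largest per-file upper bound, and the count is the sum of the non-null counts, so no data file needs to be opened.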


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org


[GitHub] [iceberg] huaxingao commented on issue #4046: push down aggregation (min/max/count) to iceberg scan

Posted by GitBox <gi...@apache.org>.
huaxingao commented on issue #4046:
URL: https://github.com/apache/iceberg/issues/4046#issuecomment-1030470872


   cc @RussellSpitzer @flyrain 


[GitHub] [iceberg] Zhangg7723 commented on issue #4046: push down aggregation (min/max/count) to iceberg scan

Posted by GitBox <gi...@apache.org>.
Zhangg7723 commented on issue #4046:
URL: https://github.com/apache/iceberg/issues/4046#issuecomment-1035112432


   Iceberg already provides a metrics API for data files. I think the pushdown should be implemented in the SQL engine, perhaps by building a plan with a data source that collects the statistics.


[GitHub] [iceberg] flyrain commented on issue #4046: push down aggregation (min/max/count) to iceberg scan

Posted by GitBox <gi...@apache.org>.
flyrain commented on issue #4046:
URL: https://github.com/apache/iceberg/issues/4046#issuecomment-1046430517


   As @RussellSpitzer mentioned in https://apache-iceberg.slack.com/archives/C025PH0G1D4/p1645289087247769, merge-on-read (MOR) deletes make the metrics unreliable. In that case, we still need to read the data files.
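
   A rough sketch of that fallback, assuming the planner knows how many delete files apply to each data file. `PlannedFile`, `can_use_metrics`, `count_rows`, and the `scan_rows` callback are all hypothetical names for illustration, not Iceberg or engine APIs.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class PlannedFile:
    """Hypothetical planning record for one data file (not an Iceberg class)."""
    path: str
    record_count: int       # row count recorded in the file metrics
    delete_file_count: int  # MOR delete files that apply to this data file


def can_use_metrics(files: Sequence[PlannedFile]) -> bool:
    """Metrics are only trusted when no delete files apply to any data file."""
    return all(f.delete_file_count == 0 for f in files)


def count_rows(
    files: Sequence[PlannedFile],
    scan_rows: Callable[[PlannedFile], int],
) -> int:
    """Answer COUNT(*) from metrics when safe, otherwise read the data files."""
    if can_use_metrics(files):
        return sum(f.record_count for f in files)
    # Deleted rows are still included in record_count, so fall back to a scan.
    return sum(scan_rows(f) for f in files)


# Stand-in for a real scan: live row counts after deletes are applied.
live_rows = {"a.parquet": 10, "b.parquet": 20, "c.parquet": 18}
scanned = []


def fake_scan(f: PlannedFile) -> int:
    scanned.append(f.path)
    return live_rows[f.path]


clean = [PlannedFile("a.parquet", 10, 0), PlannedFile("b.parquet", 20, 0)]
dirty = [PlannedFile("a.parquet", 10, 0), PlannedFile("c.parquet", 20, 1)]

clean_count = count_rows(clean, fake_scan)  # answered from metrics only
scans_after_clean = list(scanned)
dirty_count = count_rows(dirty, fake_scan)  # falls back to reading the files
```

   The same check would apply to min/max: a delete may remove the row that holds the recorded bound, so the bounds cannot be trusted either once delete files are present.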

