Posted to user@hive.apache.org by Owen Thomas <ow...@xpldr.com> on 2016/10/03 23:45:08 UTC

StorageHandler and Hive Cost Based Optimization

Hi,

I'd like to know if it's possible for a StorageHandler-backed table to
contribute more than table-wide statistics during CBO analysis.
Specifically, with native partitioned tables, CBO filters out partitions
not involved in the query. Is it possible to do something for CBO similar
to the StorageHandler's predicate push-down optimization?

Ideally, I'd like an interface that lets me generate a Statistics object
for a Table and a given predicate. This would mirror
HiveStoragePredicateHandler, but for statistics generation.
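To make the request concrete, here is a minimal sketch of the kind of interface I have in mind. Every name in it (PredicateStatsProvider, TableStats, Chunk, ChunkedStatsProvider) is hypothetical and not part of Hive. The predicate is modelled as a plain java.util.function.Predicate over an integer key rather than Hive's own expression classes, and the "storage" is just an in-memory map, so this only illustrates the shape of the idea.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;

public class PredicateStatsSketch {

    /** Hypothetical stand-in for a statistics object: row count and data size only. */
    static final class TableStats {
        final long numRows;
        final long dataSizeBytes;

        TableStats(long numRows, long dataSizeBytes) {
            this.numRows = numRows;
            this.dataSizeBytes = dataSizeBytes;
        }
    }

    /** Hypothetical interface: statistics for a table, narrowed by a predicate. */
    interface PredicateStatsProvider<K> {
        TableStats estimate(String tableName, Predicate<K> predicate);
    }

    /** Toy storage unit (assumption): the backing store keeps per-key-chunk counts. */
    static final class Chunk {
        final int key;
        final long numRows;
        final long dataSizeBytes;

        Chunk(int key, long numRows, long dataSizeBytes) {
            this.key = key;
            this.numRows = numRows;
            this.dataSizeBytes = dataSizeBytes;
        }
    }

    /** Toy provider: sums the stats of only those chunks whose key matches the predicate. */
    static final class ChunkedStatsProvider implements PredicateStatsProvider<Integer> {
        private final Map<String, List<Chunk>> chunksByTable;

        ChunkedStatsProvider(Map<String, List<Chunk>> chunksByTable) {
            this.chunksByTable = chunksByTable;
        }

        @Override
        public TableStats estimate(String tableName, Predicate<Integer> predicate) {
            long rows = 0;
            long bytes = 0;
            // An unknown table contributes no chunks, so its estimate is zero.
            List<Chunk> chunks =
                chunksByTable.getOrDefault(tableName, Collections.<Chunk>emptyList());
            for (Chunk c : chunks) {
                if (predicate.test(c.key)) {
                    rows += c.numRows;
                    bytes += c.dataSizeBytes;
                }
            }
            return new TableStats(rows, bytes);
        }
    }

    public static void main(String[] args) {
        Map<String, List<Chunk>> store = new HashMap<>();
        store.put("events", Arrays.asList(
            new Chunk(1, 100, 10_000),
            new Chunk(2, 20, 2_000),
            new Chunk(3, 30, 3_000)));

        PredicateStatsProvider<Integer> provider = new ChunkedStatsProvider(store);

        // Only chunks with key >= 2 are counted, analogous to partition filtering.
        TableStats stats = provider.estimate("events", k -> k >= 2);
        System.out.println("rows=" + stats.numRows + " bytes=" + stats.dataSizeBytes);
        // prints "rows=50 bytes=5000"
    }
}
```

With a hook like this, the optimizer could ask the handler for statistics that already reflect the query's filter, instead of being handed one table-wide estimate.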

Is there an approach for doing this, or do I need to fall back to native
tables and partitions?

Apologies if this question has been asked before or if this isn't the
right list for it.

Regards,

Owen.