You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hive.apache.org by "Siddharth Seth (JIRA)" <ji...@apache.org> on 2016/03/09 23:27:40 UTC

[jira] [Commented] (HIVE-13250) Compute predicate conversions on the client, instead of per row group

    [ https://issues.apache.org/jira/browse/HIVE-13250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15188204#comment-15188204 ] 

Siddharth Seth commented on HIVE-13250:
---------------------------------------

cc [~ashutoshc], [~prasanth_j] - this was initially filed for Orc split elimination and partition pruning. [~ashutoshc] mentioned that CBO may be affected as well.

> Compute predicate conversions on the client, instead of per row group
> ---------------------------------------------------------------------
>
>                 Key: HIVE-13250
>                 URL: https://issues.apache.org/jira/browse/HIVE-13250
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Siddharth Seth
>
> When running a query for the form 
> select count from table where ts_field = "2016-01-23 00:00:00";
> or
> select count from table where ts_field = 1453507200
> ts_field is of type TIMESTAMP
> The predicate is converted to whatever format is appropriate for TIMESTAMP processing on each and every row group.
> It would be far more efficient to process this once on the client - or even once per task.
> The same applies to ORC splt elimination as well - this is applied for each stripe.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)