You are viewing a plain text version of this content. The canonical link for it is here.

Posted to user@hive.apache.org by Maciek <ma...@sonra.io> on 2014/11/18 21:37:47 UTC

Can CLUSTERED BY/SORTED BY speed up processing aggregations or inter-row calculations?

Can Hive optimizer take advantage of tables CLUSTERED BY … SORTED BY … when
performing aggregations or analytic functions (window aggregations)?
If so, how?

more details at SO: optimize Hive table storage for subsequent Aggregations
and/or Window Analytic Functions
<http://stackoverflow.com/questions/27001856>