You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Sergey Shelukhin (JIRA)" <ji...@apache.org> on 2014/12/03 03:02:12 UTC

[jira] [Created] (HIVE-9010) introduce Hive perf configuration utility

Sergey Shelukhin created HIVE-9010:
--------------------------------------

             Summary: introduce Hive perf configuration utility
                 Key: HIVE-9010
                 URL: https://issues.apache.org/jira/browse/HIVE-9010
             Project: Hive
          Issue Type: Improvement
          Components: Configuration
            Reporter: Sergey Shelukhin


Recently, many major perf features have been added (or are being added) to Hive, such as vectorization, CBO, Tez, Spark, etc.
Customers using the Apache distribution may not be aware of these, and may not take advantage of all the improvement Hive can offer. That may also apply to other config settings.

We can create a Hive perf configuration utility that will be able to set 6-10 important, easy-to-set settings. It can be used by admins or users when deploying Hive or on an existing cluster. Ideally all the no-brainer set-to-true settings would be there, with caveats, if any, described; some other ones may be, too, but we don't want to add any options for tuning because the whole point is to make it not confusing (as compared to editing the entire config file). Unless we have automatic tuning at some point, the users doing perf tuning can edit the config file manually after reading the docs.

Then we can mention it prominently in the docs and release notes. This should go a long way towards making sure users can utilize Hive to its full potential, without us enabling large/perf features by default, at least until they are stable (e.g. CBO can be enabled by default, so this tool may note that).

Experimental feature settings (true/false or simple) can also be added in a separate section.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)