You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@impala.apache.org by Thomas Tauber-Marshall <tm...@cloudera.com> on 2017/06/05 21:02:34 UTC

Hive Schema Upgrade Required

I have two patches going out soon:
https://gerrit.cloudera.org/#/c/7072/
https://gerrit.cloudera.org/#/c/7076/
which combined update Impala's Hive dependency to a new version with an
updated schema. After pulling in these changes, you should find that the
hive metastore will fail to come up.

To fix this without doing a full data reload, you can use the following
command:
${IMPALA_TOOLCHAIN}/cdh_components/hive-1.1.0-cdh5.13.0-SNAPSHOT/bin/schematool
-upgradeSchema -dbType {type}
where type is one of 'postgres' or 'mysql', depending on your setup.

You'll need to pull the latest version of Hive into your toolchain, such as
by running buildall.sh with DOWNLOAD_CDH_COMPONENTS=true

If you see:
Error: ERROR: column "SCHEMA_VERSION_V2" of relation "VERSION" does not
exist (state=42703,code=0)
in the schematool output, this is expected and not really an error.

There is a JIRA for automating this process in the future:
https://issues.apache.org/jira/browse/IMPALA-5430