You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "ASF subversion and git services (Jira)" <ji...@apache.org> on 2022/07/19 11:02:00 UTC

[jira] [Commented] (IMPALA-10871) Add MetastoreShim to support Apache Hive 3.x

    [ https://issues.apache.org/jira/browse/IMPALA-10871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17568478#comment-17568478 ] 

ASF subversion and git services commented on IMPALA-10871:
----------------------------------------------------------

Commit 1d16367afb1e520bae5992c1e6c92b4fb41d2655 in impala's branch refs/heads/master from Fucun Chu
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=1d16367af ]

IMPALA-10871 (part 2): Apache Hive 3: fixes for dataset loading

This patch fixes the data loading problem of integrating Apache Hive 3
and switches to the tez engine.

Add HIVE-21569, HIVE-20038 patches and recompile the hive-exec module.

Testing:
- Manually perform data loading steps.

Change-Id: I86a1fdffc70b8d9a3bc97a72b5b939021dc496f1
Reviewed-on: http://gerrit.cloudera.org:8080/18028
Reviewed-by: Impala Public Jenkins <im...@cloudera.com>
Tested-by: Impala Public Jenkins <im...@cloudera.com>


> Add MetastoreShim to support Apache Hive 3.x
> --------------------------------------------
>
>                 Key: IMPALA-10871
>                 URL: https://issues.apache.org/jira/browse/IMPALA-10871
>             Project: IMPALA
>          Issue Type: Task
>            Reporter: Quanlong Huang
>            Assignee: Fucun Chu
>            Priority: Major
>
> Impala 3.x supports both Hive 2 and 3 by a MetastoreShim that is chose at compilation time:
> {code:bash}
> fe/src/compat-hive-2
> └── java/org/apache
>             ├── hadoop/hive/common/ValidWriteIdList.java
>             └── impala/compat/MetastoreShim.java
> fe/src/compat-hive-3
> └── java/org/apache/impala/compat
>                     ├── HiveMetadataFormatUtils.java
>                     └── MetastoreShim.java {code}
> In pom.xml:
> {code:xml}
>       <plugin>
>         <groupId>org.codehaus.mojo</groupId>
>         <artifactId>build-helper-maven-plugin</artifactId>
>         <version>1.5</version>
>         <executions>
>           <!-- Tell maven about our generated files -->
>           <execution>
>             <id>add-source</id>
>             <phase>generate-sources</phase>
>             <goals>
>               <goal>add-source</goal>
>             </goals>
>             <configuration>
>               <sources>
>                 ...
>                 <source>${project.basedir}/src/compat-hive-${hive.major.version}/java</source>
>                </sources>
>             </configuration>
>           </execution>
>         </executions>
>       </plugin>
> {code}
> Now MetastoreShim evolves to only support CDP Hive which contains some breaking changes of Hive 4.x.
> To support Apache Hive 3.x, we need another MetastoreShim. Some codes that directly use Hive 4 APIs may need to be ignored in compilation, e.g. fe/src/main/java/org/apache/impala/catalog/metastore/*.java



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org