You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "He Qi (Jira)" <ji...@apache.org> on 2023/07/24 04:42:00 UTC
[jira] [Commented] (SPARK-44518) Completely make hive as a data source
[ https://issues.apache.org/jira/browse/SPARK-44518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17746191#comment-17746191 ]
He Qi commented on SPARK-44518:
-------------------------------
[~LuciferYang] [~yumwang] [~Qin Yao] [~csun] WDYT?
> Completely make hive as a data source
> -------------------------------------
>
> Key: SPARK-44518
> URL: https://issues.apache.org/jira/browse/SPARK-44518
> Project: Spark
> Issue Type: Sub-task
> Components: SQL
> Affects Versions: 3.5.0
> Reporter: He Qi
> Priority: Major
> Fix For: 4.0.0
>
>
> Now, hive is a different data source from other data sources. In Spark Project, Hive have many special logic and burden the cost of maintenance . Like presto, hive is only a connector. Is it possible that we canĀ make hive as a data source completely?
> Surely, I know that it's very difficult. It has many historical problems and compatible problems. Could we reduce these problems as possible as we can if we release 4.0?
> I just wanna start a discussion to collect more people's suggestion. Any suggestion is welcome. I just feel 4.0 is a good opportunity to discuss this issue.
> If I am wrong, it's welcome to point it out.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org