Posted to issues@spark.apache.org by "He Qi (Jira)" <ji...@apache.org> on 2023/07/24 04:42:00 UTC

[jira] [Commented] (SPARK-44518) Completely make hive as a data source

    [ https://issues.apache.org/jira/browse/SPARK-44518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17746191#comment-17746191 ] 

He Qi commented on SPARK-44518:
-------------------------------

[~LuciferYang] [~yumwang] [~Qin Yao] [~csun] WDYT?

> Completely make hive as a data source
> -------------------------------------
>
>                 Key: SPARK-44518
>                 URL: https://issues.apache.org/jira/browse/SPARK-44518
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>    Affects Versions: 3.5.0
>            Reporter: He Qi
>            Priority: Major
>             Fix For: 4.0.0
>
>
> Currently, Hive is treated differently from other data sources. The Spark project contains a lot of Hive-specific logic, which adds to the maintenance cost. In Presto, by contrast, Hive is just a connector. Is it possible to make Hive a data source like any other?
> I know that this is very difficult. There are many historical and compatibility problems. Could we reduce these problems as much as possible for the 4.0 release?
> I just want to start a discussion and collect suggestions from more people. Any suggestion is welcome. I think 4.0 is a good opportunity to discuss this issue.
> If I am wrong, please point it out.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)