You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "He Qi (Jira)" <ji...@apache.org> on 2023/07/24 04:26:00 UTC

[jira] [Created] (SPARK-44518) Completely make hive as a data source

He Qi created SPARK-44518:
-----------------------------

             Summary: Completely make hive as a data source
                 Key: SPARK-44518
                 URL: https://issues.apache.org/jira/browse/SPARK-44518
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
    Affects Versions: 3.5.0
            Reporter: He Qi
             Fix For: 4.0.0


Now, hive is a different data source from other data sources. In Spark Project, Hive have many special logic and burden the cost of maintenance . Like presto, hive is only a connector. Is it possible that we can  make hive as a data source completely?

Surely, I know that it's very difficult. It has many historical problems and compatible problems. Could we reduce these problems as possible as we can if we release 4.0?

I just wanna start a discussion to collect more people's suggestion. Any suggestion is welcome. I just feel 4.0 is a good opportunity to discuss this issue.

If I am wrong, it's welcome to point it out.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org