You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Herman van Hövell (Jira)" <ji...@apache.org> on 2022/12/22 18:20:00 UTC
[jira] [Created] (SPARK-41690) Introduce AgnosticEncoders
Herman van Hövell created SPARK-41690:
-----------------------------------------
Summary: Introduce AgnosticEncoders
Key: SPARK-41690
URL: https://issues.apache.org/jira/browse/SPARK-41690
Project: Spark
Issue Type: Task
Components: Connect, SQL
Affects Versions: 3.4.0
Reporter: Herman van Hövell
Assignee: Herman van Hövell
For the Spark Connect Scala Client we need Encoders. We cannot use the current ExpressionEncoder infrastructure because of the following reasons:
* Connect uses Arrow instead of internal rows
* We don't want to have a dependency on Catalyst in the client.
* We don't want to be sending the expressions that expression encoders use over the wire.
An additional challenge is that we cannot send TypeTags over the wire.
The proposal is to create an intermediate encoder. This encoder describes exactly how which external type maps to an internal type, but it does not prescribe which format is used.
An upside of this approach is that we can reduce the cost of encoder creation by quite a margin.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org