You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "xiaoli (Jira)" <ji...@apache.org> on 2021/10/18 04:05:00 UTC

[jira] [Created] (SPARK-37034) What's the progress of vectorized execution for spark?

xiaoli created SPARK-37034:
------------------------------

             Summary: What's the progress of vectorized execution for spark?
                 Key: SPARK-37034
                 URL: https://issues.apache.org/jira/browse/SPARK-37034
             Project: Spark
          Issue Type: New Feature
          Components: Spark Core
    Affects Versions: 3.2.0
            Reporter: xiaoli


Spark has support vectorized read for ORC and parquet. What's the progress of other vectorized execution, e.g. vectorized write,  join, aggr, simple operator (string function, math function)? 

Hive support vectorized execution in [early version|[https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution]|https://cwiki.apache.org/confluence/display/hive/vectorized+query+execution].] As we know, Spark is replacement of Hive. I guess the reason why Spark does not support vectorized execution maybe the difficulty of design or implementation in Spark is larger. What's the main issue for Spark to support vectorized execution?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org