You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Yaohua Zhao (Jira)" <ji...@apache.org> on 2022/01/13 11:26:00 UTC

[jira] [Created] (SPARK-37896) ConstantColumnVector: a column vector with same values

Yaohua Zhao created SPARK-37896:
-----------------------------------

             Summary: ConstantColumnVector: a column vector with same values
                 Key: SPARK-37896
                 URL: https://issues.apache.org/jira/browse/SPARK-37896
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 3.2.0
            Reporter: Yaohua Zhao


Introduce a new column vector named `ConstantColumnVector`, it represents a column vector where every row has the same constant value.

It could help improve performance on hidden file metadata columnar file format, since metadata fields for every row in each file are exactly the same, we don't need to copy and keep multiple copies of data.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org