You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (Jira)" <ji...@apache.org> on 2022/01/13 11:29:00 UTC

[jira] [Commented] (SPARK-37896) ConstantColumnVector: a column vector with same values

    [ https://issues.apache.org/jira/browse/SPARK-37896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17475282#comment-17475282 ] 

Apache Spark commented on SPARK-37896:
--------------------------------------

User 'Yaohua628' has created a pull request for this issue:
https://github.com/apache/spark/pull/35068

> ConstantColumnVector: a column vector with same values
> ------------------------------------------------------
>
>                 Key: SPARK-37896
>                 URL: https://issues.apache.org/jira/browse/SPARK-37896
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 3.2.0
>            Reporter: Yaohua Zhao
>            Priority: Major
>
> Introduce a new column vector named `ConstantColumnVector`, it represents a column vector where every row has the same constant value.
> It could help improve performance on hidden file metadata columnar file format, since metadata fields for every row in each file are exactly the same, we don't need to copy and keep multiple copies of data.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org