You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2018/06/18 00:00:00 UTC
[jira] [Assigned] (SPARK-24576) Upgrade Apache ORC to 1.5.1
[ https://issues.apache.org/jira/browse/SPARK-24576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-24576:
------------------------------------
Assignee: Apache Spark
> Upgrade Apache ORC to 1.5.1
> ----------------------------
>
> Key: SPARK-24576
> URL: https://issues.apache.org/jira/browse/SPARK-24576
> Project: Spark
> Issue Type: Improvement
> Components: Build
> Affects Versions: 2.4.0
> Reporter: Dongjoon Hyun
> Assignee: Apache Spark
> Priority: Major
>
> This issue aims to upgrade Apache ORC library from 1.4.4 to 1.5.1 in order to bring the following benefits into Apache Spark.
> * ORC-91 Support for variable length blocks in HDFS (The current space wasted in ORC to padding is known to be 5%.)
> * ORC-344 Support for using Decimal64ColumnVector
> In addition to that, Apache Hive 3.1.0 will use ORC 1.5.1 (HIVE-19669). This will improve the compatibility between Apache Spark and Apache Hive.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org