You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Cheng Lian (JIRA)" <ji...@apache.org> on 2015/06/12 07:45:00 UTC

[jira] [Resolved] (SPARK-6566) Update Spark to use the latest version of Parquet libraries

     [ https://issues.apache.org/jira/browse/SPARK-6566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Cheng Lian resolved SPARK-6566.
-------------------------------
       Resolution: Fixed
    Fix Version/s: 1.5.0

Issue resolved by pull request 5889
[https://github.com/apache/spark/pull/5889]

> Update Spark to use the latest version of Parquet libraries
> -----------------------------------------------------------
>
>                 Key: SPARK-6566
>                 URL: https://issues.apache.org/jira/browse/SPARK-6566
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 1.3.0
>            Reporter: Konstantin Shaposhnikov
>             Fix For: 1.5.0
>
>
> There are a lot of bug fixes in the latest version of parquet (1.6.0rc7). E.g. PARQUET-136
> It would be good to update Spark to use the latest parquet version.
> The following changes are required:
> {code}
> diff --git a/pom.xml b/pom.xml
> index 5ad39a9..095b519 100644
> --- a/pom.xml
> +++ b/pom.xml
> @@ -132,7 +132,7 @@
>      <!-- Version used for internal directory structure -->
>      <hive.version.short>0.13.1</hive.version.short>
>      <derby.version>10.10.1.1</derby.version>
> -    <parquet.version>1.6.0rc3</parquet.version>
> +    <parquet.version>1.6.0rc7</parquet.version>
>      <jblas.version>1.2.3</jblas.version>
>      <jetty.version>8.1.14.v20131031</jetty.version>
>      <orbit.version>3.0.0.v201112011016</orbit.version>
> {code}
> and
> {code}
> --- a/sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTableOperations.scala
> +++ b/sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTableOperations.scala
> @@ -480,7 +480,7 @@ private[parquet] class FilteringParquetRowInputFormat
>      globalMetaData = new GlobalMetaData(globalMetaData.getSchema,
>        mergedMetadata, globalMetaData.getCreatedBy)
>  
> -    val readContext = getReadSupport(configuration).init(
> +    val readContext = ParquetInputFormat.getReadSupportInstance(configuration).init(
>        new InitContext(configuration,
>          globalMetaData.getKeyValueMetaData,
>          globalMetaData.getSchema))
> {code}
> I am happy to prepare a pull request if necessary.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org