You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2016/03/08 11:16:40 UTC

[jira] [Commented] (SPARK-13728) Fix ORC PPD

    [ https://issues.apache.org/jira/browse/SPARK-13728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15184742#comment-15184742 ] 

Hyukjin Kwon commented on SPARK-13728:
--------------------------------------

I see. I found some clues. It looks https://github.com/apache/spark/pull/11509 PR make the output only single ORC file.

It was 10 files but this PR writes only single file. So, this could not skip stripes in ORC.

Whether it is right to write this to single file or multiple files, I think I should explicitly repartition the data. Can I maybe submit a PR for this?

> Fix ORC PPD
> -----------
>
>                 Key: SPARK-13728
>                 URL: https://issues.apache.org/jira/browse/SPARK-13728
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Michael Armbrust
>
> Fix the ignored test "Enable ORC PPD" in OrcQuerySuite.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org