Posted to user@spark.apache.org by Stephen Boesch <ja...@gmail.com> on 2018/09/12 17:26:27 UTC

Fixing NullType for parquet files

When this JIRA was opened in 2015, Parquet did not support null types.
In May I commented on the JIRA asking whether, given that Parquet now
includes that support, the bug can be reopened. There was no response.
What is the correct way to request that this issue be considered for
reopening?

https://issues.apache.org/jira/browse/SPARK-10943

Michael Armbrust added a comment - 15/Oct/15 18:00

Yeah, parquet doesn't have a concept of null type. I'd probably suggest
they cast null to a type, e.g. CAST(NULL AS INT), if they really want to do
this, but really you should just omit the column probably.
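
The workaround suggested above (cast the null column to a concrete type, or
omit it) can be sketched roughly as follows. This is only an illustration, not
code from the JIRA: the helper name `null_safe_select_exprs` is hypothetical,
and it assumes that NullType columns appear in PySpark's `df.dtypes` with the
type name "null" or "void" depending on the Spark version. It only builds
expression strings, so it runs without a Spark installation.

```python
# Type names assumed to denote NullType in `df.dtypes`
# (assumption: "null" on older Spark releases, "void" on newer ones).
NULL_TYPE_NAMES = {"null", "void"}


def null_safe_select_exprs(dtypes, cast_to="INT", drop=False):
    """Build selectExpr strings that cast or omit NullType columns.

    dtypes: list of (column_name, type_name) pairs, shaped like `df.dtypes`.
    cast_to: SQL type used for the cast when drop is False.
    drop: when True, NullType columns are left out instead of cast.
    """
    exprs = []
    for name, type_name in dtypes:
        quoted = "`{}`".format(name)
        if type_name.lower() not in NULL_TYPE_NAMES:
            # Ordinary column: select it unchanged.
            exprs.append(quoted)
        elif not drop:
            # NullType column: give it a concrete type, keeping its name.
            exprs.append("CAST({0} AS {1}) AS {0}".format(quoted, cast_to))
    return exprs


example = [("id", "bigint"), ("note", "null")]
print(null_safe_select_exprs(example))
print(null_safe_select_exprs(example, drop=True))
```

With a real DataFrame `df`, the result could presumably be passed along as
`df.selectExpr(*null_safe_select_exprs(df.dtypes))` before writing to parquet.
Note that this sketch only looks at top-level columns, so a NullType nested
inside a struct or array would not be caught.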
Daniel Davis added a comment - 03/May/18 10:14

According to the Parquet logical types documentation,
https://github.com/apache/parquet-format/blob/master/LogicalTypes.md, a
Null type should now be supported. So perhaps this issue should be
reconsidered?
Stephen Boesch added a comment - 03/May/18 17:08

Given the comment by Daniel Davis, can this issue be reopened?