You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Dongjoon Hyun (Jira)" <ji...@apache.org> on 2019/12/14 01:18:00 UTC
[jira] [Commented] (SPARK-30249) Invalid Column Names in parquet
tables should not be allowed
[ https://issues.apache.org/jira/browse/SPARK-30249?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16996044#comment-16996044 ]
Dongjoon Hyun commented on SPARK-30249:
---------------------------------------
I believe it's prevented because ORC format doesn't support that.
When you use those column in Parquet file, does Parquet table work incorrectly?
I didn't test it, but It might be a valid format in Parquet file format.
> Invalid Column Names in parquet tables should not be allowed
> ------------------------------------------------------------
>
> Key: SPARK-30249
> URL: https://issues.apache.org/jira/browse/SPARK-30249
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 3.0.0
> Reporter: Rakesh Raushan
> Priority: Minor
>
> Column names such as `a:b` , `??`, `,,`, `^^` , `++`etc are allowed when we are creating parquet tables.
> While when we are creating tables with `orc` all such column names are marked as invalid and analysis exception is thrown.
> These column names should also be not allowed for parquet tables as well.
> Also this induces inconsistency between column names for Parquet and ORC
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org