You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Yan (JIRA)" <ji...@apache.org> on 2016/02/02 09:23:40 UTC

[jira] [Commented] (SPARK-12988) Can't drop columns that contain dots

    [ https://issues.apache.org/jira/browse/SPARK-12988?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15127883#comment-15127883 ] 

Yan commented on SPARK-12988:
-----------------------------

[~marmbrus] For the same reason of "`a.c` is an invalid column name. toDF(...) should not accept that",  can we require that df.drop do not take backtick either because df.drop can only drop top-level columns? Programmatically it makes little difference; but it seems more consistent semantically. 

> Can't drop columns that contain dots
> ------------------------------------
>
>                 Key: SPARK-12988
>                 URL: https://issues.apache.org/jira/browse/SPARK-12988
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.6.0
>            Reporter: Michael Armbrust
>
> Neither of theses works:
> {code}
> val df = Seq((1, 1)).toDF("a_b", "a.c")
> df.drop("a.c").collect()
> df: org.apache.spark.sql.DataFrame = [a_b: int, a.c: int]
> {code}
> {code}
> val df = Seq((1, 1)).toDF("a_b", "a.c")
> df.drop("`a.c`").collect()
> df: org.apache.spark.sql.DataFrame = [a_b: int, a.c: int]
> {code}
> Given that you can't use drop to drop subfields, it seems to me that we should treat the column name literally (i.e. as though it is wrapped in back ticks).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org