You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Dongjoon Hyun (Jira)" <ji...@apache.org> on 2020/03/17 09:50:00 UTC

[jira] [Updated] (SPARK-26354) Ability to return schema prefix before dataframe column names

     [ https://issues.apache.org/jira/browse/SPARK-26354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dongjoon Hyun updated SPARK-26354:
----------------------------------
    Affects Version/s:     (was: 3.0.0)
                       3.1.0

> Ability to return schema prefix before dataframe column names
> -------------------------------------------------------------
>
>                 Key: SPARK-26354
>                 URL: https://issues.apache.org/jira/browse/SPARK-26354
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 3.1.0
>            Reporter: t oo
>            Priority: Major
>
> This query returns dataframe with prdct, prdct, addr, pho :
> select a.prdct, b.prdct, a.addr, b.pho from ac a
> full outer join baa b on a.prdct = b.prdct
>  
> This feature Jira is about having a new config flag (defaulted to false) that would be show.schema.prefix. When true it should return dataframe for above example of a.prdct, b.prdct, a.addr, b.pho. This would help to clearly distinguish origin of the columns with same name in >=2 tables without having to rewrite query to put specific alias ie as a_prdct or as b_prdct. My current use case is loading dataframe into List of Maps in java but it is only taking the first prdct column rather than both prdct columns
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org