You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Paul Rogers (Jira)" <ji...@apache.org> on 2020/04/12 18:20:00 UTC

[jira] [Resolved] (DRILL-7685) Case statement marking column as required in parquet metadata

     [ https://issues.apache.org/jira/browse/DRILL-7685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Paul Rogers resolved DRILL-7685.
--------------------------------
    Resolution: Cannot Reproduce

Tested in Drill 1.18 (snapshot) and found that the provided query works fine. Suggested the user try the newer Drill version.

If you still have a problem please reopen this bug and provide another example so we can locate and fix the issue, if it still exists in the latest code.

> Case statement marking column as required in parquet metadata
> -------------------------------------------------------------
>
>                 Key: DRILL-7685
>                 URL: https://issues.apache.org/jira/browse/DRILL-7685
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Parquet
>    Affects Versions: 1.16.0
>            Reporter: Nitin Pawar
>            Assignee: Paul Rogers
>            Priority: Minor
>
> We use apache drill for multi step processing.
> In one of the steps we have query as below
> ~create table dfs.tmp.`/t2` as select employee_id, case when department_id is not null then 1 else 2 end as case_output from cp.`employee.json`;~
> This provides output as 
> employee_id: OPTIONAL INT64 R:0 D:1
> case_output: REQUIRED INT32 R:0 D:0
> If we remove the end statement from case it does mark the column as optional.
>  
> We feed this output to covariance function and because of this we get an error like below 
> Error: Missing function implementation: [covariance(BIGINT-OPTIONAL, INT-REQUIRED)]. Full expression: --UNKNOWN EXPRESSION--
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)