Posted to dev@drill.apache.org by "Nitin Pawar (Jira)" <ji...@apache.org> on 2020/04/02 04:46:00 UTC

[jira] [Created] (DRILL-7685) Case statement marking column as required in parquet metadata

Nitin Pawar created DRILL-7685:
----------------------------------

             Summary: Case statement marking column as required in parquet metadata
                 Key: DRILL-7685
                 URL: https://issues.apache.org/jira/browse/DRILL-7685
             Project: Apache Drill
          Issue Type: Improvement
          Components: Storage - Parquet
    Affects Versions: 1.16.0
            Reporter: Nitin Pawar


We use Apache Drill for multi-step processing.

In one of the steps we run the following query:

~create table dfs.tmp.`/t2` as select employee_id, case when department_id is not null then 1 else 2 end as case_output from cp.`employee.json`;~

The resulting Parquet file has the following column metadata:

employee_id: OPTIONAL INT64 R:0 D:1
case_output: REQUIRED INT32 R:0 D:0

If we remove the else clause from the case expression, the column is marked as optional.
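
The nullability difference described above is consistent with standard SQL semantics, where a CASE expression with no ELSE branch yields NULL when no WHEN condition matches. The following is a minimal Python sketch of that rule only. The function name case_output_mode is hypothetical, and this is not a description of how Drill itself derives the column mode.

```python
from typing import Optional, Sequence


def case_output_mode(branch_values: Sequence[Optional[int]], has_else: bool) -> str:
    """Return 'REQUIRED' if the CASE result can never be NULL, else 'OPTIONAL'.

    Hypothetical helper for illustration; not part of Drill.
    """
    # Standard SQL: with no ELSE, an unmatched CASE evaluates to NULL.
    if not has_else:
        return "OPTIONAL"
    # A branch that can itself be NULL makes the whole result nullable.
    if any(value is None for value in branch_values):
        return "OPTIONAL"
    return "REQUIRED"


# The query in this report: both branches are non-null literals and ELSE is present.
print(case_output_mode([1, 2], has_else=True))   # REQUIRED
# The same expression with the ELSE branch removed.
print(case_output_mode([1], has_else=False))     # OPTIONAL
```

Under this rule the REQUIRED mode on case_output is a correct statement about the data, so the issue is less about the metadata being wrong and more about what downstream functions accept.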

 

We feed this output to the covariance function, and because case_output is marked REQUIRED we get the following error:

Error: Missing function implementation: [covariance(BIGINT-OPTIONAL, INT-REQUIRED)]. Full expression: --UNKNOWN EXPRESSION--
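
The error message lists both the data type and the mode (OPTIONAL or REQUIRED) of each argument, which suggests that the lookup is keyed on the full signature. The sketch below models that kind of exact-match lookup in Python. The registry contents and the resolve function are assumptions made up for illustration; the actual set of covariance implementations in Drill is not shown in this report.

```python
# Hypothetical registry of (function name, argument signature) pairs.
# These entries are illustrative assumptions, not Drill's real function list.
REGISTERED = {
    ("covariance", ("BIGINT-OPTIONAL", "BIGINT-OPTIONAL")),
    ("covariance", ("INT-OPTIONAL", "INT-OPTIONAL")),
    ("covariance", ("INT-REQUIRED", "INT-REQUIRED")),
}


def resolve(name, arg_types):
    """Look up an implementation by exact name and (type, mode) signature."""
    key = (name, tuple(arg_types))
    if key not in REGISTERED:
        signature = ", ".join(arg_types)
        raise LookupError(f"Missing function implementation: [{name}({signature})]")
    return key


try:
    # Argument signature taken from the error reported above.
    resolve("covariance", ["BIGINT-OPTIONAL", "INT-REQUIRED"])
except LookupError as exc:
    print(exc)
```

With an exact-match lookup like this, a pair of arguments that differ in mode finds no implementation even though each argument on its own is a supported numeric type.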

 


