Posted to issues@drill.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2019/08/05 09:28:00 UTC

[jira] [Commented] (DRILL-7156) Empty Parquet is not getting created if 0 records in result

    [ https://issues.apache.org/jira/browse/DRILL-7156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16899942#comment-16899942 ] 

ASF GitHub Bot commented on DRILL-7156:
---------------------------------------

oleg-zinovev commented on pull request #1836: DRILL-7156: empty parquet files support
URL: https://github.com/apache/drill/pull/1836
 
 
   PR adding read and write support for empty Parquet files in Drill.
   
   Known limitations:
   1) Does not work for Hive Parquet yet
   2) Ignores all schemas except the last one when writing an empty Parquet file
   3) Does not support empty schemas (e.g. create table `..` as select * from `empty.json`, where the file contains e.g. {})
   
   Short changes description:
   1) Parquet footer metadata added
   2) Parquet writer checks that at least 1 row has been written. If not, it creates an empty Parquet file with a footer.
   3) EmptyParquetRowGroupScan and EmptyParquetScanBatchCreator added
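
The writer behaviour described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from the PR: `EmptyFileWriterSketch`, `SketchWriter`, `WrittenFile`, and all method names are hypothetical, and the "file" is an in-memory stand-in rather than real Parquet bytes. It shows the zero-row check on close, the "last schema wins" limitation, and the unsupported empty-schema case.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Optional;

/** Hypothetical sketch; none of these names come from Drill or Parquet. */
public class EmptyFileWriterSketch {

    /** In-memory stand-in for a written file: footer schema plus data rows. */
    static final class WrittenFile {
        final List<String> schema;
        final List<Object[]> rows;

        WrittenFile(List<String> schema, List<Object[]> rows) {
            this.schema = schema;
            this.rows = rows;
        }

        int rowCount() {
            return rows.size();
        }
    }

    /** Writer that still emits a footer-only file when no rows were written. */
    static final class SketchWriter {
        private List<String> lastSchema = new ArrayList<>();
        private final List<Object[]> rows = new ArrayList<>();

        /** Each call overwrites the previous schema, so only the last one survives. */
        void updateSchema(List<String> schema) {
            lastSchema = new ArrayList<>(schema);
        }

        void writeRow(Object... values) {
            rows.add(values);
        }

        /** Returns the written file, or empty if there is no schema to put in a footer. */
        Optional<WrittenFile> close() {
            if (lastSchema.isEmpty()) {
                // Assumption mirroring limitation 3: nothing is written for an empty schema.
                return Optional.empty();
            }
            if (rows.isEmpty()) {
                // Zero rows: produce a file that carries only the footer schema.
                return Optional.of(new WrittenFile(lastSchema, Collections.<Object[]>emptyList()));
            }
            return Optional.of(new WrittenFile(lastSchema, rows));
        }
    }

    public static void main(String[] args) {
        SketchWriter writer = new SketchWriter();
        writer.updateSchema(List.of("a"));
        writer.updateSchema(List.of("id", "name")); // replaces the earlier schema
        WrittenFile file = writer.close().orElseThrow(IllegalStateException::new);
        System.out.println(file.schema + " rows=" + file.rowCount());
    }
}
```

In this sketch a reader can still recover the column names from a zero-row file, which is the property the reported issue asks for.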
   
   Questions:
   1) TestParquetWriterEmptyFiles#testMultipleWriters now creates several empty files but does not fail, since reading empty Parquet files is supported. Should I rewrite the comment or remove the test?
   
 


> Empty Parquet is not getting created if 0 records in result
> -----------------------------------------------------------
>
>                 Key: DRILL-7156
>                 URL: https://issues.apache.org/jira/browse/DRILL-7156
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Parquet
>            Reporter: Sayalee Bhanavase
>            Priority: Blocker
>
> I am creating Parquet tables from joins. If the join returns no records, Drill does not create an empty table, and when I later reuse the table my subsequent script fails.
> Has anyone faced this issue? Any suggestion or workaround?


