You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "Balaji Varadarajan (Jira)" <ji...@apache.org> on 2020/05/27 00:48:00 UTC

[jira] [Commented] (HUDI-242) Support Efficient bootstrap of large parquet datasets to Hudi

    [ https://issues.apache.org/jira/browse/HUDI-242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17117203#comment-17117203 ] 

Balaji Varadarajan commented on HUDI-242:
-----------------------------------------

Cleaned up jiras to reflect the current status. 

Status -5/26 ([~vbalaji], [~uditme] [~wenningd])
 # Udit to make one pass of testing for Spark DataSource for RO view (COW/MOR). Wenning had already done extensive tests of Spark DataSource
 # Wenning to finish up testing Spark-Hive integration testing which is also covered by Balaji
 # Balaji to work on prepping code for review.  

Test Plan being used : [https://docs.google.com/spreadsheets/d/1xVfatk-6-fekwuCCZ-nTHQkewcHSEk89y-ReVV5vHQU/edit#gid=1813901684]

Targeting starting code-review by 05/28-05/29. 

Major Open Items: Presto Support and HUDI-915. 

 

cc [~vinoth]

 

> Support Efficient bootstrap of large parquet datasets to Hudi
> -------------------------------------------------------------
>
>                 Key: HUDI-242
>                 URL: https://issues.apache.org/jira/browse/HUDI-242
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: Usability
>            Reporter: Balaji Varadarajan
>            Assignee: Balaji Varadarajan
>            Priority: Blocker
>             Fix For: 0.6.0
>
>
>  Support Efficient bootstrap of large parquet tables



--
This message was sent by Atlassian Jira
(v8.3.4#803005)