You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2023/02/22 22:37:00 UTC

[jira] [Commented] (HUDI-5828) Support df.write.format("hudi") with out any additional options

    [ https://issues.apache.org/jira/browse/HUDI-5828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17692402#comment-17692402 ] 

sivabalan narayanan commented on HUDI-5828:
-------------------------------------------

looks like having tableName as a mandatory should be ok. For eg, even w/ databases, we have table name as a mandatory field. So, instead of complicating things(having inference of hive sync table name in one flow and not in other), better option would be ask users to set the table name to the minimum. 

 

[~kazdy] : thanks for the pointers. We are taking a look at registering as table as well. 

> Support df.write.format("hudi") with out any additional options
> ---------------------------------------------------------------
>
>                 Key: HUDI-5828
>                 URL: https://issues.apache.org/jira/browse/HUDI-5828
>             Project: Apache Hudi
>          Issue Type: Improvement
>          Components: writer-core
>            Reporter: sivabalan narayanan
>            Priority: Major
>
> Wrt simplifying the usage of hudi among more users, we should try to see if we can support writing to hudi w/o any options during write. 
>  
> For eg, we can do the following with paruqet writes. 
> {code:java}
> df.write.format("parquet").save(path)
> {code}
>  
> So, for a non-partitioned dataset, we should try if we can support this usability. 
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)