Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/08/15 01:11:46 UTC

[GitHub] [hudi] nmukerje commented on issue #3321: [SUPPORT] Setting _hoodie_is_deleted column is not deleting records when using Spark DataSource.

nmukerje commented on issue #3321:
URL: https://github.com/apache/hudi/issues/3321#issuecomment-898979181


   @nsivabalan @codope Worked fine after I cast _hoodie_is_deleted to boolean.
   
   ```
   from pyspark.sql.types import BooleanType
   df1 = df1.withColumn("_hoodie_is_deleted", df1["_hoodie_is_deleted"].cast(BooleanType()))
   ```
   
   Note that this is still a poor experience: I have to write _hoodie_is_deleted as "false" on the initial records for a later delete operation to work. It would be nice if users did not need to set _hoodie_is_deleted to "false" explicitly.
   
   If I do not set _hoodie_is_deleted to "false" on the initial records, I get the following error on delete:
   
   ```
   Caused by: java.lang.RuntimeException: Null-value for required field: _hoodie_is_deleted
   ```
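
A minimal sketch of the workaround described above, assuming that a missing or null _hoodie_is_deleted should be treated as "false" (that default is an assumption, not something Hudi does for you). The helper names `delete_flag_exprs` and `normalize_delete_flag` are hypothetical; the only Spark API used is `DataFrame.columns` and `DataFrame.selectExpr`, with the cast written as a Spark SQL expression instead of `BooleanType()`. Column names are quoted with backticks, which assumes they contain no backticks themselves.

```python
FLAG = "_hoodie_is_deleted"


def delete_flag_exprs(columns, default=False):
    """Build selectExpr expressions that yield a strictly boolean delete flag.

    Existing columns keep their order. If the flag column is present it is
    cast to boolean and nulls are replaced by `default`; if it is absent it
    is appended as a literal.
    """
    literal = "true" if default else "false"
    exprs = []
    for name in columns:
        if name == FLAG:
            exprs.append(
                f"coalesce(cast(`{name}` as boolean), {literal}) as `{name}`"
            )
        else:
            exprs.append(f"`{name}`")
    if FLAG not in columns:
        exprs.append(f"{literal} as `{FLAG}`")
    return exprs


def normalize_delete_flag(df, default=False):
    """Return a DataFrame whose _hoodie_is_deleted column is a non-null boolean."""
    return df.selectExpr(*delete_flag_exprs(df.columns, default))
```

Calling `df1 = normalize_delete_flag(df1)` before the initial write would add the flag as false, and calling it on a later batch would cast a string-typed flag to boolean. For the delete batch itself the flag still has to be set to true on the rows to remove, as in the original issue.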

