You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/04/08 13:25:57 UTC

[GitHub] [spark] gengliangwang opened a new pull request #24318: [SPARK-27407][SQL] File source V2: Invalidate cache data on overwrite/append

gengliangwang opened a new pull request #24318: [SPARK-27407][SQL] File source V2: Invalidate cache data on overwrite/append
URL: https://github.com/apache/spark/pull/24318
 
 
   ## What changes were proposed in this pull request?
   
   File source V2 currently incorrectly continues to use cached data even if the underlying data is overwritten. 
   We should follow https://github.com/apache/spark/pull/13566 and fix it by invalidating and refreshes all the cached data (and the associated metadata) for any Dataframe that contains the given data source path.
   
   
   ## How was this patch tested?
   
   Unit test
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org