You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/01/24 09:43:54 UTC

[GitHub] [arrow-datafusion] Ted-Jiang opened a new issue #1662: Need clean up intermediate data in Ballista

Ted-Jiang opened a new issue #1662:
URL: https://github.com/apache/arrow-datafusion/issues/1662


   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   We need to check whether the states saved in the sled is consumed by UI or not.
   if not consumed by UI, we can clean the job/task data when the SQL is finished.
   
   If they are consumed by UI, we can choose either LRU based policy like Spark or time based eviction policy.
   
   Regarding shuffle files, we also need to implement a way to clean them. This is a little bit complex because we need to clean up the files on all the hosts. We might need to add new RPCs.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [arrow-datafusion] Ted-Jiang commented on issue #1662: Need clean up intermediate data in Ballista

Posted by GitBox <gi...@apache.org>.
Ted-Jiang commented on issue #1662:
URL: https://github.com/apache/arrow-datafusion/issues/1662#issuecomment-1032234517


   related to #1701 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org