You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/03/04 02:39:59 UTC

[GitHub] [arrow-datafusion] yahoNanJing commented on issue #1916: Discussion: Is Ballista a standalone system or framework

yahoNanJing commented on issue #1916:
URL: https://github.com/apache/arrow-datafusion/issues/1916#issuecomment-1058769722


   Actually I think Ballista should act as a distributed computing framework like Spark Core. Like Spark is based on the RDD, Ballista is based on the ExecutionPlan for the DAG.
   
   Based on this framework, Ballista should also include several kinds of deployments. Currently, only standalone mode is provided. In the future, it's possible to introduce more resource managers, like Yarn, Mesos, etc.
   
   For the SQL part, I think it should be an independent part. The core of Ballista should not depend on the SQL. 
   ![Picture1](https://user-images.githubusercontent.com/90197956/156688633-6789dc67-a2c6-444b-9e9d-d4bf415d1c87.png)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org