You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/04/14 18:54:24 UTC

[GitHub] [arrow] james-camacho-ab commented on issue #12892: Arrow install on Databricks cluster takes 10+ minutes

james-camacho-ab commented on issue #12892:
URL: https://github.com/apache/arrow/issues/12892#issuecomment-1099528930

   @wjones127 This is for conjunction with SparkR, essentially taking some Legacy code and optimizing the collects() into R dataframes so we can leverage Delta tables while making minimal changes to the existing codebase. I'm using the CRAN option to install it onto my cluster. Using arrow version 7.0.0. I looked through the driver logs and nothing immediately pops out as being relevant for the arrow install, but I can send those if you want it. Here's my cluster config as well for reference:
   ![image](https://user-images.githubusercontent.com/85585586/163457581-e89e64ca-caef-4e3f-965a-9b27c92c6a09.png)
   ![image](https://user-images.githubusercontent.com/85585586/163457727-0511acce-fbe0-4cd1-ac24-3483513221ee.png)
   ![image](https://user-images.githubusercontent.com/85585586/163457677-a2fbc79f-b0f7-404b-8db5-329673edbd80.png)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org