Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/10/19 13:49:36 UTC

[GitHub] [arrow-datafusion] andygrove opened a new issue, #3892: Improve efficiency of multiple optimizer passes

andygrove opened a new issue, #3892:
URL: https://github.com/apache/arrow-datafusion/issues/3892

   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   Improve efficiency of multiple optimizer passes
   
   **Describe the solution you'd like**
   
   Thanks to @isidentical  for this suggestion.
   
   The optimizer currently decides that it is done by checking whether the last pass changed the plan, based on the plan's Display representation. Instead, it might make sense to compute a unique plan id (bottom up), which could also be used to detect optimization cycles.
   
   A very basic example (assuming each letter is a unique plan id) is A -> B -> C -> A -> B -> ... [repeating until the maximum number of passes is reached]. Even though each plan differs from the previous one, we would still want to exit the loop. With a unique id, we could store the ids seen so far in a set and break out of the loop as soon as `known_plans.contains(new_plan.id)` is true.
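
The set-based check described above could look roughly like the following. This is only a minimal sketch under stated assumptions: `Plan`, `plan_id`, `cyclic_pass`, and `optimize` are hypothetical names invented for illustration and are not DataFusion APIs, and the id is simply a 64-bit hash of a toy plan. Because two different plans could in principle hash to the same 64-bit value, a real implementation might want to store the plans themselves or confirm equality before exiting.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Toy stand-in for a logical plan (hypothetical; not DataFusion's `LogicalPlan`).
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
enum Plan {
    A,
    B,
    C,
}

/// Hypothetical plan id: a 64-bit hash of the whole plan.
fn plan_id(plan: &Plan) -> u64 {
    let mut hasher = DefaultHasher::new();
    plan.hash(&mut hasher);
    hasher.finish()
}

/// A deliberately cyclic "optimizer pass": A -> B -> C -> A.
fn cyclic_pass(plan: &Plan) -> Plan {
    match plan {
        Plan::A => Plan::B,
        Plan::B => Plan::C,
        Plan::C => Plan::A,
    }
}

/// Applies `pass` repeatedly, stopping as soon as it produces a plan whose id
/// has been seen before (a fixed point or a cycle) or after `max_passes`.
/// Returns the last accepted plan and the number of passes that were run.
fn optimize(mut plan: Plan, max_passes: usize, pass: impl Fn(&Plan) -> Plan) -> (Plan, usize) {
    let mut known_plans: HashSet<u64> = HashSet::new();
    known_plans.insert(plan_id(&plan));

    for i in 0..max_passes {
        let new_plan = pass(&plan);
        // `insert` returns false when the id is already in the set.
        if !known_plans.insert(plan_id(&new_plan)) {
            return (plan, i + 1);
        }
        plan = new_plan;
    }
    (plan, max_passes)
}
```

With this sketch, `optimize(Plan::A, 100, cyclic_pass)` stops on the third pass, when the pass produces `A` again, rather than running all 100 passes. A pass that leaves the plan unchanged is caught by the same check after a single pass, so the "did the last pass change anything" test becomes a special case of the set lookup.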
   
   **Describe alternatives you've considered**
   
   **Additional context**
   Discussion at https://github.com/apache/arrow-datafusion/pull/3880/files#r998491734
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-datafusion] isidentical commented on issue #3892: Improve efficiency of multiple optimizer passes

Posted by GitBox <gi...@apache.org>.
isidentical commented on issue #3892:
URL: https://github.com/apache/arrow-datafusion/issues/3892#issuecomment-1284244043

   > Wouldn't this be a hash?
   
   Pretty much! I am not sure whether anything internally would prevent plans from being 'hashable' (as in producing a consistent summary), but if nothing does, we could even implement Rust's native `Hash` trait and just use a hash set.
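
As a rough illustration of the native `Hash` idea, the sketch below derives `Hash` for a toy plan tree and stores whole plans in a `HashSet`. All types here (`Literal`, `Plan`) and the helpers `sample_plan` and `remember` are hypothetical and are not DataFusion's. One general Rust obstacle to "hashability" is that `f64` implements neither `Eq` nor `Hash`, so a type holding a float cannot simply derive them; the sketch assumes it is acceptable to compare and hash floats by their bit pattern, which is a design choice rather than the only option.

```rust
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

/// Hypothetical literal value; not DataFusion's `ScalarValue`.
#[derive(Debug, Clone)]
enum Literal {
    Int(i64),
    Float(f64),
}

// `f64` has no `Eq`, so equality is defined on the bit pattern (an assumption
// of this sketch). This keeps `PartialEq` consistent with `Hash` below.
impl PartialEq for Literal {
    fn eq(&self, other: &Self) -> bool {
        match (self, other) {
            (Literal::Int(a), Literal::Int(b)) => a == b,
            (Literal::Float(a), Literal::Float(b)) => a.to_bits() == b.to_bits(),
            _ => false,
        }
    }
}
impl Eq for Literal {}

impl Hash for Literal {
    fn hash<H: Hasher>(&self, state: &mut H) {
        match self {
            // A small tag keeps Int and Float values from feeding identical bytes.
            Literal::Int(v) => {
                0u8.hash(state);
                v.hash(state);
            }
            Literal::Float(v) => {
                1u8.hash(state);
                v.to_bits().hash(state);
            }
        }
    }
}

/// Toy plan tree. The derived `Hash` hashes each node's fields, including its
/// boxed input, so the hash of a node covers its whole subtree.
#[derive(Debug, Clone, PartialEq, Eq, Hash)]
enum Plan {
    Scan { table: String },
    Filter { threshold: Literal, input: Box<Plan> },
    Limit { n: usize, input: Box<Plan> },
}

/// Builds Limit(10) over Filter(threshold) over Scan("t").
fn sample_plan(threshold: f64) -> Plan {
    Plan::Limit {
        n: 10,
        input: Box::new(Plan::Filter {
            threshold: Literal::Float(threshold),
            input: Box::new(Plan::Scan { table: "t".to_string() }),
        }),
    }
}

/// Records `plan`; returns true if it had not been seen before.
fn remember(known_plans: &mut HashSet<Plan>, plan: Plan) -> bool {
    known_plans.insert(plan)
}
```

Storing the plans themselves rather than a bare hash value means the set falls back on `Eq` when two hashes collide, at the cost of keeping every intermediate plan alive for the duration of the optimization loop.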




[GitHub] [arrow-datafusion] Dandandan commented on issue #3892: Improve efficiency of multiple optimizer passes

Posted by GitBox <gi...@apache.org>.
Dandandan commented on issue #3892:
URL: https://github.com/apache/arrow-datafusion/issues/3892#issuecomment-1284089865

   Wouldn't this be a hash?




[GitHub] [arrow-datafusion] HaoYang670 commented on issue #3892: Improve efficiency of multiple optimizer passes

Posted by GitBox <gi...@apache.org>.
HaoYang670 commented on issue #3892:
URL: https://github.com/apache/arrow-datafusion/issues/3892#issuecomment-1284776650

   As this is very similar to the optimizer in a compiler, I guess we can also find some inspiration in how compilers decide when the optimization process is finished.




[GitHub] [arrow-datafusion] alamb closed issue #3892: Improve efficiency of multiple optimizer passes

Posted by "alamb (via GitHub)" <gi...@apache.org>.
alamb closed issue #3892: Improve efficiency of multiple optimizer passes
URL: https://github.com/apache/arrow-datafusion/issues/3892

