You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2021/10/17 00:44:42 UTC

[GitHub] [spark] Kimahriman opened a new pull request #34294: [SPARK-37019][SQL] Add codegen support to array transform

Kimahriman opened a new pull request #34294:
URL: https://github.com/apache/spark/pull/34294

### What changes were proposed in this pull request?

This PR adds codegen support to ArrayTransform. This is my first time playing around with codegen, so definitely looking for any feedback. I ran into several issues along the way which you'll see in some checks I had to add. Specifically:
- I added lambda variable tracking to the codegen context, to make sure a function can't be split out while there is any active lambda variables
- Made sure lambda functions themselves can never be considered for subexpression elimination
- I still have to set the atomic references in the codegen to support any children with CodegenFallback or just that fallback to interpreted in their codegen path

Questions I have:
- Does it make sense to support both the traditional ExprCode approach with lambda variables while also setting the atomic reference value for fallback cases? Or should I just use the atomic reference to get the value everywhere even in codegen cases since I have to set it anyway and simplify the code a little bit?
- Obviously any other corner cases anyone can think of?

### Why are the changes needed?

To improve performance of transform operations, letting the children be codegen'd and participate in WholeStageCodegen

### Does this PR introduce _any_ user-facing change?

No, only performance improvements.

### How was this patch tested?

Existing unit tests, let me know if there's other codegen-specific unit tests I should add.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org