Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2022/08/10 17:55:21 UTC

[GitHub] [spark] peter-toth opened a new pull request, #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

peter-toth opened a new pull request, #37472:
URL: https://github.com/apache/spark/pull/37472

   ### What changes were proposed in this pull request?
   Keep the output attributes of a `Union` node's first child in the `RemoveRedundantAliases` rule to avoid correctness issues.
   
   ### Why are the changes needed?
   To fix the result of the following query:
   ```
   SELECT a, b AS a FROM (
   SELECT a, a AS b FROM (SELECT a FROM VALUES (1) AS t(a))
   UNION ALL
   SELECT a, b FROM (SELECT a, b FROM VALUES (1, 2) AS t(a, b))
   )
   ```
   Before this PR the query returns the incorrect result:
   ```
   +---+---+
   |  a|  a|
   +---+---+
   |  1|  1|
   |  2|  2|
   +---+---+
   ```
   After this PR it returns the expected result:
   ```
   +---+---+
   |  a|  a|
   +---+---+
   |  1|  1|
   |  1|  2|
   +---+---+
   ```
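
To illustrate why the alias in the first `Union` child matters, here is a minimal Python sketch. It is a toy model, not Spark's implementation. It assumes that every output column carries an attribute ID, that a `Union` takes its output attributes from its first child, and that a parent operator finds its inputs by looking up attribute IDs in the child's output. The helper names `bind_by_id` and `evaluate`, the ID values, and the "later duplicate wins" lookup are hypothetical choices made for illustration.

```python
# Toy model (assumption): a column is identified by an integer attribute ID,
# and a parent operator locates its inputs by looking that ID up in the
# child's output list. This is an illustration, not Spark's code.

def bind_by_id(child_output_ids):
    """Map each attribute ID to a column ordinal; a later duplicate overwrites an earlier one."""
    return {attr_id: ordinal for ordinal, attr_id in enumerate(child_output_ids)}


def evaluate(projected_ids, child_output_ids, rows):
    """Project `rows` by resolving each requested attribute ID to a column ordinal."""
    ordinals = bind_by_id(child_output_ids)
    return [tuple(row[ordinals[attr_id]] for attr_id in projected_ids) for row in rows]


# Rows produced by the two Union branches of the query above.
union_rows = [(1, 1), (1, 2)]

# Alias kept: the first child outputs a (ID 1) and b (ID 2), so the IDs are unique.
with_alias = evaluate([1, 2], [1, 2], union_rows)

# Alias removed: the first child outputs the same attribute (ID 1) twice, and
# the outer projection consequently refers to ID 1 twice as well.
without_alias = evaluate([1, 1], [1, 1], union_rows)

print(with_alias)     # [(1, 1), (1, 2)]
print(without_alias)  # [(1, 1), (2, 2)]
```

In this model the duplicated ID makes the lookup ambiguous, and both outer columns resolve to the same ordinal. That the later duplicate wins is a modeling choice that happens to reproduce the incorrect rows shown above. Keeping the alias keeps the IDs unique, so each outer column resolves to its own position.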
   
   ### Does this PR introduce _any_ user-facing change?
   Yes, fixes a correctness issue.
   
   ### How was this patch tested?
   Added new UTs.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] peter-toth commented on pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
peter-toth commented on PR #37472:
URL: https://github.com/apache/spark/pull/37472#issuecomment-1213127391

   Sure, here is the 3.2 version: https://github.com/apache/spark/pull/37491 and the 3.1 version: https://github.com/apache/spark/pull/37496.




[GitHub] [spark] cloud-fan commented on pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
cloud-fan commented on PR #37472:
URL: https://github.com/apache/spark/pull/37472#issuecomment-1212676469

   @peter-toth unfortunately, due to changes to the golden files, this PR has conflicts with all branches. Can you create two more PRs, one for 3.2 and one for 3.1? Thanks!




[GitHub] [spark] peter-toth commented on pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
peter-toth commented on PR #37472:
URL: https://github.com/apache/spark/pull/37472#issuecomment-1211635591

   cc @cloud-fan




[GitHub] [spark] cloud-fan commented on pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
cloud-fan commented on PR #37472:
URL: https://github.com/apache/spark/pull/37472#issuecomment-1212675370

   GA passed: https://github.com/peter-toth/spark/runs/7772855648




[GitHub] [spark] cloud-fan closed pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
cloud-fan closed pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique
URL: https://github.com/apache/spark/pull/37472




[GitHub] [spark] cloud-fan commented on pull request #37472: [SPARK-39887][SQL][3.3] RemoveRedundantAliases should keep aliases that make the output of projection nodes unique

Posted by GitBox <gi...@apache.org>.
cloud-fan commented on PR #37472:
URL: https://github.com/apache/spark/pull/37472#issuecomment-1212676191

   thanks, merging to 3.3!

