Posted to issues@iceberg.apache.org by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/05/24 04:00:33 UTC

[GitHub] [iceberg] ConeyLiu opened a new pull request, #7693: Spark 3.3: Fixes bucket on binary column

ConeyLiu opened a new pull request, #7693:
URL: https://github.com/apache/iceberg/pull/7693

   Closes #7682 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] ConeyLiu commented on pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "ConeyLiu (via GitHub)" <gi...@apache.org>.

ConeyLiu commented on PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#issuecomment-1565137649

   Thanks all. 
   
   > @ConeyLiu @jzhuge, shall we cherry-pick this into Spark 3.2? Do we have a test for this in Spark 3.4 (we are using the function catalog there)?
   
   Let me check it.



[GitHub] [iceberg] ConeyLiu commented on a diff in pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "ConeyLiu (via GitHub)" <gi...@apache.org>.

ConeyLiu commented on code in PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#discussion_r1204958851


##########
spark/v3.3/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestRequiredDistributionAndOrdering.java:
##########
@@ -268,6 +268,23 @@ public void testDefaultSortOnLongTruncatedColumn() {
     assertEquals("Rows must match", expected, sql("SELECT * FROM %s ORDER BY c1", tableName));
   }
 
+  @Test
+  public void testDefaultSortOnBinaryColumn() {

Review Comment:
   @jzhuge I added this UT back; I'm not sure whether this is what you expected.




[GitHub] [iceberg] Fokko commented on pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "Fokko (via GitHub)" <gi...@apache.org>.

Fokko commented on PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#issuecomment-1561216112

   LGTM, thanks for fixing this @ConeyLiu 



[GitHub] [iceberg] aokolnychyi commented on pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "aokolnychyi (via GitHub)" <gi...@apache.org>.

aokolnychyi commented on PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#issuecomment-1564796060

   @ConeyLiu @jzhuge, shall we cherry-pick this into 3.2 and add a test for Spark 3.4 (we are using the function catalog there)?



[GitHub] [iceberg] ConeyLiu commented on a diff in pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "ConeyLiu (via GitHub)" <gi...@apache.org>.

ConeyLiu commented on code in PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#discussion_r1205049115


##########
spark/v3.3/spark-extensions/src/test/java/org/apache/iceberg/spark/extensions/TestRequiredDistributionAndOrdering.java:
##########
@@ -268,6 +268,23 @@ public void testDefaultSortOnLongTruncatedColumn() {
     assertEquals("Rows must match", expected, sql("SELECT * FROM %s ORDER BY c1", tableName));
   }
 
+  @Test
+  public void testDefaultSortOnBinaryColumn() {

Review Comment:
   Hi @jzhuge, I checked `TestRequiredDistributionAndOrdering` again, and it seems more reasonable to put the new UT here: the surrounding UTs are all partition and ordering tests. I renamed the UT from `testDefaultSortOnBinaryColumn` to `testDefaultSortOnBinaryBucketedColumn` to keep the name consistent with the others. However, I can move it if you stand by your point.




[GitHub] [iceberg] aokolnychyi merged pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "aokolnychyi (via GitHub)" <gi...@apache.org>.

aokolnychyi merged PR #7693:
URL: https://github.com/apache/iceberg/pull/7693



[GitHub] [iceberg] aokolnychyi commented on pull request #7693: Spark 3.3: Fixes bucket on binary column

Posted by "aokolnychyi (via GitHub)" <gi...@apache.org>.

aokolnychyi commented on PR #7693:
URL: https://github.com/apache/iceberg/pull/7693#issuecomment-1564789908

   Let me check.

