You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by "itholic (via GitHub)" <gi...@apache.org> on 2023/10/17 09:27:28 UTC

[PR] [SPARK-45566][PS] Support Pandas-like testing utils for Pandas API on Spark [spark]

itholic opened a new pull request, #43398:
URL: https://github.com/apache/spark/pull/43398

   ### What changes were proposed in this pull request?
   
   This PR proposes to support utility functions `assert_frame_equal`, `assert_series_equal`, and `assert_index_equal` in the Pandas API on Spark to aid users in testing.
   
   
   ### Why are the changes needed?
   
   These utility functions allow users to efficiently test the equality of `DataFrames`, `Series`, and `Indexes` in the Pandas API on Spark. Ensuring accurate testing helps in maintaining code quality and user trust in the platform.
   
   e.g.
   ```python
   from pyspark.pandas.testing import assert_frame_equal
   
   df1 = spark.createDataFrame([('Alice', 1), ('Bob', 2)], ["name", "age"])
   df2 = spark.createDataFrame([('Alice', 1), ('Bob', 2)], ["name", "age"])
   
   assert_frame_equal(df1, df2)
   ```
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes. Users will now have access to `assert_frame_equal`, `assert_series_equal`, `and assert_index_equal` functions for testing purposes.
   
   
   ### How was this patch tested?
   
   Added doctests.
   
   
   ### Was this patch authored or co-authored using generative AI tooling?
   
   No.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Re: [PR] [SPARK-45566][PS] Support Pandas-like testing utils for Pandas API on Spark [spark]

Posted by "HyukjinKwon (via GitHub)" <gi...@apache.org>.
HyukjinKwon closed pull request #43398: [SPARK-45566][PS] Support Pandas-like testing utils for Pandas API on Spark
URL: https://github.com/apache/spark/pull/43398


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Re: [PR] [SPARK-45566][PS] Support Pandas-like testing utils for Pandas API on Spark [spark]

Posted by "HyukjinKwon (via GitHub)" <gi...@apache.org>.
HyukjinKwon commented on PR #43398:
URL: https://github.com/apache/spark/pull/43398#issuecomment-1767256163

   Merged to master.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


Re: [PR] [SPARK-45566][PS] Support Pandas-like testing utils for Pandas API on Spark [spark]

Posted by "itholic (via GitHub)" <gi...@apache.org>.
itholic commented on PR #43398:
URL: https://github.com/apache/spark/pull/43398#issuecomment-1766029062

   cc @HyukjinKwon @allanf-db FYI


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org