You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2022/09/05 23:06:51 UTC

[GitHub] [spark] itholic commented on pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark

itholic commented on PR #37671:
URL: https://github.com/apache/spark/pull/37671#issuecomment-1237515485

   The `read_excel` tests still failed for some reason with error below:
   
   ```
   ======================================================================
   FAIL [1.476s]: test_read_excel (pyspark.pandas.tests.test_dataframe_spark_io.DataFrameSparkIOTest)
   ----------------------------------------------------------------------
   Traceback (most recent call last):
     File "/__w/spark/spark/python/pyspark/pandas/tests/test_dataframe_spark_io.py", line 278, in test_read_excel
       ps.read_excel(tmp, index_col=0).sort_index(),
     File "/__w/spark/spark/python/pyspark/pandas/namespace.py", line 1224, in read_excel
       return read_excel_on_spark(pdf_or_psers, sheet_name)
     File "/__w/spark/spark/python/pyspark/pandas/namespace.py", line 1213, in read_excel_on_spark
       psdf = DataFrame(psdf._internal.with_new_sdf(sdf))
     File "/__w/spark/spark/python/pyspark/pandas/internal.py", line 1223, in with_new_sdf
       return self.copy(
     File "/__w/spark/spark/python/pyspark/pandas/internal.py", line 1427, in copy
       return InternalFrame(
     File "/__w/spark/spark/python/pyspark/pandas/internal.py", line 755, in __init__
       assert all(
   AssertionError: ([InternalField(dtype=float64, struct_field=StructField('__index_level_0__', DoubleType(), False))], [StructField('__index_level_0__', DoubleType(), True)])
   
   ----------------------------------------------------------------------
   Ran 8 tests in 34.923s
   ```
   
   I failed to reproduce this error although I use the same version of related envs (e.g. `Python`, `pandas`, `openpyxl`).
   
   Let me leave the re-enabling `read_excel` tests as TODO, and just re-enable the `to_excel` tests here for now.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org