You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Xiao Li (JIRA)" <ji...@apache.org> on 2018/07/19 22:38:00 UTC

[jira] [Created] (SPARK-24867) Add AnalysisBarrier to DataFrameWriter

Xiao Li created SPARK-24867:
-------------------------------

             Summary: Add AnalysisBarrier to DataFrameWriter 
                 Key: SPARK-24867
                 URL: https://issues.apache.org/jira/browse/SPARK-24867
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.3.1, 2.3.0
            Reporter: Xiao Li
            Assignee: Xiao Li



{code}
      val udf1 = udf({(x: Int, y: Int) => x + y})
      val df = spark.range(0, 3).toDF("a")
        .withColumn("b", udf1($"a", udf1($"a", lit(10))))
      df.cache()
      df.write.saveAsTable("t")
      df.write.saveAsTable("t1")
{code}

Cache is not being used because the plans do not match with the cached plan. This is a regression caused by the changes we made in AnalysisBarrier, since not all the Analyzer rules are idempotent. We need to fix it to Spark 2.3.2




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org