You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by gatorsmile <gi...@git.apache.org> on 2015/12/19 19:53:03 UTC

[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

GitHub user gatorsmile opened a pull request:

    https://github.com/apache/spark/pull/10393

    [SPARK-12441] [SQL] Fixing missingInput in Generate

    When explain any plan with Generate, we will see an exclamation mark in the plan. Normally, when we see this mark, it means the plan has an error. This PR is to correct the `missingInput` in `Generate`.
    
    For example, 
    ```scala
    val df = Seq((1, "a b c"), (2, "a b"), (3, "a")).toDF("number", "letters")
    val df2 =
      df.explode('letters) {
        case Row(letters: String) => letters.split(" ").map(Tuple1(_)).toSeq
      }
    
    df2.explain(true)
    ```
    Before the fix, the plan is like 
    ```
    == Parsed Logical Plan ==
    'Generate UserDefinedGenerator('letters), true, false, None
    +- Project [_1#0 AS number#2,_2#1 AS letters#3]
       +- LocalRelation [_1#0,_2#1], [[1,a b c],[2,a b],[3,a]]
    
    == Analyzed Logical Plan ==
    number: int, letters: string, _1: string
    Generate UserDefinedGenerator(letters#3), true, false, None, [_1#8]
    +- Project [_1#0 AS number#2,_2#1 AS letters#3]
       +- LocalRelation [_1#0,_2#1], [[1,a b c],[2,a b],[3,a]]
    
    == Optimized Logical Plan ==
    Generate UserDefinedGenerator(letters#3), true, false, None, [_1#8]
    +- LocalRelation [number#2,letters#3], [[1,a b c],[2,a b],[3,a]]
    
    == Physical Plan ==
    !Generate UserDefinedGenerator(letters#3), true, false, [number#2,letters#3,_1#8]
    +- LocalTableScan [number#2,letters#3], [[1,a b c],[2,a b],[3,a]]
    ```

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gatorsmile/spark generateExplain

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10393.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10393
    
----
commit 21071d1bc9fe0d5869fa94f21f617ee5dbaae390
Author: gatorsmile <ga...@gmail.com>
Date:   2015-12-19T18:33:59Z

    correct missingInput.

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166765006
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-167675761
  
    Thank you!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166751206
  
    Build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166765007
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48219/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166343916
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48115/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166243848
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48096/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166751329
  
    **[Test build #48219 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48219/consoleFull)** for PR 10393 at commit [`8c48c35`](https://github.com/apache/spark/commit/8c48c35df3cba1dc409a7d355d682aed8ea4fec0).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48191985
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    That is a good idea. Will add it soon. Thanks!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166225222
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022788
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022587
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48118001
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala ---
    @@ -131,6 +131,7 @@ class DataFrameSuite extends QueryTest with SharedSQLContext {
           df.explode('letters) {
             case Row(letters: String) => letters.split(" ").map(Tuple1(_)).toSeq
           }
    +    assert(!df2.queryExecution.toString.contains("!"))
    --- End diff --
    
    ah, I think we should not add `resolved` to `SparkPlan` for this purpose,
    how about `assert(df2.queryExecution.executedPlan.missingInput.isEmpty)`?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166343749
  
    **[Test build #48115 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48115/consoleFull)** for PR 10393 at commit [`63058e3`](https://github.com/apache/spark/commit/63058e32ebe178616af54702852a9e83fa025df9).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166243793
  
    **[Test build #48096 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48096/consoleFull)** for PR 10393 at commit [`6b4ba74`](https://github.com/apache/spark/commit/6b4ba7458398ecd74c394fba0b062b2d8bfa8752).
     * This patch **fails from timeout after a configured wait of \`250m\`**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166016740
  
    **[Test build #48055 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48055/consoleFull)** for PR 10393 at commit [`21071d1`](https://github.com/apache/spark/commit/21071d1bc9fe0d5869fa94f21f617ee5dbaae390).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166129455
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166118417
  
    **[Test build #48071 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48071/consoleFull)** for PR 10393 at commit [`b7edd97`](https://github.com/apache/spark/commit/b7edd97e823c57c3f53e0a5d8b2c9a953e69082b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166203846
  
    **[Test build #48096 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48096/consoleFull)** for PR 10393 at commit [`6b4ba74`](https://github.com/apache/spark/commit/6b4ba7458398ecd74c394fba0b062b2d8bfa8752).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022588
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48055/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48226547
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    Is the `!` prefix the only influence if we have `missingInput` in `SparkPlan`? There is no `resolved` concept for `SparkPlan` and it looks to me that physical plan doesn't need to care about these stuff.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48304161
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    I guess it is possible that generation of `SparkPlan` could introduce new `missingInput`. Having this will simplify the debugging?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166017392
  
    **[Test build #48056 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48056/consoleFull)** for PR 10393 at commit [`22d8ba5`](https://github.com/apache/spark/commit/22d8ba5bf1d4ba9e25de02a4619ff3718f257c1b).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48190846
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala ---
    @@ -317,6 +319,7 @@ class DatasetSuite extends QueryTest with SharedSQLContext {
         val ds = Seq(("a", 10), ("a", 20), ("b", 1), ("b", 2), ("c", 1)).toDS()
         val grouped = ds.groupBy($"_1").keyAs[String]
         val agged = grouped.mapGroups { case (g, iter) => (g, iter.map(_._2).sum) }
    +    assert(agged.queryExecution.executedPlan.missingInput.isEmpty)
    --- End diff --
    
    Maybe we should just have `checkAnswer` assert that there are no physical or logical nodes that have missing input (after analysis)?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by viirya <gi...@git.apache.org>.
Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48226884
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    I think so. It seems only for better printing out?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48306508
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    Yeah, exactly.  I added this when I was debugging a problem with the query planner.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166225104
  
    **[Test build #48098 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48098/consoleFull)** for PR 10393 at commit [`63058e3`](https://github.com/apache/spark/commit/63058e32ebe178616af54702852a9e83fa025df9).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/10393


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166129456
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48071/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166129424
  
    **[Test build #48071 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48071/consoleFull)** for PR 10393 at commit [`b7edd97`](https://github.com/apache/spark/commit/b7edd97e823c57c3f53e0a5d8b2c9a953e69082b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:\n  * `public class JavaTwitterHashTagJoinSentiments `\n  * `case class UnresolvedAlias(child: Expression, aliasName: Option[String] = None)`\n


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-167651518
  
    Thanks, merging to master.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166749224
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48217/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48191121
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/basicOperators.scala ---
    @@ -307,6 +307,7 @@ case class MapPartitions[T, U](
         uEncoder: ExpressionEncoder[U],
         output: Seq[Attribute],
         child: SparkPlan) extends UnaryNode {
    +  override def missingInput: AttributeSet = AttributeSet.empty
    --- End diff --
    
    This fixes the problem, but I think it might be better to add something like `def producedAttributes: AttributeSet = AttributeSet` to `QueryPlan`.  This can be automatically subtracted from `missingInput` and nodes that produce attributes can override this function.  I think that will be clearer.  What do you think?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166343915
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166751209
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48218/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48117154
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala ---
    @@ -131,6 +131,7 @@ class DataFrameSuite extends QueryTest with SharedSQLContext {
           df.explode('letters) {
             case Row(letters: String) => letters.split(" ").map(Tuple1(_)).toSeq
           }
    +    assert(!df2.queryExecution.toString.contains("!"))
    --- End diff --
    
    @cloud-fan I like your suggestion!
    
    `resolved` is not defined in `SparkPlan`. It is only available in `LogicalPlan`. I am not sure if you want me to define it in `SparkPlan` and overwrite it if necessary like what we did in `LogicalPlan`?
    
    Thank you!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166764928
  
    **[Test build #48219 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48219/consoleFull)** for PR 10393 at commit [`8c48c35`](https://github.com/apache/spark/commit/8c48c35df3cba1dc409a7d355d682aed8ea4fec0).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166243846
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48117166
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Generate.scala ---
    @@ -51,9 +52,12 @@ case class Generate(
         join: Boolean,
         outer: Boolean,
         output: Seq[Attribute],
    +    generatorOutput: Seq[Attribute],
         child: SparkPlan)
       extends UnaryNode {
     
    +  override def missingInput: AttributeSet = super.missingInput -- generatorOutput
    +
    --- End diff --
    
    Thank you, @viirya and @cloud-fan ! Just did the change. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022565
  
    **[Test build #48055 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48055/consoleFull)** for PR 10393 at commit [`21071d1`](https://github.com/apache/spark/commit/21071d1bc9fe0d5869fa94f21f617ee5dbaae390).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166317709
  
    retest this please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166415241
  
    I think its fine to have this stuff in physical plans.  I actually added the `!` when I was debugging problems with the query planner (i.e. it was producing spark plans with the wrong attributes).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48119148
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala ---
    @@ -131,6 +131,7 @@ class DataFrameSuite extends QueryTest with SharedSQLContext {
           df.explode('letters) {
             case Row(letters: String) => letters.split(" ").map(Tuple1(_)).toSeq
           }
    +    assert(!df2.queryExecution.toString.contains("!"))
    --- End diff --
    
    Thank you! @cloud-fan I did the change as you suggested. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022790
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48056/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48197749
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala ---
    @@ -317,6 +319,7 @@ class DatasetSuite extends QueryTest with SharedSQLContext {
         val ds = Seq(("a", 10), ("a", 20), ("b", 1), ("b", 2), ("c", 1)).toDS()
         val grouped = ds.groupBy($"_1").keyAs[String]
         val agged = grouped.mapGroups { case (g, iter) => (g, iter.map(_._2).sum) }
    +    assert(agged.queryExecution.executedPlan.missingInput.isEmpty)
    --- End diff --
    
    Just did that, and then realized more nodes need a fix. : )


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166213409
  
    **[Test build #48098 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48098/consoleFull)** for PR 10393 at commit [`63058e3`](https://github.com/apache/spark/commit/63058e32ebe178616af54702852a9e83fa025df9).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by viirya <gi...@git.apache.org>.
Github user viirya commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48114994
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Generate.scala ---
    @@ -51,9 +52,12 @@ case class Generate(
         join: Boolean,
         outer: Boolean,
         output: Seq[Attribute],
    +    generatorOutput: Seq[Attribute],
         child: SparkPlan)
       extends UnaryNode {
     
    +  override def missingInput: AttributeSet = super.missingInput -- generatorOutput
    +
    --- End diff --
    
    You can use the same approach in logical.Generate, i.e.,
    
         override def expressions: Seq[Expression] = generator :: Nil
    
    to solve this issue. Then you don't need to modify SparkStrategies.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by aqqaluk <gi...@git.apache.org>.
Github user aqqaluk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48218623
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Generate.scala ---
    @@ -51,9 +52,12 @@ case class Generate(
         join: Boolean,
         outer: Boolean,
         output: Seq[Attribute],
    +    generatorOutput: Seq[Attribute],
         child: SparkPlan)
       extends UnaryNode {
     
    +  override def missingInput: AttributeSet = super.missingInput -- generatorOutput
    +
    --- End diff --
    
    Testing new platform with a new vers. of xubuntu linux of course 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48115321
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala ---
    @@ -131,6 +131,7 @@ class DataFrameSuite extends QueryTest with SharedSQLContext {
           df.explode('letters) {
             case Row(letters: String) => letters.split(" ").map(Tuple1(_)).toSeq
           }
    +    assert(!df2.queryExecution.toString.contains("!"))
    --- End diff --
    
    how about `assert(df2.queryExecution.executedPlan.resolved)`?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166749223
  
    Build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166022745
  
    **[Test build #48056 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48056/consoleFull)** for PR 10393 at commit [`22d8ba5`](https://github.com/apache/spark/commit/22d8ba5bf1d4ba9e25de02a4619ff3718f257c1b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166320154
  
    **[Test build #48115 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48115/consoleFull)** for PR 10393 at commit [`63058e3`](https://github.com/apache/spark/commit/63058e32ebe178616af54702852a9e83fa025df9).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166228564
  
    This LGTM.
    
    But I have a question that do we need these resolution related stuff in `SparkPlan`? cc @marmbrus 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10393#discussion_r48115249
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/Generate.scala ---
    @@ -51,9 +52,12 @@ case class Generate(
         join: Boolean,
         outer: Boolean,
         output: Seq[Attribute],
    +    generatorOutput: Seq[Attribute],
         child: SparkPlan)
       extends UnaryNode {
     
    +  override def missingInput: AttributeSet = super.missingInput -- generatorOutput
    +
    --- End diff --
    
    +1


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12441] [SQL] Fixing missingInput in Gen...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10393#issuecomment-166225223
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48098/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org