You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by scwf <gi...@git.apache.org> on 2014/10/07 03:49:45 UTC

[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

GitHub user scwf opened a pull request:

    https://github.com/apache/spark/pull/2685

    [SPARK-3826][SQL]enable hive-thriftserver to support hive-0.13.1

     In #2241 hive-thriftserver is not enabled. This patch enable hive-thriftserver to support hive-0.13.1 by using a shim layer refer to #2241.
    
     1 A light shim layer(code in sql/hive-thriftserver/hive-version) for each different hive version to handle api compatibility
    
     2 New pom profiles "hive-default" and "hive-versions"(copy from #2241) to activate different hive version
    
     3 SBT cmd for different version as follows:
       hive-0.12.0 --- sbt/sbt -Phive,hadoop-2.3 assembly
       hive-0.13.1 --- sbt/sbt -Phive,hadoop-2.3 -Dhive.version=0.13.1 assembly
    
     4 Since hive-thriftserver depend on hive subproject, this patch should be merged with #2241 to enable hive-0.13.1 for hive-thriftserver


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/scwf/spark shim-thriftserver1

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/2685.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #2685
    
----
commit a956e92b92079f50bd4fc466e60a6324b30bc857
Author: scwf <wa...@huawei.com>
Date:   2014-10-06T23:54:15Z

    draft to enable hive-thriftserver support hive0.13.1

commit 3a08b14710e7b5649c6de5406c257ce8d59dc8e0
Author: scwf <wa...@huawei.com>
Date:   2014-10-07T00:45:27Z

    code format

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60842557
  
    I am not familiar with the test script, is this change ok?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61209877
  
    @scwf @zhzhan Just FYI, to run PySpark tests locally, you may first export `SPARK_HOME` and `PYTHONPATH` as follows:
    
    ```bash
    export SPARK_HOME=<...>
    export PYTHONPATH=$SPARK_HOME/python:$SPARK_HOME/python/lib/py4j-0.8.2.1-src.zip
    ```
    
    and then run `./python/run-tests` to start PySpark tests.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by chenghao-intel <gi...@git.apache.org>.
Github user chenghao-intel commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60694925
  
    Thanks, @marmbrus @scwf That makes sense to me, I will update my PRs with Hive 0.12 code compatible, to keep the build successful. 
    @scwf Let's see if there any help you need from me while doing the shim code refactor.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214247
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22588/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61183861
  
    it used, see https://github.com/apache/spark/blob/master/python/pyspark/sql.py#L1439-L1442


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61038997
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22508/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052028
  
    ```com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy``` is in kryo 2.22


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61222247
  
    it is not possible to mutate them after the fact. Let's stick with 0.13.1a - I will fully release it now. But don't remove the extra repository (we can remove it later) because it takes some time to propagate.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61137061
  
    If they are not source compatible, maybe we should re-shade hive to use kryo 2.21.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60857256
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22390/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61001350
  
    Yes, i also able to reproduce it, can you test with org.apache.hive? my locally it will be ok


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60787703
  
    @marmbrus, there are some problems when consolidating all the Shim code into the existing hive subproject. The shim code of thriftserver depend on some classes of hive-thriftserver such as ```SparkSQLCLIService```, ```AbstractSparkSQLDriver```. So moving them to hive subproject leads to hive dependency on hive-thriftserver, that is not we expected.     
    And another problem is that if we consolidate all shim code in a existing(or new) subproject, we need include hive-thriftserver in parent pom to ensure compile ok. This violates that now we do not include/compile hive-thriftserver default.
    So i suggest keep the shim code for themselves, and hive-thriftserver can reuse some shim code of hive to avoid  duplicating code:)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61054248
  
    @zhzhan we use a special hive-exec jar that doesn't shade any dependencies. The original hive-exec jar includes a bunch of other stuff that we don't want. However, it might make sense for our special hive-exec jar to shade kryo to not have to deal with this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61437220
  
    @coderfi This is a good catch, would you mind to file a JIRA ticket for this? A PR would be even better :)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61043371
  
    Seems we can not upgrade kryo in spark, since the latest chill depend on 2.21. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052655
  
    Please refer to following hive-exec directory, as we can see esotericsoftware are all in org.apache.hive.
    
    HW11188:tmp1 zzhang$ ls org/apache/hive/com/esotericsoftware/
    kryo/       minlog/     reflectasm/ shaded/


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-58133525
  
      [QA tests have finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/277/consoleFull) for   PR 2685 at commit [`3a08b14`](https://github.com/apache/spark/commit/3a08b14710e7b5649c6de5406c257ce8d59dc8e0).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052281
  
    @pwendell, right in hive 0.13.1 it use the shaded ```com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy``` in kryo 2.22.
    So if we exclude it, we will get classnotfound error, because in kryo 2.21(spark chill depends on) in spark does not have this class(the class in kryo 2.21 is org.objenesis.strategy.InstantiatorStrategy)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61438681
  
    Yes, here should be 0.13.1a.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61306597
  
    Thanks guys for all your hard work on this!  Merging to master.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60857250
  
      [Test build #22390 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22390/consoleFull) for   PR 2685 at commit [`8a4daf2`](https://github.com/apache/spark/commit/8a4daf2a3d9a1529eb8f6598bbb619e6a0283002).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19384176
  
    --- Diff: pom.xml ---
    @@ -95,6 +95,7 @@
         <module>sql/catalyst</module>
         <module>sql/core</module>
         <module>sql/hive</module>
    +    <module>sql/hive-thriftserver</module>
    --- End diff --
    
    I think this module gets added already when you run with `-Phive` (see the profile declaration below in this file), we should probably move the `hive` module there as well. I think right now Hive will get built by default, which is wrong.
    
    Can we make it so that `sql/hive` and `sql/hive-thriftserver` are only included when you use `-Phive` profile?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61221784
  
      [Test build #22591 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22591/consoleFull) for   PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61165478
  
    Locally test seems ok(just test with thrift server and hash shuffle), updated.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61032871
  
    Ok, i will update and have a try. But why apache hive is ok link #3004?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/2685


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60534600
  
    Some minor comments on the build stuff. I think we're close, it's just small stuff.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61032670
  
    Okay I narrowed it down at an issue. Hive 0.13 now lists Kryo 2.22 as a dependency. Spark depends on Kryo 2.21, and they are binary incompatible due to the way the interact with thier objenesis dependency (which kryo started shading in 2.22).
    
    My local reproduction was solved by adding the following exclusion from hive-exec dependency - we should try that on this patch.
    
    ```
    --- a/sql/hive/pom.xml
    +++ b/sql/hive/pom.xml
    @@ -65,6 +65,10 @@
               <groupId>commons-logging</groupId>
               <artifactId>commons-logging</artifactId>
             </exclusion>
    +        <exclusion>
    +          <groupId>com.esotericsoftware.kryo</groupId>
    +          <artifactId>kryo</artifactId>
    +        </exclusion>
           </exclusions>
         </dependency>
         <dependency>
    ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60868885
  
    I'm looking into this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052076
  
    Another thing to notice is that Kryo 2.21 is a really weird release. [Kryo 2.21 POM](https://repo1.maven.org/maven2/com/esotericsoftware/kryo/kryo/2.21/kryo-2.21.pom) suggests that Objenesis classes are relocated to package `com.esotericsoftware.shaded.org.objenesis`, but classes within the Maven artifact jar file still reside in package `org.objenesis`. Also, Kryo GitHub repo doesn't provide 2.21 release download and the version number in the POM of [kryo-2.21 tag](https://github.com/EsotericSoftware/kryo/blob/kryo-2.21/pom.xml#L13) is actually `2.21-SNAPSHOT`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61135531
  
    @liancheng okay - I'll see what I can do. If Kryo 2.21 and 2.22 are source compatible it should be doable.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61203563
  
    retest this please.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61209998
  
    Cool, thanks for this.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60997808
  
    hey, @zhzhan, i think we'd better not test with adding megastore dep to core/pom.xml. The key point is #2241 core test failed with org.spark-project.hive but passed with org.apache.hive. @pwendell, any idea about this?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60640623
  
    @chenghao-intel, i think we should support both, this is why we make a new subproject for this.   


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60959426
  
    Yes, i changed org.spark-project.hive to org.apache.hive in hive pom and hive-thriftserver pom, then it passed.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60846346
  
      [Test build #22386 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22386/consoleFull) for   PR 2685 at commit [`0d7f6cf`](https://github.com/apache/spark/commit/0d7f6cf958cb293051719c60020d26a552063d31).
     * This patch **fails to build**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19384192
  
    --- Diff: dev/run-tests ---
    @@ -167,7 +167,7 @@ CURRENT_BLOCK=$BLOCK_SPARK_UNIT_TESTS
       # If the Spark SQL tests are enabled, run the tests with the Hive profiles enabled.
       # This must be a single argument, as it is.
       if [ -n "$_RUN_SQL_TESTS" ]; then
    -    SBT_MAVEN_PROFILES_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive"
    +    SBT_MAVEN_PROFILES_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
    --- End diff --
    
    Yeah agreed.  If Hive13 is the default we should test Hive13.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61165645
  
    Jenkins, test this please.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61203445
  
      [Test build #22580 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22580/consoleFull) for   PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19515513
  
    --- Diff: dev/run-tests ---
    @@ -147,12 +147,18 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    +    | grep -v -e "info.*Resolving" -e "warn.*Merging" -e "info.*Including"
    +
    +  # Then build with default version(0.13.1) because tests based on this version
    --- End diff --
    
    because tests _are_ based on this version.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60866836
  
      [Test build #22410 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22410/consoleFull) for   PR 2685 at commit [`dddeb54`](https://github.com/apache/spark/commit/dddeb549e268591a71853aaab01342f92ab8837c).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052515
  
    Okay I think the issue is pretty tough. Unfortunately hive is directly using the shaded objenesis classes. However, Spark needs Kryo 2.21 which depends on the original objenesis classes.
    
    Here is the hive code that uses it:
    
    https://github.com/apache/hive/blob/branch-0.13/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java#L186
    
    So we can't just remove kryo that hive uses. This is pretty ugly. One solution might be to update chill in Spark so that Spark is using the same Kryo version as Hive.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61203450
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22580/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052007
  
    @scwf the hive classes only link against kryo... they don't link against objenesis directly. As long as kryo did not make a binary-incompatible change between 2.21 and 2.22, it should be fine.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60840749
  
    I'm afraid that will take too long to do it for each PR.  We do plan to set up a separate build that runs the hive 12 tests nightly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61042000
  
    I am testing with just upgrade kryo in spark and do not exclude hive's kryo


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-58126244
  
    Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19653777
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    +    | grep -v -e "info.*Resolving" -e "warn.*Merging" -e "info.*Including"
    +
    +  # Then build with default version(0.13.1) because tests are based on this version
    +  echo "[info] Building Spark with these arguments: $SBT_MAVEN_PROFILES_ARGS -Phive"
    +  echo -e "q\n" \
    +    | sbt/sbt $SBT_MAVEN_PROFILES_ARGS -Phive clean package assembly/assembly  \
    --- End diff --
    
    sure


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60997618
  
    @liancheng @scwf Looks like my guess is wrong. I got the same result as @scwf, passing the test with  genuine hive


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61178184
  
    hmm, the result is same as #3004, core and hive passed but pyspark failed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60533046
  
    Now that Hive 13 is merged in it would be great to get this in ASAP.  I looked over this and it seems pretty good.  My only high level comment is maybe we should keep all the Hive Shim code in a single project instead of having version specific code in both hive and hive-thrift server.  That would simplify the build and consolidate the places where we have these hacks.  It woudl also allow us to avoid duplicating things like `getCommandProcessor` in both Shims.  Thoughts?
    
    @pwendell can you glance over the (limited) build changes.
    @liancheng can you look this over as well?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19653726
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    --- End diff --
    
    i think it's against 0.12 because ```BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0" ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19653250
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    --- End diff --
    
    I actually don't think this is compiling against Hive 0.12 right now... is it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19250561
  
    --- Diff: sql/hive-thriftserver/pom.xml ---
    @@ -42,6 +42,11 @@
           <version>${project.version}</version>
         </dependency>
         <dependency>
    +      <groupId>org.scalatest</groupId>
    --- End diff --
    
    Any specific reason to change the location? If not, please revert it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61038986
  
      [Test build #22508 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22508/consoleFull) for   PR 2685 at commit [`18fb1ff`](https://github.com/apache/spark/commit/18fb1fff1c2a097604b573fffba92b9a7a3f3e8f).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60833918
  
      [Test build #488 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/488/consoleFull) for   PR 2685 at commit [`bcf943f`](https://github.com/apache/spark/commit/bcf943f8fffcbd8c452651130eb5d232ea803e4f).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60840498
  
    A feasible way, should we run tests with 0.12 and 0.13 both?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61222008
  
    Yay! It passed!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19251000
  
    --- Diff: pom.xml ---
    @@ -1272,16 +1276,33 @@
             </dependency>
           </dependencies>
         </profile>
    -
         <profile>
    -      <id>hive</id>
    +      <id>hive-0.12.0</id>
           <activation>
             <activeByDefault>false</activeByDefault>
           </activation>
           <modules>
             <module>sql/hive-thriftserver</module>
           </modules>
    +      <properties>
    +        <hive.version>0.12.0-protobuf-2.5</hive.version>
    +        <hive.version.short>0.12.0</hive.version.short>
    +        <derby.version>10.4.2.0</derby.version>
    +      </properties>
    +    </profile>
    +    <profile>
    +      <id>hive-0.13.1</id>
    +      <activation>
    +        <activeByDefault>false</activeByDefault>
    +      </activation>
    +      <modules>
    +        <module>sql/hive-thriftserver</module>
    --- End diff --
    
    you mean move to <modules> upper in this pom?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61045831
  
    I'm combing shading and dependency among Hive, Spark, Chill, Kryo and Objenesis. Will give a summary later. I don't think we can fix all the problem unless relationships among these key components are crystal clear...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60859874
  
    It failed in core twice with the same errors.  I think it might be our fault.  Lets try putting package back in before running tests.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61134677
  
    Did a bunch of local tests, but didn't have any luck. I'd suggest to re-shade the org.spark-project.hive:hive-exec:0.13.1 jar and include the shaded Kryo 2.22 in it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60896714
  
    A summary of what I've found with local testing:
    
    - For master branch:
    
      - Check out a fresh copy, build the assembly jar first, then run `HashShuffleSuite`: pass
      - Check out a fresh copy, run `HashShuffleSuite` directly without building assembly jar: fail
    
    - For this PR:
    
      Both approaches fail.
    
    So I guess the problem should be related to Maven configurations.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051983
  
    actually the most recent failures, it is using kryo 2.21


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60547522
  
    @marmbrus, how about make a new sub project named hive-shim to keep all the Hive Shim code in it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60842352
  
      [Test build #22382 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22382/consoleFull) for   PR 2685 at commit [`f7c93ae`](https://github.com/apache/spark/commit/f7c93ae312c7e06eafbdb214bcfeedb53e280df5).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60972893
  
    @liancheng @scwf As long as I add metastore dependency to core/pom.xml there will be timeout error, no matter it is genuine hive or not,  or even with 0.12.0. It may indicate there is hive package leakage in this test. I will do another experiment and update you guys. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61146852
  
    I am publishing a version of Hive that relies on Kryo 2.21. We test this patch with it. I'll update when it's ready.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61053147
  
    @zhzhan they can both exist. The issue is that Spark uses a library Chill that requires Kryo 2.21. If 2.21 and 2.22 are not binary compatible, this will break it and there is no solution. If 2.21 and 2.22 are binary compatible, then we can get away with it, but we may need to add objeneiss explicitly because IIRC chill directly uses the original objenesis packages.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60852219
  
      [Test build #22382 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22382/consoleFull) for   PR 2685 at commit [`f7c93ae`](https://github.com/apache/spark/commit/f7c93ae312c7e06eafbdb214bcfeedb53e280df5).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60937978
  
    A missing piece from my last comment: while doing all the local tests I mentioned above, I was following [this change](https://github.com/marmbrus/spark/commit/8f6b09a7813cec22480a23f0301c5d5988090d02#commitcomment-8299290), which builds assembly jar with `-Phive,hive-0.12.0` but runs tests with `-Phive`. In this way, we can pass all Spark core tests.
    
    But this PR tries to fix this inconsistency and removed the `hive-0.12.0` profile while building assembly jar. Although we haven't figured out why, but this breaks some Spark core test suites.
    
    Building assembly jar with Hive 0.12.0 dependency while testing with 0.13.1 doesn't work for `HiveThriftServer2`, since `HiveThriftServer2Suite` and `CliSuite` both relies on the assembly jar and spawn external processes for testing purposes.
    
    Basically, somehow #2241 breaks Spark core tests if compiled with Hive 0.13.1 dependencies. @scwf and I are investigating the reason.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052977
  
    @pwendell @liancheng @scwf  Folks, why the shaded com.esotericsoftware in hive cannot coexist with com.esotericsoftware in spark?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61221786
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22591/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61056033
  
    locally test result
    1 
    ```
    diff --git a/core/pom.xml b/core/pom.xml
    index 5cd21e1..c87f661 100644
    --- a/core/pom.xml
    +++ b/core/pom.xml
    @@ -131,6 +131,11 @@
           <artifactId>chill-java</artifactId>
         </dependency>
         <dependency>
    +      <groupId>com.esotericsoftware.kryo</groupId>
    +      <artifactId>kryo</artifactId>
    +      <version>2.22</version>
    +    </dependency>
    +    <dependency>
           <groupId>org.roaringbitmap</groupId>
           <artifactId>RoaringBitmap</artifactId>
         </dependency>
    ```
    compile error.
    
    2 
    ```
    diff --git a/core/pom.xml b/core/pom.xml
    index 5cd21e1..50271ed 100644
    --- a/core/pom.xml
    +++ b/core/pom.xml
    @@ -125,10 +125,32 @@
         <dependency>
           <groupId>com.twitter</groupId>
           <artifactId>chill_${scala.binary.version}</artifactId>
    +      <exclusions>
    +        <exclusion>
    +          <groupId>com.esotericsoftware.kryo</groupId>
    +          <artifactId>kryo</artifactId>
    +        </exclusion>
    +      </exclusions>
         </dependency>
         <dependency>
           <groupId>com.twitter</groupId>
           <artifactId>chill-java</artifactId>
    +      <exclusions>
    +        <exclusion>
    +          <groupId>com.esotericsoftware.kryo</groupId>
    +          <artifactId>kryo</artifactId>
    +        </exclusion>
    +      </exclusions>
    +    </dependency>
    +    <dependency>
    +      <groupId>com.esotericsoftware.kryo</groupId>
    +      <artifactId>kryo</artifactId>
    +      <version>2.22</version>
    +    </dependency>
    +    <dependency>
    +      <groupId>org.objenesis</groupId>
    +      <artifactId>objenesis</artifactId>
    +      <version>1.2</version>
    ```
    hashshuffle test failed as before.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-58856535
  
    @pwendell, i am resolving the conflicts, other TODO's here?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214972
  
      [Test build #22591 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22591/consoleFull) for   PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60818429
  
    There is a little issue that when first running "show tables" with hadoop1(i tested with 1.0.4) will get the following error:
    java.lang.NoSuchMethodError: org.apache.hadoop.conf.Configuration.unset(Ljava/lang/String;)
    but the second time it's ok.
    After diagnosing I think it is caused by a minor bug in hive. we can fix it after this PR and @liancheng's PR merged.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214243
  
    **[Test build #22588 timed out](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22588/consoleFull)**     for PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277)     after a configured wait of `120m`.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19256989
  
    --- Diff: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/server/SparkSQLOperationManager.scala ---
    @@ -54,158 +45,8 @@ private[thriftserver] class SparkSQLOperationManager(hiveContext: HiveContext)
           confOverlay: JMap[String, String],
           async: Boolean): ExecuteStatementOperation = synchronized {
     
    -    val operation = new ExecuteStatementOperation(parentSession, statement, confOverlay) {
    -      private var result: SchemaRDD = _
    -      private var iter: Iterator[SparkRow] = _
    -      private var dataTypes: Array[DataType] = _
    -
    -      def close(): Unit = {
    -        // RDDs will be cleaned automatically upon garbage collection.
    -        logDebug("CLOSING")
    -      }
    -
    -      def getNextRowSet(order: FetchOrientation, maxRowsL: Long): RowSet = {
    -        if (!iter.hasNext) {
    -          new RowSet()
    -        } else {
    -          // maxRowsL here typically maps to java.sql.Statement.getFetchSize, which is an int
    -          val maxRows = maxRowsL.toInt
    -          var curRow = 0
    -          var rowSet = new ArrayBuffer[Row](maxRows.min(1024))
    -
    -          while (curRow < maxRows && iter.hasNext) {
    -            val sparkRow = iter.next()
    -            val row = new Row()
    -            var curCol = 0
    -
    -            while (curCol < sparkRow.length) {
    -              if (sparkRow.isNullAt(curCol)) {
    -                addNullColumnValue(sparkRow, row, curCol)
    -              } else {
    -                addNonNullColumnValue(sparkRow, row, curCol)
    -              }
    -              curCol += 1
    -            }
    -            rowSet += row
    -            curRow += 1
    -          }
    -          new RowSet(rowSet, 0)
    -        }
    -      }
    -
    -      def addNonNullColumnValue(from: SparkRow, to: Row, ordinal: Int) {
    -        dataTypes(ordinal) match {
    -          case StringType =>
    -            to.addString(from(ordinal).asInstanceOf[String])
    -          case IntegerType =>
    -            to.addColumnValue(ColumnValue.intValue(from.getInt(ordinal)))
    -          case BooleanType =>
    -            to.addColumnValue(ColumnValue.booleanValue(from.getBoolean(ordinal)))
    -          case DoubleType =>
    -            to.addColumnValue(ColumnValue.doubleValue(from.getDouble(ordinal)))
    -          case FloatType =>
    -            to.addColumnValue(ColumnValue.floatValue(from.getFloat(ordinal)))
    -          case DecimalType =>
    -            val hiveDecimal = from.get(ordinal).asInstanceOf[BigDecimal].bigDecimal
    -            to.addColumnValue(ColumnValue.stringValue(new HiveDecimal(hiveDecimal)))
    -          case LongType =>
    -            to.addColumnValue(ColumnValue.longValue(from.getLong(ordinal)))
    -          case ByteType =>
    -            to.addColumnValue(ColumnValue.byteValue(from.getByte(ordinal)))
    -          case ShortType =>
    -            to.addColumnValue(ColumnValue.shortValue(from.getShort(ordinal)))
    -          case TimestampType =>
    -            to.addColumnValue(
    -              ColumnValue.timestampValue(from.get(ordinal).asInstanceOf[Timestamp]))
    -          case BinaryType | _: ArrayType | _: StructType | _: MapType =>
    -            val hiveString = result
    -              .queryExecution
    -              .asInstanceOf[HiveContext#QueryExecution]
    -              .toHiveString((from.get(ordinal), dataTypes(ordinal)))
    -            to.addColumnValue(ColumnValue.stringValue(hiveString))
    -        }
    -      }
    -
    -      def addNullColumnValue(from: SparkRow, to: Row, ordinal: Int) {
    -        dataTypes(ordinal) match {
    -          case StringType =>
    -            to.addString(null)
    -          case IntegerType =>
    -            to.addColumnValue(ColumnValue.intValue(null))
    -          case BooleanType =>
    -            to.addColumnValue(ColumnValue.booleanValue(null))
    -          case DoubleType =>
    -            to.addColumnValue(ColumnValue.doubleValue(null))
    -          case FloatType =>
    -            to.addColumnValue(ColumnValue.floatValue(null))
    -          case DecimalType =>
    -            to.addColumnValue(ColumnValue.stringValue(null: HiveDecimal))
    -          case LongType =>
    -            to.addColumnValue(ColumnValue.longValue(null))
    -          case ByteType =>
    -            to.addColumnValue(ColumnValue.byteValue(null))
    -          case ShortType =>
    -            to.addColumnValue(ColumnValue.shortValue(null))
    -          case TimestampType =>
    -            to.addColumnValue(ColumnValue.timestampValue(null))
    -          case BinaryType | _: ArrayType | _: StructType | _: MapType =>
    -            to.addColumnValue(ColumnValue.stringValue(null: String))
    -        }
    -      }
    -
    -      def getResultSetSchema: TableSchema = {
    --- End diff --
    
    actually this is a override method, we can not move to outside


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19250491
  
    --- Diff: pom.xml ---
    @@ -1272,16 +1276,33 @@
             </dependency>
           </dependencies>
         </profile>
    -
         <profile>
    -      <id>hive</id>
    +      <id>hive-0.12.0</id>
           <activation>
             <activeByDefault>false</activeByDefault>
           </activation>
           <modules>
             <module>sql/hive-thriftserver</module>
           </modules>
    +      <properties>
    +        <hive.version>0.12.0-protobuf-2.5</hive.version>
    +        <hive.version.short>0.12.0</hive.version.short>
    +        <derby.version>10.4.2.0</derby.version>
    +      </properties>
    +    </profile>
    +    <profile>
    +      <id>hive-0.13.1</id>
    +      <activation>
    +        <activeByDefault>false</activeByDefault>
    +      </activation>
    +      <modules>
    +        <module>sql/hive-thriftserver</module>
    --- End diff --
    
    After hive-0.13.1 is committed, sql/hive-thirftserver can be put to top level instead of here.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19387530
  
    --- Diff: pom.xml ---
    @@ -95,6 +95,7 @@
         <module>sql/catalyst</module>
         <module>sql/core</module>
         <module>sql/hive</module>
    +    <module>sql/hive-thriftserver</module>
    --- End diff --
    
    @pwendell, we need ```sql/hive``` included even no ```-Phive```, because ```examples``` depend on it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60645981
  
    @marmbrus ok, i will consolidate all shim code in hive project. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61442885
  
    Yes - we just need to change that to 0.13.1a
    
    On Sun, Nov 2, 2014 at 8:05 PM, wangfei <no...@github.com> wrote:
    
    > Yes, here should be 0.13.1a.
    >
    > —
    > Reply to this email directly or view it on GitHub
    > <https://github.com/apache/spark/pull/2685#issuecomment-61438681>.
    >


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60858544
  
    Test failed in core


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by coderfi <gi...@git.apache.org>.
Github user coderfi commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61449633
  
    @liancheng PR #3072 created (all of one line! :) ).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61040412
  
    yeah, in hive-0.13.1 https://github.com/apache/hive/blob/release-0.13.1/ql/src/java/org/apache/hadoop/hive/ql/exec/Utilities.java#L186 using ```com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy```, while in com.twitter:chill_2.10:0.3.6 using ```org.objenesis.strategy.InstantiatorStrategy```
    
    ```
    class EmptyScalaKryoInstantiator extends KryoInstantiator {
      override def newKryo = {
        val k = new KryoBase
        k.setRegistrationRequired(false)
        k.setInstantiatorStrategy(new org.objenesis.strategy.StdInstantiatorStrategy)
        k
      }
    }
    ```



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61178002
  
      [Test build #22560 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22560/consoleFull) for   PR 2685 at commit [`578234d`](https://github.com/apache/spark/commit/578234d398ecd40c3ba024752f04bcf98a78710f).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052650
  
    Actually @scwf found Chill had once tried to upgrade to Kryo 2.22, but reverted it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61150244
  
    Ok, thanks for that.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61034841
  
      [Test build #22508 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22508/consoleFull) for   PR 2685 at commit [`18fb1ff`](https://github.com/apache/spark/commit/18fb1fff1c2a097604b573fffba92b9a7a3f3e8f).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60869228
  
    @liancheng, master branch passed in your local machine?
    i tested with
    ```
    sbt/sbt -Phive,hadoop-2.3 assembly
    sbt/sbt -Phive,hadoop-2.3 core/test
    ``` 
    and failed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61231536
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22602/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61180228
  
    @scwf  
     : org.apache.spark.sql.execution.QueryExecutionException: FAILED: SemanticException [Error 10072]: Database does not exist: default
    
    Looks like need to call HiveShim.createDefaultDBIfNeeded, currently it is only invoked in TestHive. But I don't know why it pass locally as I remembered. The server cleans the environment more completely?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61054698
  
    Actually before we merge the #2241, we even not test core with hive-0.13, so this issue comes here:)  


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61189157
  
    @scwf  It seems that dev/run-tests does not run python test. I don't have right environment setup for python test. I think you can either add/use the default database or delete that test as Michael's suggestion.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61166027
  
      [Test build #22560 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22560/consoleFull) for   PR 2685 at commit [`578234d`](https://github.com/apache/spark/commit/578234d398ecd40c3ba024752f04bcf98a78710f).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60818569
  
    And this is ok to test?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051413
  
    The problem here is that Hive 0.13 upgrades the Kryo version from 2.21 to 2.22. Spark previously depends on Kryo 2.22 via chill. In Kryo 2.22 they made a build change where they started inlining the objenesis dependency via shading. This patch somehow causes Spark to compile against Kryo 2.21 and run against Kryo 2.22, which is the root cause of the errors.
    
    My suggestion was to just exclude Kryo from Hive, hoping that it would result in us just keeping Kryo 2.21 and that Hive could deal with it. We might need to exclude it in other places than hive-exec. That could be the issue.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61195313
  
      [Test build #22580 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22580/consoleFull) for   PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60556909
  
    Yeah, I like that idea.
    On Oct 26, 2014 8:43 PM, "wangfei" <no...@github.com> wrote:
    
    > @marmbrus <https://github.com/marmbrus>, how about make a new sub project
    > named hive-shim to keep all the Hive Shim code in it?
    >
    > —
    > Reply to this email directly or view it on GitHub
    > <https://github.com/apache/spark/pull/2685#issuecomment-60547522>.
    >


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19251025
  
    --- Diff: sql/hive-thriftserver/pom.xml ---
    @@ -42,6 +42,11 @@
           <version>${project.version}</version>
         </dependency>
         <dependency>
    +      <groupId>org.scalatest</groupId>
    --- End diff --
    
    ok, i will revert this


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60961069
  
    Hm, seems that my experiment over core/pom.xml is not sound. Changing to Apache Hive still fails HashShuffleSuite, but all the failures are caused by timeouts. Anyway, replacing org.spark-project.hive jars with org.apache.hive jars in #2241 fixes the build failure.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61018904
  
    So I found out, something really weird is going on. There was an exception in the test logs:
    
    ```
    14/10/29 13:41:35.349 task-result-getter-0 WARN TaskSetManager: Lost task 0.0 in stage 0.0 (TID 0, localhost): java.lang.NoClassDefFoundError: org/objenesis/strategy/InstantiatorStrategy
            org.apache.spark.serializer.KryoSerializer.newKryo(KryoSerializer.scala:72)
            org.apache.spark.serializer.KryoSerializerInstance.<init>(KryoSerializer.scala:157)
            org.apache.spark.serializer.KryoSerializer.newInstance(KryoSerializer.scala:119)
            org.apache.spark.storage.DiskBlockObjectWriter.open(BlockObjectWriter.scala:127)
            org.apache.spark.storage.DiskBlockObjectWriter.write(BlockObjectWriter.scala:192)
      org.apache.spark.shuffle.hash.HashShuffleWriter$$anonfun$write$1.apply(HashShuffleWriter.scala:67)
            org.apache.spark.shuffle.hash.HashShuffleWriter$$anonfun$write$1.apply(HashShuffleWriter.scala:65)
            scala.collection.Iterator$class.foreach(Iterator.scala:727)
            scala.collection.AbstractIterator.foreach(Iterator.scala:1157)
            org.apache.spark.shuffle.hash.HashShuffleWriter.write(HashShuffleWriter.scala:65)
            org.apache.spark.scheduler.ShuffleMapTask.runTask(ShuffleMapTask.scala:68)
    
    ```
    
    I looked a bit more and for some reason the `objenesis-1.2.jar` file is not included in the assembly classpath when Hive 0.13 is used at compile even though it is included with Hive 0.12. They both should logically depend on it via twitter chill, so I'm pretty stumped. I'm continuing to investigate. You can find the dependency classpath using:
    
    ```
    sbt/sbt -Pyarn -Phadoop-2.3 -Dhadoop.version=2.3.0 -Pkinesis-asl -Phive -Phive-0.13.1 "show assembly/compile:dependency-classpath"
    ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19653264
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    +    | grep -v -e "info.*Resolving" -e "warn.*Merging" -e "info.*Including"
    +
    +  # Then build with default version(0.13.1) because tests are based on this version
    +  echo "[info] Building Spark with these arguments: $SBT_MAVEN_PROFILES_ARGS -Phive"
    +  echo -e "q\n" \
    +    | sbt/sbt $SBT_MAVEN_PROFILES_ARGS -Phive clean package assembly/assembly  \
    --- End diff --
    
    Can we try removing the "clean" here and see if things work? It will take a lot more time to double compile everything which is likely contributing to timeouts.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60852226
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22382/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61055173
  
    @scwf I locally test it, but with the hive uber jar. Later, it seems that the spark-project jar for 0.13.1 is not available 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61180636
  
    Or in python test we need call ```reset``` of TestHiveContext, i am not good at python, but i will have a try. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60860496
  
      [Test build #22410 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22410/consoleFull) for   PR 2685 at commit [`dddeb54`](https://github.com/apache/spark/commit/dddeb549e268591a71853aaab01342f92ab8837c).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60815370
  
    @scwf, okay cool that sounds like a good compromise.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60852574
  
      [Test build #491 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/491/consoleFull) for   PR 2685 at commit [`8a4daf2`](https://github.com/apache/spark/commit/8a4daf2a3d9a1529eb8f6598bbb619e6a0283002).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60843170
  
    I think you have the right idea, but maybe we can isolate the changes a little as I commented on the diff.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60867822
  
    I tested the master branch locally, it also failed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60839388
  
    Seems build with hive-0.12 and test with 0.13, should we remove ``` -Phive-0.12.0``` in ```BUILD_MVN_PROFILE_ARGS```?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214412
  
    retest this please


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052933
  
    yeah we can have a try to use kryo 2.22 and original objenesis in core


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60815376
  
    Build changes LGTM


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214875
  
    @pwendell So I think another doable plan is to make a shaded Kryo 2.21 artifact. In this artifact, we leave the binary jar file untouched, but make the POM consistent with the jar file, i.e., don't shade Objenesis in Kryo 2.21.
    
    Shading Kryo 2.22 in org.spark-project.hive:hive-exec (as what you have already done) should also work, but this inconsistency in Kryo 2.21 is really a time bomb...


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19251134
  
    --- Diff: sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/server/SparkSQLOperationManager.scala ---
    @@ -54,158 +45,8 @@ private[thriftserver] class SparkSQLOperationManager(hiveContext: HiveContext)
           confOverlay: JMap[String, String],
           async: Boolean): ExecuteStatementOperation = synchronized {
     
    -    val operation = new ExecuteStatementOperation(parentSession, statement, confOverlay) {
    -      private var result: SchemaRDD = _
    -      private var iter: Iterator[SparkRow] = _
    -      private var dataTypes: Array[DataType] = _
    -
    -      def close(): Unit = {
    -        // RDDs will be cleaned automatically upon garbage collection.
    -        logDebug("CLOSING")
    -      }
    -
    -      def getNextRowSet(order: FetchOrientation, maxRowsL: Long): RowSet = {
    -        if (!iter.hasNext) {
    -          new RowSet()
    -        } else {
    -          // maxRowsL here typically maps to java.sql.Statement.getFetchSize, which is an int
    -          val maxRows = maxRowsL.toInt
    -          var curRow = 0
    -          var rowSet = new ArrayBuffer[Row](maxRows.min(1024))
    -
    -          while (curRow < maxRows && iter.hasNext) {
    -            val sparkRow = iter.next()
    -            val row = new Row()
    -            var curCol = 0
    -
    -            while (curCol < sparkRow.length) {
    -              if (sparkRow.isNullAt(curCol)) {
    -                addNullColumnValue(sparkRow, row, curCol)
    -              } else {
    -                addNonNullColumnValue(sparkRow, row, curCol)
    -              }
    -              curCol += 1
    -            }
    -            rowSet += row
    -            curRow += 1
    -          }
    -          new RowSet(rowSet, 0)
    -        }
    -      }
    -
    -      def addNonNullColumnValue(from: SparkRow, to: Row, ordinal: Int) {
    -        dataTypes(ordinal) match {
    -          case StringType =>
    -            to.addString(from(ordinal).asInstanceOf[String])
    -          case IntegerType =>
    -            to.addColumnValue(ColumnValue.intValue(from.getInt(ordinal)))
    -          case BooleanType =>
    -            to.addColumnValue(ColumnValue.booleanValue(from.getBoolean(ordinal)))
    -          case DoubleType =>
    -            to.addColumnValue(ColumnValue.doubleValue(from.getDouble(ordinal)))
    -          case FloatType =>
    -            to.addColumnValue(ColumnValue.floatValue(from.getFloat(ordinal)))
    -          case DecimalType =>
    -            val hiveDecimal = from.get(ordinal).asInstanceOf[BigDecimal].bigDecimal
    -            to.addColumnValue(ColumnValue.stringValue(new HiveDecimal(hiveDecimal)))
    -          case LongType =>
    -            to.addColumnValue(ColumnValue.longValue(from.getLong(ordinal)))
    -          case ByteType =>
    -            to.addColumnValue(ColumnValue.byteValue(from.getByte(ordinal)))
    -          case ShortType =>
    -            to.addColumnValue(ColumnValue.shortValue(from.getShort(ordinal)))
    -          case TimestampType =>
    -            to.addColumnValue(
    -              ColumnValue.timestampValue(from.get(ordinal).asInstanceOf[Timestamp]))
    -          case BinaryType | _: ArrayType | _: StructType | _: MapType =>
    -            val hiveString = result
    -              .queryExecution
    -              .asInstanceOf[HiveContext#QueryExecution]
    -              .toHiveString((from.get(ordinal), dataTypes(ordinal)))
    -            to.addColumnValue(ColumnValue.stringValue(hiveString))
    -        }
    -      }
    -
    -      def addNullColumnValue(from: SparkRow, to: Row, ordinal: Int) {
    -        dataTypes(ordinal) match {
    -          case StringType =>
    -            to.addString(null)
    -          case IntegerType =>
    -            to.addColumnValue(ColumnValue.intValue(null))
    -          case BooleanType =>
    -            to.addColumnValue(ColumnValue.booleanValue(null))
    -          case DoubleType =>
    -            to.addColumnValue(ColumnValue.doubleValue(null))
    -          case FloatType =>
    -            to.addColumnValue(ColumnValue.floatValue(null))
    -          case DecimalType =>
    -            to.addColumnValue(ColumnValue.stringValue(null: HiveDecimal))
    -          case LongType =>
    -            to.addColumnValue(ColumnValue.longValue(null))
    -          case ByteType =>
    -            to.addColumnValue(ColumnValue.byteValue(null))
    -          case ShortType =>
    -            to.addColumnValue(ColumnValue.shortValue(null))
    -          case TimestampType =>
    -            to.addColumnValue(ColumnValue.timestampValue(null))
    -          case BinaryType | _: ArrayType | _: StructType | _: MapType =>
    -            to.addColumnValue(ColumnValue.stringValue(null: String))
    -        }
    -      }
    -
    -      def getResultSetSchema: TableSchema = {
    --- End diff --
    
    In hive12 and hive13, this function keep the same, we don't need the duplicated in two files.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60958696
  
    Experimented with the last commit before #2241, tried to add Hive 0.13.1 dependencies to core/pom.xml to see which one may cause HashShuffleSuite failure. Unfortunately adding only org.spark-project.hive:hive-metastore:0.13.1 is enough to break Spark core test.
    
    To reproduce this:
    
    1. Check out https://github.com/apache/spark/tree/0e88661
    1. `git clean -dfx`
    1. Add the following dependency to core/pom.xml:
    
       ```xml
        <dependency>
          <groupId>org.spark-project.hive</groupId>
          <artifactId>hive-metastore</artifactId>
          <version>${hive.version}</version>
        </dependency>
       ```
    
    1. Run `./sbt/sbt -Phadoop-2.3 clean assembly/assembly "core/test-only *.HashShuffleSuite"`
    
    According to @scwf, genuine Apache Hive jar files should work.
    
    /cc @pwendell @marmbrus 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61227318
  
    @zhzhan Had a glance of the Kryo issue you pointed out should be related to the POM inconsistency problem I mentioned above, but I'm not sure whether they are identical since this issue doesn't mention any specific Kryo version. I'll investigate this later. Although Jenkins finally nods, I'm still puzzled with the root cause of the original build failure. (We only know that introducing Kryo 2.22 prevents un-shaded Objenesis classes from being included in the assembly jar and thus breaks the core tests, but how and why?)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61178005
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22560/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-58572834
  
    Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61030054
  
     Really weird for core tests failed difference between apache.hive and spark-project.hive, locally all core tests failed with apache.hive due to 
    ```
    [info]   org.apache.spark.SparkException: Job aborted due to stage failure: Task 2 in stage 0.0 failed 4 times, most recent failure: Lost task 2.3 in stage 0.0 (TID 9, localhost): java.net.SocketTimeoutException: Read timed out
    [info]         java.net.SocketInputStream.socketRead0(Native Method)
    [info]         java.net.SocketInputStream.read(SocketInputStream.java:152)
    [info]         java.net.SocketInputStream.read(SocketInputStream.java:122)
    [info]         java.io.BufferedInputStream.fill(BufferedInputStream.java:235)
    [info]         java.io.BufferedInputStream.read1(BufferedInputStream.java:275)
    [info]         java.io.BufferedInputStream.read(BufferedInputStream.java:334)
    [info]         sun.net.www.http.HttpClient.parseHTTPHeader(HttpClient.java:687)
    [info]         sun.net.www.http.HttpClient.parseHTTP(HttpClient.java:633)
    [info]         sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1323)
    [info]         org.apache.spark.broadcast.HttpBroadcast$.org$apache$spark$broadcast$HttpBroadcast$$read(HttpBroadcast.scala:202)
    ```



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19384151
  
    --- Diff: dev/run-tests ---
    @@ -167,7 +167,7 @@ CURRENT_BLOCK=$BLOCK_SPARK_UNIT_TESTS
       # If the Spark SQL tests are enabled, run the tests with the Hive profiles enabled.
       # This must be a single argument, as it is.
       if [ -n "$_RUN_SQL_TESTS" ]; then
    -    SBT_MAVEN_PROFILES_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive"
    +    SBT_MAVEN_PROFILES_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
    --- End diff --
    
    is this change ncessary? seems like it might be good to leave it how it is now


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by coderfi <gi...@git.apache.org>.
Github user coderfi commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61417156
  
    In /pom.xml
    
        <properties>
            <hive.version>0.13.1a</hive.version>
        </properties>
        ...
        <profile>
          <id>hive-0.13.1</id>
          ...
          <properties>
            <hive.version>0.13.1</hive.version>
            ...
          </properties>
        </profile>
    
    The 'hive-0.13.1' is overriding the hive.version back to the regular 'hive-0.13.1' version, instead of referencing 'hive-0.13.1a'.
    
    It seems `mvn -Phive` would get me a build with hive-0.13a containing this pull's patches.
    
    However, I was building with `mvn -Phive -Phive-0.13.1`, so I ran into the 'NoClassDefFoundError ... InstantiatorStrategy' issue.
    Omitting the 'hive-0.13.1' profile got me past the issue (since the pom defaults effectively gives me hive 0.13(a) anyway, which is what I want).
    
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61216213
  
    @liancheng  do you mean the problem mentioned here?
    https://github.com/EsotericSoftware/kryo/issues/189


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61039529
  
    Still failed and get this error
    ```
    ......
    Caused by: java.lang.ClassNotFoundException: com.esotericsoftware.shaded.org.objenesis.strategy.InstantiatorStrategy
    ```
    i am checking whether this class in assembly jar


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60872647
  
    @scwf How about Spark dev related environment variable settings on your host? I unset `SPARK_PREPEND_CLASSES` and set `SPARK_TESTING` to 1 before running `core/test` (with same profiles as you did), up until now no test failures found locally :( Investigating unit-tests.log files of the Jenkins failure. I can send you a copy offline (this log archive is automatically uploaded to Jenkins master, but we haven't got time to make it available publicly for all developers).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051898
  
    @pwendell,spark depends on kryo 2.21 which not  shaded objenesis while hive 0.13 depends on kryo 2.22 and it shaded objenesis. So excluding will not fix the problem because in hive can not find the shaded class


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052493
  
    @pwendell @scwf What I mean is that com.esotericsoftware is again shaded in hive as org.apache.hive.com.esotericsoftware. I think that's the reason why the original hive package work against spark. But the spark-project:hive-exec does not include the shaded org.apache.hive.com.esotericsoftware, and need to relink which cause the version confliction.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052558
  
    Unfortunately the most recent version of Chill still stick on Kryo 2.21 :(


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052124
  
    @pwendell com.esotericsoftware is already shaded in hive. Will it work if we keep it in hive-exec.jar? Please advice.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051870
  
    Based on the most recent failures, it seems like somehow the test classpath is still using kryo 2.22.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60839602
  
    I agree, but I would like to continue to at least build with hive-12 so that we make sure patches do not break the hive 12 build.  Perhaps we can build first with hive 12, then build the assembly with hive-13 then continue with the tests?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214280
  
    @pwendell Yesterday, I tried to include both Kryo 2.21 and Kryo 2.22 into the assembly jar, so that both shaded and un-shaded Objenesis should co-exist. But to my surprise, no matter how I specify the dependencies, the un-shaded Objenesis classes can never appear in the assembly jar. Then I track down to the Kryo 2.21 POM on Maven repo [1], and you may see that this POM says Objenesis is *shaded* in 2.21, but apparently the kryo-2.21.jar binary doesn't shade Objenesis:
    
    ```
    $ tree -d kryo-2.21
    kryo-2.21
    ├── META-INF
    │   └── maven
    │       └── com.esotericsoftware.kryo
    │           └── kryo
    ├── com
    │   └── esotericsoftware
    │       ├── kryo
    │       │   ├── io
    │       │   ├── serializers
    │       │   └── util
    │       ├── minlog
    │       └── reflectasm
    │           └── shaded
    │               └── org
    │                   └── objectweb
    │                       └── asm
    └── org                           -.
        └── objenesis                  |
            ├── instantiator           |
            │   ├── basic              |
            │   ├── gcj                 > Un-shaded Objenesis classes
            │   ├── jrockit            |
            │   ├── perc               |
            │   └── sun                |
            └── strategy              -'
    ```
    
    [1] https://repo1.maven.org/maven2/com/esotericsoftware/kryo/kryo/2.21/kryo-2.21.pom
    
    I'm not very familiar with Maven plugin behaviors, but I guess while building the assembly jar, the `maven-shade-plugin` reads the inconsistent Kryo 2.21 POM retrieved from Maven repo, and always shades Objenesis, which directly causes the class not found error we saw in test failures. 



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052753
  
    Another idea - what if we upgrade Kryo to 2.22 explicitly in our core pom? If 2.22 is binary compatible with 2.21 it could work. If chill direclty uses objenesis, we might need to include the original objenesis as well explicitly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61050366
  
    @scwf Hmm, you mean the dev/run-test does not run pyspark? I locally run dev/run-test today and months ago, and didn't met pyspark error. How can I invoke pyspark test locally?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61040483
  
    Just exclude kryo is not enough, should we reshade hive 0.13.1 jar? @pwendell 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60843065
  
    ok


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61053806
  
    Yeah this seems reasonable to try. Whether it will work depends on whether kryo 2.22 and 2.21 are compatible.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051681
  
    @scwf I checked dev/run-tests, it does invoke python/run-tests. Didn't you also run it locally and succeed, or I miss anything?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61222108
  
    So will you re-publish the hive 0.13 jar @pwendell? or use 0.13.1a? 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60612290
  
    @scwf Sorry for not being able to give a thorough review earlier. This generally LGTM. I'll rebase #2887 after this PR is merged. Thanks for all the tough work!


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-58133473
  
      [QA tests have started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/277/consoleFull) for   PR 2685 at commit [`3a08b14`](https://github.com/apache/spark/commit/3a08b14710e7b5649c6de5406c257ce8d59dc8e0).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61056389
  
    @scwf Yea, same here.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61051691
  
    Just to make it more intuitive, made a dependency graph to illustrate the issue:
    
    ![dependency-hell](http://tinyurl.com/q5opqe2)
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60866610
  
    Hmm, still failing... @liancheng can you take a look?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61184011
  
    can you add reset to there and have a test?@zhzhan


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19515484
  
    --- Diff: dev/run-tests ---
    @@ -147,12 +147,18 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    +    | grep -v -e "info.*Resolving" -e "warn.*Merging" -e "info.*Including"
    +
    +  # Then build with default version(0.13.1) because tests based on this version
    +  echo -e "q\n" \
    +    | sbt/sbt $SBT_MAVEN_PROFILES_ARGS -Phive clean assembly/assembly  \
    --- End diff --
    
    Lets put `package` back in between `clean` and `assembly/assembly`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61055622
  
    Still failed... 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61054034
  
    @pwendell Kryo 2.22 in hive is already shaded by hive itself. My understanding is that shaded actually make it private to hive itself, and it is invisible to other module, e.g., spark. Maybe my understanding is wrong here. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61183571
  
    @scwf It looks like pyspark does not use TestHiveContext. Instead probably we can change the sql.py, 
        >>> hiveCtx = LocalHiveContext(sc)
    + >>> try:
    +  ...     hiveCtx.sql("CREATE DATABASE default")
    +  ...     hiveCtx.sql("USE default")
    +  ... except Exception:
        ...     pass
        >>> try:
        ...     supress = hiveCtx.sql("DROP TABLE src")


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61180469
  
    The server will do a "git clean -fdx" before every run. Maybe you can reproduce it with this option? (warning, this will remove any working state you have in your local spark install).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60818913
  
    ok to test


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61214405
  
    add to whitelist


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19653776
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    +    | grep -v -e "info.*Resolving" -e "warn.*Merging" -e "info.*Including"
    +
    +  # Then build with default version(0.13.1) because tests are based on this version
    +  echo "[info] Building Spark with these arguments: $SBT_MAVEN_PROFILES_ARGS -Phive"
    +  echo -e "q\n" \
    +    | sbt/sbt $SBT_MAVEN_PROFILES_ARGS -Phive clean package assembly/assembly  \
    --- End diff --
    
    sure


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60859900
  
      [Test build #491 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/491/consoleFull) for   PR 2685 at commit [`8a4daf2`](https://github.com/apache/spark/commit/8a4daf2a3d9a1529eb8f6598bbb619e6a0283002).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by qiaohaijun <gi...@git.apache.org>.
Github user qiaohaijun commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60730158
  
    is this merge to master branch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61043663
  
    Two potential workaround for this:
    1 change kryo version in hive to fix the conflict
    2 to shade chill 
    other idea?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60846349
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22386/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61179025
  
    How can i test pyspark separately locally


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052820
  
    upgrade Kryo in core will get compile error


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60866843
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22410/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052784
  
    Hm, since Kryo 2.21 refer to the non-shaded version of Objenesis, while Kryo 2.22 refer to the shaded version, it should OK to let them coexist in Spark, right? @pwendell How was the org.spark-project.hive jars shaded? Is there a public repo for it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61231533
  
      [Test build #22602 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22602/consoleFull) for   PR 2685 at commit [`f26f3be`](https://github.com/apache/spark/commit/f26f3beca0fdc522a04b4491cf7fa391edbce445).
     * This patch **passes all tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052630
  
    @liancheng yeah - I just noticed that :(


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61165577
  
    cross your fingers


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60822672
  
      [Test build #488 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/488/consoleFull) for   PR 2685 at commit [`bcf943f`](https://github.com/apache/spark/commit/bcf943f8fffcbd8c452651130eb5d232ea803e4f).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60871776
  
    Local testing each test suite individually doesn't fail, guess another case related to test suite execution order.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61053748
  
    I am testing as follows, it's ok?
    ```
    --- a/core/pom.xml
    +++ b/core/pom.xml
    @@ -125,10 +125,32 @@
         <dependency>
           <groupId>com.twitter</groupId>
           <artifactId>chill_${scala.binary.version}</artifactId>
    +      <exclusions>
    +        <exclusion>
    +          <groupId>com.esotericsoftware.kryo</groupId>
    +          <artifactId>kryo</artifactId>
    +        </exclusion>
    +      </exclusions>
         </dependency>
         <dependency>
           <groupId>com.twitter</groupId>
           <artifactId>chill-java</artifactId>
    +      <exclusions>
    +        <exclusion>
    +          <groupId>com.esotericsoftware.kryo</groupId>
    +          <artifactId>kryo</artifactId>
    +        </exclusion>
    +      </exclusions>
    +    </dependency>
    +    <dependency>
    +      <groupId>com.esotericsoftware.kryo</groupId>
    +      <artifactId>kryo</artifactId>
    +      <version>2.22</version>
    +    </dependency>
    +    <dependency>
    +      <groupId>org.objenesis</groupId>
    +      <artifactId>objenesis</artifactId>
    +      <version>1.2</version>
         </dependency>
         <dependency>
           <groupId>org.roaringbitmap</groupId>
    ```
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60999502
  
    FYI - I was able to reproduce this failure by simply building with Hive 13 and running with Hive 13:
    
    https://github.com/apache/spark/pull/2999/files
    
    Hopefully ths helps narrow it down


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61203788
  
    Jenkins, retest this please.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61052641
  
    @pwendell  link https://github.com/twitter/chill/commit/3869b0122660c908e189ff08b615bd7221956224
    chill revert kryo for unknown reason


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61048834
  
    @zhzhan,  original hive failed pyspark, see #3004


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61162936
  
    Ok, i am testing.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61055765
  
    @zhzhan, you can use spark-project jar to test and it will failed, based on #2241  


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61045639
  
    I am not expert on this. Looks like esotericsoftware is already shaded in hive. Is it helpful if org.spark-prjoect:hive-exec include the shaded jar? Since the original hive jar works and no conflicts.  
                  <relocations>
                    <relocation>
                      <pattern>com.esotericsoftware</pattern>
                      <shadedPattern>org.apache.hive.com.esotericsoftware</shadedPattern>
                    </relocation>


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60840998
  
    I think @scwf is right.  Looks like we are using hive-0.12 client to connect to hive-0.13, which is not compatible. I met the similar problem before.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60845942
  
      [Test build #22386 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22386/consoleFull) for   PR 2685 at commit [`0d7f6cf`](https://github.com/apache/spark/commit/0d7f6cf958cb293051719c60020d26a552063d31).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61443613
  
    And note just to change ```hive.version``` to 0.13.1a, ```hive.version.short``` should be 0.31.1.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61203698
  
    I think this failed an already flaky test - so let's try again.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61054467
  
    @pwendell Thanks for the clarification. That's what I mean to shade com.esotericsoftware in spark-project:hive-exec. 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61184081
  
    Yeah, but its not used here: https://github.com/apache/spark/blob/master/python/pyspark/sql.py#L1409
    
    We can probably just get rid of this test.  That class is deprecated.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61217097
  
    @liancheng I already published a version of Hive that uses Kryo 2.21. This should no longer require 2.22 anywhere.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60847838
  
      [Test build #22390 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22390/consoleFull) for   PR 2685 at commit [`8a4daf2`](https://github.com/apache/spark/commit/8a4daf2a3d9a1529eb8f6598bbb619e6a0283002).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19256113
  
    --- Diff: pom.xml ---
    @@ -1272,16 +1276,33 @@
             </dependency>
           </dependencies>
         </profile>
    -
         <profile>
    -      <id>hive</id>
    +      <id>hive-0.12.0</id>
           <activation>
             <activeByDefault>false</activeByDefault>
           </activation>
           <modules>
             <module>sql/hive-thriftserver</module>
           </modules>
    +      <properties>
    +        <hive.version>0.12.0-protobuf-2.5</hive.version>
    +        <hive.version.short>0.12.0</hive.version.short>
    +        <derby.version>10.4.2.0</derby.version>
    +      </properties>
    +    </profile>
    +    <profile>
    +      <id>hive-0.13.1</id>
    +      <activation>
    +        <activeByDefault>false</activeByDefault>
    +      </activation>
    +      <modules>
    +        <module>sql/hive-thriftserver</module>
    --- End diff --
    
    Yes. either move to modules, or move to hive profile, since it is supported in both versions.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61224765
  
      [Test build #22602 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22602/consoleFull) for   PR 2685 at commit [`f26f3be`](https://github.com/apache/spark/commit/f26f3beca0fdc522a04b4491cf7fa391edbce445).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by liancheng <gi...@git.apache.org>.
Github user liancheng commented on a diff in the pull request:

    https://github.com/apache/spark/pull/2685#discussion_r19654494
  
    --- Diff: dev/run-tests ---
    @@ -142,17 +142,24 @@ CURRENT_BLOCK=$BLOCK_BUILD
       # We always build with Hive because the PySpark Spark SQL tests need it.
       BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
     
    -  echo "[info] Building Spark with these arguments: $BUILD_MVN_PROFILE_ARGS"
     
       # NOTE: echo "q" is needed because sbt on encountering a build file with failure
       #+ (either resolution or compilation) prompts the user for input either q, r, etc
       #+ to quit or retry. This echo is there to make it not block.
    -  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a 
    +  # NOTE: Do not quote $BUILD_MVN_PROFILE_ARGS or else it will be interpreted as a
       #+ single argument!
       # QUESTION: Why doesn't 'yes "q"' work?
       # QUESTION: Why doesn't 'grep -v -e "^\[info\] Resolving"' work?
    +  # First build with 0.12 to ensure patches do not break the hive 12 build
    +  echo "[info] Compile with hive 0.12"
       echo -e "q\n" \
    -    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean package assembly/assembly \
    +    | sbt/sbt $BUILD_MVN_PROFILE_ARGS clean hive/compile hive-thriftserver/compile \
    --- End diff --
    
    It should be, `BUILD_MVN_PROFILE_ARGS` is defined above with `-Phive-0.12.0`:
    
    ```
      BUILD_MVN_PROFILE_ARGS="$SBT_MAVEN_PROFILES_ARGS -Phive -Phive-0.12.0"
    ```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by marmbrus <gi...@git.apache.org>.
Github user marmbrus commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60641434
  
    @chenghao-intel, because Hive is not compatible with metastores of a different version we will need to support Hive 12 and 13 concurrently for at least a little while.  Long term I would like to switch to using HCatalog so that we can connect to different metastore versions without needing to compile against different versions of the hive library.
    
    @scwf, @pwendell has some concerns about adding another project, since each project incurs significant overhead in build time.  For this reason, perhaps we should just consolidate all the Shim code into a the existing hive subproject.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60821613
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/22369/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by scwf <gi...@git.apache.org>.
Github user scwf commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61039848
  
    In assembly jar the class is here
    ```/org/objenesis/strategy/InstantiatorStrategy.class```
    it seems the class name here wrong, should be ```org.objenesis.strategy.InstantiatorStrategy``` but not ```com.esotericsoftware.shaded.```


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by pwendell <gi...@git.apache.org>.
Github user pwendell commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61161407
  
    Okay I pushed a new version but you'll need to add a repository:
    https://oss.sonatype.org/content/repositories/orgspark-project-1089/
    
    Also, you'll need to use hive version string "0.13.1a" for the Hive 13 profilee.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by zhzhan <gi...@git.apache.org>.
Github user zhzhan commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61222998
  
    whew, finally.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by chenghao-intel <gi...@git.apache.org>.
Github user chenghao-intel commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-60559682
  
    Sorry, are we going to support both Hive 0.12 & 0.13.1 in long term? I am working on SerDe stuff #2570, seems lots of method signatures changed after upgrading to 0.13.1., as well as the `ObjectInspector`.  If we need to support both versions, probably the Shim code will be complicated.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-3826][SQL]enable hive-thriftserver to s...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/2685#issuecomment-61204241
  
      [Test build #22588 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/22588/consoleFull) for   PR 2685 at commit [`f5cac74`](https://github.com/apache/spark/commit/f5cac74bfcf683cfe9129ea7917fa44cc023c277).
     * This patch merges cleanly.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org