Posted to reviews@spark.apache.org by srowen <gi...@git.apache.org> on 2015/12/23 13:23:31 UTC

[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

GitHub user srowen opened a pull request:

    https://github.com/apache/spark/pull/10446

    [SPARK-12481] [CORE] [STREAMING] [SQL] Remove usage of Hadoop deprecated APIs and reflection that supported 1.x

    Remove use of deprecated Hadoop APIs now that 2.2+ is required

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/srowen/spark SPARK-12481

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10446.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10446
    
----
commit 2c9f628bce92d916caa78a0532da6b112f3dc70b
Author: Sean Owen <so...@cloudera.com>
Date:   2015-12-23T12:23:19Z

    Remove use of deprecated Hadoop APIs now that 2.2+ is required

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167590962
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48366/
    Test PASSed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167907581
  
    LGTM; there's at least one extra possible cleanup, but feel free to punt on that one.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166982259
  
    **[Test build #2253 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2253/consoleFull)** for PR 10446 at commit [`9864b68`](https://github.com/apache/spark/commit/9864b682eb5e432270bd1bbcc502a932a9a6b70b).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166947653
  
    **[Test build #48243 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48243/consoleFull)** for PR 10446 at commit [`9864b68`](https://github.com/apache/spark/commit/9864b682eb5e432270bd1bbcc502a932a9a6b70b).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167867032
  
    Yea I think this one is fine. 
    
    (I haven't looked at the detailed changes though)




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48581350
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/SqlNewHadoopRDD.scala ---
    @@ -26,10 +26,10 @@ import org.apache.hadoop.conf.{Configurable, Configuration}
     import org.apache.hadoop.io.Writable
     import org.apache.hadoop.mapreduce._
     import org.apache.hadoop.mapreduce.lib.input.{CombineFileSplit, FileSplit}
    +import org.apache.hadoop.mapreduce.task.{TaskAttemptContextImpl, JobContextImpl}
    --- End diff --
    
    nit: order




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167987565
  
    **[Test build #2266 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2266/consoleFull)** for PR 10446 at commit [`5e3318a`](https://github.com/apache/spark/commit/5e3318a4ce7e1ac90c5a3a2e448377f6bfa95948).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166947723
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48243/
    Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167590960
  
    Merged build finished. Test PASSed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167985799
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48475/
    Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166888672
  
    Merged build finished. Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166947721
  
    Merged build finished. Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48581127
  
    --- Diff: core/src/main/scala/org/apache/spark/SparkHadoopWriter.scala ---
    @@ -143,6 +141,12 @@ class SparkHadoopWriter(jobConf: JobConf)
         taskContext
       }
     
    +  protected def newTaskAttemptContext(
    --- End diff --
    
    Is this still needed? It seems you replaced the call with a direct call to the constructor in a lot of places.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167590880
  
    **[Test build #48366 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48366/consoleFull)** for PR 10446 at commit [`1fffa8e`](https://github.com/apache/spark/commit/1fffa8e73398347663a568496d4ab14ecf7cff2c).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48581178
  
    --- Diff: core/src/main/scala/org/apache/spark/rdd/NewHadoopRDD.scala ---
    @@ -26,11 +26,11 @@ import org.apache.hadoop.conf.{Configurable, Configuration}
     import org.apache.hadoop.io.Writable
     import org.apache.hadoop.mapreduce._
     import org.apache.hadoop.mapreduce.lib.input.{CombineFileSplit, FileSplit}
    +import org.apache.hadoop.mapreduce.task.{TaskAttemptContextImpl, JobContextImpl}
    --- End diff --
    
    nit: order




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166923370
  
    **[Test build #48243 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48243/consoleFull)** for PR 10446 at commit [`9864b68`](https://github.com/apache/spark/commit/9864b682eb5e432270bd1bbcc502a932a9a6b70b).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167985797
  
    Merged build finished. Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen closed the pull request at:

    https://github.com/apache/spark/pull/10446




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48593961
  
    --- Diff: core/src/main/scala/org/apache/spark/SparkHadoopWriter.scala ---
    @@ -143,6 +141,12 @@ class SparkHadoopWriter(jobConf: JobConf)
         taskContext
       }
     
    +  protected def newTaskAttemptContext(
    --- End diff --
    
    Yea, it's overridden in a mock object in `OutputCommitCoordinatorSuite`, so I kept it.
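
Editor's sketch of the pattern discussed here: a `protected` factory method kept as a hook so a test double can override how the context object is built. Every name below (`AttemptContext`, `Writer`, `MockWriter`, `newAttemptContext`) is a hypothetical stand-in, not the actual `SparkHadoopWriter` or Hadoop code.

```scala
// Hypothetical stand-ins; none of these are Spark or Hadoop classes.
trait AttemptContext { def describe: String }

final class RealAttemptContext(jobId: String, attempt: Int) extends AttemptContext {
  def describe: String = s"real:$jobId:$attempt"
}

class Writer(jobId: String) {
  // Production code builds the context through this hook, so a subclass can swap it out.
  protected def newAttemptContext(attempt: Int): AttemptContext =
    new RealAttemptContext(jobId, attempt)

  def setup(attempt: Int): String = newAttemptContext(attempt).describe
}

// A test double overrides only the factory method and leaves everything else alone.
class MockWriter(jobId: String) extends Writer(jobId) {
  override protected def newAttemptContext(attempt: Int): AttemptContext =
    new AttemptContext { def describe: String = s"mock:$attempt" }
}

object FactoryHookDemo {
  def main(args: Array[String]): Unit = {
    println(new Writer("job_1").setup(0))     // uses the real context
    println(new MockWriter("job_1").setup(0)) // uses the overridden context
  }
}
```

Replacing every call to the hook with a direct constructor call would remove this seam, which is presumably why the method stays even though other call sites were inlined.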




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48581268
  
    --- Diff: core/src/main/scala/org/apache/spark/scheduler/EventLoggingListener.scala ---
    @@ -97,7 +97,7 @@ private[spark] class EventLoggingListener(
        * Creates the log file in the configured log directory.
        */
       def start() {
    -    if (!fileSystem.getFileStatus(new Path(logBaseDir)).isDir) {
    +    if (!fileSystem.getFileStatus(new Path(logBaseDir)).isDirectory) {
    --- End diff --
    
    Unrelated to this line: this class has the `hadoopFlushMethod` hack which probably can go away now, if you want to do more cleanup.
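
Editor's sketch of why such a hack can go away. It assumes the hack resolves a flush method by name through reflection because the method name differed between supported library versions; that is an assumption about its shape, not a quote of the Spark source. `NewStream` is a hypothetical stand-in for the real stream class, and `hflush` and `sync` are used here only as illustrative method names.

```scala
import scala.util.Try

// Hypothetical stand-in for an output stream from the newer library version.
class NewStream {
  var flushed = 0
  def hflush(): Unit = flushed += 1
}

object ReflectiveFlushDemo {
  // Reflection-based shim: resolve whichever method name exists on this class.
  def reflectiveFlush(stream: AnyRef): Unit = {
    val cls = stream.getClass
    val method = Try(cls.getMethod("hflush")).getOrElse(cls.getMethod("sync"))
    method.invoke(stream)
  }

  // Once only the newer version is supported, a plain call is enough.
  def directFlush(stream: NewStream): Unit = stream.hflush()

  def main(args: Array[String]): Unit = {
    val s = new NewStream
    reflectiveFlush(s)
    directFlush(s)
    println(s.flushed)
  }
}
```

The direct call is checked by the compiler, whereas the reflective version only fails at runtime if neither method exists.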




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166884922
  
    **[Test build #48240 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48240/consoleFull)** for PR 10446 at commit [`2c9f628`](https://github.com/apache/spark/commit/2c9f628bce92d916caa78a0532da6b112f3dc70b).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167971529
  
    **[Test build #48475 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48475/consoleFull)** for PR 10446 at commit [`5e3318a`](https://github.com/apache/spark/commit/5e3318a4ce7e1ac90c5a3a2e448377f6bfa95948).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166960152
  
    **[Test build #2253 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2253/consoleFull)** for PR 10446 at commit [`9864b68`](https://github.com/apache/spark/commit/9864b682eb5e432270bd1bbcc502a932a9a6b70b).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-168390287
  
    Merged to master




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167773896
  
    Pinging folks like @vanzin @pwendell @rxin @sarutak 
    I don't think this one is controversial, as it just follows from removing older Hadoop support and cleans up the build. Still, it's not a small change, so I wouldn't mind another set of eyes on it.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by vanzin <gi...@git.apache.org>.
Github user vanzin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/10446#discussion_r48581372
  
    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/WriterContainer.scala ---
    @@ -24,10 +24,9 @@ import scala.collection.JavaConverters._
     import org.apache.hadoop.fs.Path
     import org.apache.hadoop.mapreduce._
     import org.apache.hadoop.mapreduce.lib.output.{FileOutputCommitter => MapReduceFileOutputCommitter}
    +import org.apache.hadoop.mapreduce.task.TaskAttemptContextImpl
     import org.apache.spark._
    --- End diff --
    
    nit: not your fault but a good opportunity to add a blank line here.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166888673
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/48240/
    Test FAILed.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167985723
  
    **[Test build #48475 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48475/consoleFull)** for PR 10446 at commit [`5e3318a`](https://github.com/apache/spark/commit/5e3318a4ce7e1ac90c5a3a2e448377f6bfa95948).
     * This patch **fails PySpark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-166888644
  
    **[Test build #48240 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48240/consoleFull)** for PR 10446 at commit [`2c9f628`](https://github.com/apache/spark/commit/2c9f628bce92d916caa78a0532da6b112f3dc70b).
     * This patch **fails MiMa tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167566796
  
    **[Test build #48366 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/48366/consoleFull)** for PR 10446 at commit [`1fffa8e`](https://github.com/apache/spark/commit/1fffa8e73398347663a568496d4ab14ecf7cff2c).




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-168004825
  
    **[Test build #2266 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2266/consoleFull)** for PR 10446 at commit [`5e3318a`](https://github.com/apache/spark/commit/5e3318a4ce7e1ac90c5a3a2e448377f6bfa95948).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.




[GitHub] spark pull request: [SPARK-12481] [CORE] [STREAMING] [SQL] Remove ...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the pull request:

    https://github.com/apache/spark/pull/10446#issuecomment-167971026
  
    I'm pushing an update with those changes. I found a similar one in `FileBasedWriteAheadLogWriter` and one more in `FsHistoryProvider` regarding checking for safe mode.
    
    BTW here are the additional cleanups that could be made if Hadoop 2.6+ were assumed:
    
    - `SparkHadoopUtil`: `getFileSystemThreadStatistics`, `getFSBytesReadOnThreadCallback` and `getFSBytesWrittenOnThreadCallback` only actually work on Hadoop 2.5+
    - Hadoop RDDs could always directly access `SplitLocationInfo` and `InputSplitWithLocationInfo`, introduced in 2.5
    - YARN integration tests could directly access tag-related APIs like ApplicationContext getApplicationTags from 2.4 (?)
    - AM could cleanly handle `ApplicationAttemptNotFoundException`
    
    Taken together those don't add up to much advantage, but it's not nothing. The bigger reason would be less JAR hell to deal with and being able to access new APIs more freely.
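
Editor's sketch of the kind of version probing that a higher minimum version would make unnecessary. It assumes the code paths listed above guard newer APIs by checking at runtime whether a class is present; `loadOptional` and the class name `example.DoesNotExist` are hypothetical and exist only for this demo.

```scala
import scala.util.Try

object OptionalApiDemo {
  // Returns Some(class) only when the named class is on the classpath.
  def loadOptional(className: String): Option[Class[_]] =
    Try(Class.forName(className)).toOption

  def main(args: Array[String]): Unit = {
    // java.lang.String always exists; the second name is made up for the demo.
    println(loadOptional("java.lang.String").isDefined)
    println(loadOptional("example.DoesNotExist").isDefined)
  }
}
```

With a guaranteed minimum version, each such probe collapses to an ordinary import and a direct reference to the class.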

