You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by holdenk <gi...@git.apache.org> on 2016/07/30 07:10:20 UTC

[GitHub] spark pull request #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet co...

GitHub user holdenk opened a pull request:

    https://github.com/apache/spark/pull/14419

    [SPARK-16814][SQL][WIP] Fix deprecated parquet constructor usage

    ## What changes were proposed in this pull request?
    
    Replace deprecated ParquetWriter with the new builders
    
    
    ## How was this patch tested?
    
    Existing tests

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/holdenk/spark SPARK-16814-fix-deprecated-parquet-constructor-usage

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/14419.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #14419
    
----
commit 9fa8c8f1f3d2e519541ae988e009116615375b34
Author: Holden Karau <ho...@us.ibm.com>
Date:   2016-07-29T20:46:33Z

    start refactoring to builder

commit b41a057c7fab71fd0f0b7564a158de568d919f35
Author: Holden Karau <ho...@us.ibm.com>
Date:   2016-07-30T01:33:13Z

    Merge branch 'master' into fix-deprecated-parquet-constructor-usage

commit 1ba76430eaf64804fe983ed50d967dbc58c5f8da
Author: Holden Karau <ho...@us.ibm.com>
Date:   2016-07-30T06:04:52Z

    Use the builder

commit e10c147db6fa9250bbaf16faee944dd22d741ada
Author: Holden Karau <ho...@us.ibm.com>
Date:   2016-07-30T07:01:31Z

    Cleanup some other deprecated parquet writer constructors

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Do we know if this works across versions of Parquet that are still in play for Spark builds? I know we've got some YARN and Scala warnings that we can't adjust for that reason. Just double checking. @HyukjinKwon did you look at this too or am I imagining it?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73320096
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    You might just go ahead and add it as a comment for good measure.
    
    Isn't `@Override` the _Java_ annotation? thought Scala needed `@override` but could be missing something.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    @HyukjinKwon no worries - just wanted to make sure we weren't accidentally duplicating eachothers efforts :) 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet co...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r72894773
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    can you explain what's going on here? This seems like pretty complicated.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63062 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63062/consoleFull)** for PR 14419 at commit [`2067b63`](https://github.com/apache/spark/commit/2067b634784f94cd23dcd8c41b9035afcb0140e2).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet co...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r72896264
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    Sure, so the parquetWriter constructors are deprecated now and its been replaced with a builder interface. For Avro and others there is a standard builder - but for sort "raw" formats you need to implement your own builder. This is equivalent to the old constructor we were using - you can see the deprecation in https://github.com/apache/parquet-mr/pull/199/files as well as how the builder interface ends up calling an equivalent (now protected) constructor. Also since our WriteSupport doesn't need to change based on the configuration we always return the same writesupport regardless of conf.
    
    If it would be useful I can add some of this as a comment in the sourcecode.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63039/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Yes, actually I took a look for this as well and it seems okay (I was thinking doing this with other issues).
    If my understanding is correct, this would not affect actual writing in Spark but just tests.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    No, I don't :). I just meant I wanted to say that this fix is reasonable and the reason I didn't submit a PR has not been because there is potential incompatability. It was because just I was thinking doing this with other ones in the future. I like this PR :).
    
     Sorry for confusing you.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r72914240
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,18 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    +    case class ParquetWriterBuilder() extends
    --- End diff --
    
    Ah, you want to just block that this class is inherited accidentally just in case?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r72915861
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,18 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    +    case class ParquetWriterBuilder() extends
    --- End diff --
    
    Thats a good point, they probably don't need to be case classes (just happened to write that way when I started the code) - I'll switch them to regular classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/14419


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73395973
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,18 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    +    class ParquetWriterBuilder() extends
    +        ParquetWriter.Builder[RecordConsumer => Unit, ParquetWriterBuilder](new Path(path)) {
    +      @Override def getWriteSupport(conf: org.apache.hadoop.conf.Configuration) = {
    --- End diff --
    
    sure :)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63183 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63183/consoleFull)** for PR 14419 at commit [`1a37b8f`](https://github.com/apache/spark/commit/1a37b8ffc518dff94117e6129660c1c585412518).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73327929
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    I didn't know Java annotation passes Scala style checking. Should we consider adding a rule for this?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63062 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63062/consoleFull)** for PR 14419 at commit [`2067b63`](https://github.com/apache/spark/commit/2067b634784f94cd23dcd8c41b9035afcb0140e2).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63062/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63039 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63039/consoleFull)** for PR 14419 at commit [`e10c147`](https://github.com/apache/spark/commit/e10c147db6fa9250bbaf16faee944dd22d741ada).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds the following public classes _(experimental)_:
      * `    case class ParquetWriterBuilder() extends`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73320252
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,18 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    +    class ParquetWriterBuilder() extends
    +        ParquetWriter.Builder[RecordConsumer => Unit, ParquetWriterBuilder](new Path(path)) {
    +      @Override def getWriteSupport(conf: org.apache.hadoop.conf.Configuration) = {
    --- End diff --
    
    Import `Configuration`? and maybe just write the trivial body of both methods inline without braces on one line. Just personal taste really, to keep this anonymous class compact


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63183/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73373242
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    Ack yeah that's what I meant, thank you.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63051 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63051/consoleFull)** for PR 14419 at commit [`5e92e6f`](https://github.com/apache/spark/commit/5e92e6f59f959772fdc276b40939d4830a70512f).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/63051/
    Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73328549
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    Sure, because I don't think it would actually cause the Scala compiler to verify it overrides. It's not illegal to use a Java annotation, just doesn't do anything in this case?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63039 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63039/consoleFull)** for PR 14419 at commit [`e10c147`](https://github.com/apache/spark/commit/e10c147db6fa9250bbaf16faee944dd22d741ada).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73106592
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    @rxin Does this explanation make sense for you?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73331335
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    Filed here, https://issues.apache.org/jira/browse/SPARK-16877 but I will do some researches and take a look into this deeper before opening a PR.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r72914197
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,18 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    +    case class ParquetWriterBuilder() extends
    --- End diff --
    
    One thing I am a bit wondering though is why this has to be `case class` instead of just `class`. Because it seems it is self-contained, so it seems not have to be serializable and it seems there is no pattern matching with this. Just curious.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by HyukjinKwon <gi...@git.apache.org>.
Github user HyukjinKwon commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Could  cc @liancheng who has more insight about thie please?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    So yes as @HyukjinKwon says its just in the tests.
    I figured this would be ok across the hadoop versions we target because I looked at the build deps files and they all listed parquet-hadoop-1.8.1 for the different hadoop profiles we build with (`grep -r parquet-hadoop ./dev/deps/`)


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by holdenk <gi...@git.apache.org>.
Github user holdenk commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    @HyukjinKwon do you have a PR for this? If so I can go ahead and close this one (and lets maybe sync some on the deprecation stuff we are both cleaning up :))


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL][WIP] Fix deprecated parquet construct...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63051 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63051/consoleFull)** for PR 14419 at commit [`5e92e6f`](https://github.com/apache/spark/commit/5e92e6f59f959772fdc276b40939d4830a70512f).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #14419: [SPARK-16814][SQL] Fix deprecated parquet constru...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on a diff in the pull request:

    https://github.com/apache/spark/pull/14419#discussion_r73367655
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetCompatibilityTest.scala ---
    @@ -119,8 +119,19 @@ private[sql] object ParquetCompatibilityTest {
           metadata: Map[String, String],
           recordWriters: (RecordConsumer => Unit)*): Unit = {
         val messageType = MessageTypeParser.parseMessageType(schema)
    -    val writeSupport = new DirectWriteSupport(messageType, metadata)
    -    val parquetWriter = new ParquetWriter[RecordConsumer => Unit](new Path(path), writeSupport)
    +    val testWriteSupport = new DirectWriteSupport(messageType, metadata)
    --- End diff --
    
    scala just do override, not even @override.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by srowen <gi...@git.apache.org>.
Github user srowen commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Merged to master


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    Merged build finished. Test PASSed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #14419: [SPARK-16814][SQL] Fix deprecated parquet constructor us...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/14419
  
    **[Test build #63183 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/63183/consoleFull)** for PR 14419 at commit [`1a37b8f`](https://github.com/apache/spark/commit/1a37b8ffc518dff94117e6129660c1c585412518).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org