You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by debugger87 <gi...@git.apache.org> on 2017/08/10 04:20:06 UTC

[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

GitHub user debugger87 opened a pull request:

    https://github.com/apache/spark/pull/18900

    [SPARK-21687][SQL] Spark SQL should set createTime for Hive partition

    ## What changes were proposed in this pull request?
    
    Set createTime for every hive partition created in Spark SQL, which could be used to manage data lifecycle in Hive warehouse.
    
    ## How was this patch tested?
    
    No tests
    
    Please review http://spark.apache.org/contributing.html before opening a pull request.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/debugger87/spark fix/set-create-time-for-hive-partition

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18900.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18900
    
----
commit 71a660ac8dad869d9ba3b4e206b74f5c44660ee6
Author: debugger87 <ya...@gmail.com>
Date:   2017-08-10T04:17:00Z

    [SPARK-21687][SQL] Spark SQL should set createTime for Hive partition

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    `createTime` is set by HiveMetaStore#initializeAddedPartition
    
    ```
    private void initializeAddedPartition(Table tbl, PartitionIterator part, boolean madeDir) throws MetaException {
          if(HiveConf.getBoolVar(this.hiveConf, ConfVars.HIVESTATSAUTOGATHER) && !MetaStoreUtils.isView(tbl)) {
            MetaStoreUtils.updatePartitionStatsFast(part, this.wh, madeDir, false);
          }
    
          long time = System.currentTimeMillis() / 1000L;
          part.setCreateTime((long)((int)time));
          if(part.getParameters() == null || part.getParameters().get("transient_lastDdlTime") == null) {
            part.putToParameters("transient_lastDdlTime", Long.toString(time));
          }
          // ignore code lines
    }
    ```
    
    This PR should be closed and we will check the reason why createTime is zero for partitions created by spark sql again.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92376/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Any update?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #80558 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/80558/testReport)** for PR 18900 at commit [`bf2a105`](https://github.com/apache/spark/commit/bf2a1052f807a7ae36004c819e66fff5c4b45820).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92222 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92222/testReport)** for PR 18900 at commit [`c843ef1`](https://github.com/apache/spark/commit/c843ef18c4073281c8e8e59abf17a7bcbc6c5560).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cxzl25 <gi...@git.apache.org>.
Github user cxzl25 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **Modify the partition will lose createTime.**
    Reading the hive partitions ignores createTime when converting the CatalogTablePartition, it will also be lost when modifying partitions.
    Calling this method SessionCatalog#alterPartitions will be lost createTime.
    So can you reopen this pr? @debugger87 
    
    ```sql
    CREATE  TABLE `tmp_test_partition`(
      `c1` string
    )
    PARTITIONED BY (
      `d` string
    );
    ALTER TABLE `tmp_test_partition_1` ADD PARTITION (d='1');
    ALTER TABLE `tmp_test_partition_1` PARTITION (d='1') SET LOCATION 'xxx';
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r132375612
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -986,6 +986,7 @@ private[hive] object HiveClientImpl {
         tpart.setTableName(ht.getTableName)
         tpart.setValues(partValues.asJava)
         tpart.setSd(storageDesc)
    +    tpart.setCreateTime((System.currentTimeMillis() / 1000).toInt)
    --- End diff --
    
    This is to Hive, how about from Hive? 


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91512 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91512/testReport)** for PR 18900 at commit [`a00e943`](https://github.com/apache/spark/commit/a00e943a7097f386c842fd725cb1474e3a7f74c8).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/91511/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    If hive outputs createTime for partitions, we should do it too, right?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92374 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92374/testReport)** for PR 18900 at commit [`715c7cc`](https://github.com/apache/spark/commit/715c7ccee05cf85a4e6a771dabd0799a48b559a3).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r132727520
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala ---
    @@ -97,7 +97,9 @@ object CatalogStorageFormat {
     case class CatalogTablePartition(
         spec: CatalogTypes.TablePartitionSpec,
         storage: CatalogStorageFormat,
    -    parameters: Map[String, String] = Map.empty) {
    +    parameters: Map[String, String] = Map.empty,
    +    createTime: Long = System.currentTimeMillis,
    +    lastAccessTime: Long = -1) {
     
       def toLinkedHashMap: mutable.LinkedHashMap[String, String] = {
    --- End diff --
    
    You also need to add it to this map for display


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r132711854
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -986,6 +986,7 @@ private[hive] object HiveClientImpl {
         tpart.setTableName(ht.getTableName)
         tpart.setValues(partValues.asJava)
         tpart.setSd(storageDesc)
    +    tpart.setCreateTime((System.currentTimeMillis() / 1000).toInt)
    --- End diff --
    
    @gatorsmile sorry for my misunderstanding of your point. `toHivePartition` and `fromHivePartition` should be symmetric, and I change my implementation. could you please review it again?
    



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91512 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91512/testReport)** for PR 18900 at commit [`a00e943`](https://github.com/apache/spark/commit/a00e943a7097f386c842fd725cb1474e3a7f74c8).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r198399664
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala ---
    @@ -93,12 +93,16 @@ object CatalogStorageFormat {
      * @param spec partition spec values indexed by column name
      * @param storage storage format of the partition
      * @param parameters some parameters for the partition
    + * @param createTime creation time of the partition
    --- End diff --
    
    let's mention the time unit, i.e. in milliseconds.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    LGTM Thanks! Merged to master.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    ok to test


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cxzl25 OK, reopen it


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cxzl25 I run `build/mvn  -Dtest=none -DwildcardSuites=org.apache.spark.sql.streaming.FlatMapGroupsWithStateSuite  test` in my local PC, output is as follows:
    
    ```
    Run completed in 36 seconds, 46 milliseconds.
    Total number of tests run: 73
    Suites: completed 2, aborted 0
    Tests: succeeded 73, failed 0, canceled 0, ignored 0, pending 0
    All tests passed.
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92376 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92376/testReport)** for PR 18900 at commit [`18c85b6`](https://github.com/apache/spark/commit/18c85b61139e2b9d434214b9082b43a46e1c8787).
     * This patch passes all tests.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by asfgit <gi...@git.apache.org>.
Github user asfgit closed the pull request at:

    https://github.com/apache/spark/pull/18900


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by maropu <gi...@git.apache.org>.
Github user maropu commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r195898963
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -1019,6 +1021,8 @@ private[hive] object HiveClientImpl {
             compressed = apiPartition.getSd.isCompressed,
             properties = Option(apiPartition.getSd.getSerdeInfo.getParameters)
               .map(_.asScala.toMap).orNull),
    +      createTime = apiPartition.getCreateTime.toLong * 1000,
    +      lastAccessTime = apiPartition.getLastAccessTime.toLong * 1000,
    --- End diff --
    
    Can we use `DurationConversions` here?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92222 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92222/testReport)** for PR 18900 at commit [`c843ef1`](https://github.com/apache/spark/commit/c843ef18c4073281c8e8e59abf17a7bcbc6c5560).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r198401803
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala ---
    @@ -93,12 +93,16 @@ object CatalogStorageFormat {
      * @param spec partition spec values indexed by column name
      * @param storage storage format of the partition
      * @param parameters some parameters for the partition
    + * @param createTime creation time of the partition
    --- End diff --
    
    OK, it's the same as CatalogTable, in milliseconds. I fill fix this comment.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cxzl25  Any suggestions?
    
    ```
    org.apache.spark.sql.streaming.FlatMapGroupsWithStateSuite.flatMapGroupsWithState - streaming with processing time timeout
    ```


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 closed the pull request at:

    https://github.com/apache/spark/pull/18900


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r132802873
  
    --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/interface.scala ---
    @@ -97,7 +97,9 @@ object CatalogStorageFormat {
     case class CatalogTablePartition(
         spec: CatalogTypes.TablePartitionSpec,
         storage: CatalogStorageFormat,
    -    parameters: Map[String, String] = Map.empty) {
    +    parameters: Map[String, String] = Map.empty,
    +    createTime: Long = System.currentTimeMillis,
    +    lastAccessTime: Long = -1) {
     
       def toLinkedHashMap: mutable.LinkedHashMap[String, String] = {
    --- End diff --
    
    @gatorsmile  Thanks for your reminding, i will add it.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92376 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92376/testReport)** for PR 18900 at commit [`18c85b6`](https://github.com/apache/spark/commit/18c85b61139e2b9d434214b9082b43a46e1c8787).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91513 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91513/testReport)** for PR 18900 at commit [`e3a0cc4`](https://github.com/apache/spark/commit/e3a0cc43b10828b8111f7cd9523391cd3a2fdb6f).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by gatorsmile <gi...@git.apache.org>.
Github user gatorsmile commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    ok to test


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91513 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91513/testReport)** for PR 18900 at commit [`e3a0cc4`](https://github.com/apache/spark/commit/e3a0cc43b10828b8111f7cd9523391cd3a2fdb6f).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Can one of the admins verify this patch?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    we should also update `CatalogTablePartition.toLinkedHashMap`, following `CatalogTable.toLinkedHashMap`
    
    Otherwise LGTM


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92213 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92213/testReport)** for PR 18900 at commit [`0390e88`](https://github.com/apache/spark/commit/0390e88e7e16a5200041978073f263994decc6f6).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #80528 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/80528/testReport)** for PR 18900 at commit [`c833ce7`](https://github.com/apache/spark/commit/c833ce7aa5f2ba0b684494fd1b24b7995f1c09c9).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Can one of the admins verify this patch?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/91513/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/80528/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r193730957
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -1019,6 +1021,8 @@ private[hive] object HiveClientImpl {
             compressed = apiPartition.getSd.isCompressed,
             properties = Option(apiPartition.getSd.getSerdeInfo.getParameters)
               .map(_.asScala.toMap).orNull),
    +      createTime = apiPartition.getCreateTime.toLong * 1000,
    +      lastAccessTime = apiPartition.getLastAccessTime.toLong * 1000)
    --- End diff --
    
    @cxzl25 yeah, it's my mistake, i will fix it


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91511 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91511/testReport)** for PR 18900 at commit [`478e205`](https://github.com/apache/spark/commit/478e2051c775a594ad729256c3ef78cc311c992d).
     * This patch **fails Scala style tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by cxzl25 <gi...@git.apache.org>.
Github user cxzl25 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r193685282
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -1019,6 +1021,8 @@ private[hive] object HiveClientImpl {
             compressed = apiPartition.getSd.isCompressed,
             properties = Option(apiPartition.getSd.getSerdeInfo.getParameters)
               .map(_.asScala.toMap).orNull),
    +      createTime = apiPartition.getCreateTime.toLong * 1000,
    +      lastAccessTime = apiPartition.getLastAccessTime.toLong * 1000)
    --- End diff --
    
    Add a comma to the end?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #80558 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/80558/testReport)** for PR 18900 at commit [`bf2a105`](https://github.com/apache/spark/commit/bf2a1052f807a7ae36004c819e66fff5c4b45820).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    +1, similar to `CatalogTable.createTime`, we should have a `CatalogTablePartition.createTime`


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92374/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Can one of the admins verify this patch?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by cloud-fan <gi...@git.apache.org>.
Github user cloud-fan commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    can you check if hive DESC TABLE  output the create time of partitions? We can regenerate the output files for `SQLQueryTestSuite` if needed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/91512/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan Have a look at this PR again?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92374 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92374/testReport)** for PR 18900 at commit [`715c7cc`](https://github.com/apache/spark/commit/715c7ccee05cf85a4e6a771dabd0799a48b559a3).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cxzl25 this PR will break some unit tests, I will fix it.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92213/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cxzl25  @maropu any suggestions?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/80558/
    Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test FAILed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/91522/
    Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    My changes is not enough to support `createTime` in CatalogTablePartition, I will check and re-commit again.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91522 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91522/testReport)** for PR 18900 at commit [`b0846c3`](https://github.com/apache/spark/commit/b0846c39a94d729ec0324cc72b98861da7c073c7).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Can one of the admins verify this patch?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan Any suggestions?


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
GitHub user debugger87 reopened a pull request:

    https://github.com/apache/spark/pull/18900

    [SPARK-21687][SQL] Spark SQL should set createTime for Hive partition

    ## What changes were proposed in this pull request?
    
    Set createTime for every hive partition created in Spark SQL, which could be used to manage data lifecycle in Hive warehouse. We found  that almost every partition created by spark sql has not been set createTime.
    
    ```
    mysql> select * from partitions where create_time=0 limit 1\G;
    *************************** 1. row ***************************
             PART_ID: 1028584
         CREATE_TIME: 0
    LAST_ACCESS_TIME: 1502203611
           PART_NAME: date=20170130
               SD_ID: 1543605
              TBL_ID: 211605
      LINK_TARGET_ID: NULL
    1 row in set (0.27 sec)
    ```
    
    ## How was this patch tested?
     N/A

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/debugger87/spark fix/set-create-time-for-hive-partition

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18900.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18900
    
----
commit 71a660ac8dad869d9ba3b4e206b74f5c44660ee6
Author: debugger87 <ya...@...>
Date:   2017-08-10T04:17:00Z

    [SPARK-21687][SQL] Spark SQL should set createTime for Hive partition

commit f668ce8837ee553c61687bd03d04cddd32e5f36f
Author: debugger87 <ya...@...>
Date:   2017-08-11T07:50:26Z

    added createTime and lastAccessTime into CatalogTablePartition

commit 2fb1ddabdb2ab8f7b585ee7aea93280f96a23467
Author: debugger87 <ya...@...>
Date:   2017-08-11T07:54:26Z

    minor tweak

commit c833ce7aa5f2ba0b684494fd1b24b7995f1c09c9
Author: debugger87 <ya...@...>
Date:   2017-08-11T08:07:57Z

    fix type missmatch

commit bf2a1052f807a7ae36004c819e66fff5c4b45820
Author: debugger87 <ya...@...>
Date:   2017-08-11T23:26:29Z

    added createTime and lastAccessTime into partition map for display

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #92213 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/92213/testReport)** for PR 18900 at commit [`0390e88`](https://github.com/apache/spark/commit/0390e88e7e16a5200041978073f263994decc6f6).
     * This patch **fails Spark unit tests**.
     * This patch merges cleanly.
     * This patch adds no public classes.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #80528 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/80528/testReport)** for PR 18900 at commit [`c833ce7`](https://github.com/apache/spark/commit/c833ce7aa5f2ba0b684494fd1b24b7995f1c09c9).


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91522 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91522/testReport)** for PR 18900 at commit [`b0846c3`](https://github.com/apache/spark/commit/b0846c39a94d729ec0324cc72b98861da7c073c7).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by rxin <gi...@git.apache.org>.
Github user rxin commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    We should put this in the catalog, shouldn't we?



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Merged build finished. Test FAILed.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan I found that if we follow code in CatalogTable.toLinkedHashMap, this patch will break so many test cases in org.apache.spark.sql.SQLQueryTestSuite (compare *.out files).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark pull request #18900: [SPARK-21687][SQL] Spark SQL should set createTim...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/18900#discussion_r132377142
  
    --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/client/HiveClientImpl.scala ---
    @@ -986,6 +986,7 @@ private[hive] object HiveClientImpl {
         tpart.setTableName(ht.getTableName)
         tpart.setValues(partValues.asJava)
         tpart.setSd(storageDesc)
    +    tpart.setCreateTime((System.currentTimeMillis() / 1000).toInt)
    --- End diff --
    
    We just need to use API of hive_metastore like `get_partition` to fetch related information of Hive Partition.
    
    * Partition in hive_metastore.thrift
    
    ```
    struct Partition {
      1: list<string> values // string value is converted to appropriate partition key type
      2: string       dbName,
      3: string       tableName,
      4: i32          createTime,
      5: i32          lastAccessTime,
      6: StorageDescriptor   sd,
      7: map<string, string> parameters,
      8: optional PrincipalPrivilegeSet privileges
    }
    ```
    
    * get_partition in hive_metastore.thrit
    ```
    Partition get_partition(1:string db_name, 2:string tbl_name, 3:list<string> part_vals)
                           throws(1:MetaException o1, 2:NoSuchObjectException o2)
    ```
    We will set TTL as kv into parameters of `Table` and fetch `createTime` from `Partition` to decide if we can drop a partition in Hive.



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan totally agree with you, i will try to regenerate output files for `SQLQueryTestSuite`


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan could you please help me to review this PR?


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by SparkQA <gi...@git.apache.org>.
Github user SparkQA commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    **[Test build #91511 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/91511/testReport)** for PR 18900 at commit [`478e205`](https://github.com/apache/spark/commit/478e2051c775a594ad729256c3ef78cc311c992d).


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan Look at this PR again? I just put `createTime` into CatalogTablePartion.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastructure@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by AmplabJenkins <gi...@git.apache.org>.
Github user AmplabJenkins commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    Test PASSed.
    Refer to this link for build results (access rights to CI server needed): 
    https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/92222/
    Test PASSed.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] spark issue #18900: [SPARK-21687][SQL] Spark SQL should set createTime for H...

Posted by debugger87 <gi...@git.apache.org>.
Github user debugger87 commented on the issue:

    https://github.com/apache/spark/pull/18900
  
    @cloud-fan For Hive, `desc extended dbName.table` or `desc extended dbName.table partition (name=value)` will print `Detailed Table Information` or `Detailed Partition Information`. Of course, `createTime` will be in output console.
    
    However, it's not needed to regenerate `SQLQueryTestSuite ` because of so many changes/diff generated.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org