You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/03/07 17:08:45 UTC

[GitHub] [hudi] wangxianghu opened a new pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop c…

wangxianghu opened a new pull request #4972:
URL: https://github.com/apache/hudi/pull/4972


   …olumns from schema
   
   ## *Tips*
   - *Thank you very much for contributing to Apache Hudi.*
   - *Please review https://hudi.apache.org/contribute/how-to-contribute before opening a pull request.*
   
   ## What is the purpose of the pull request
   
   *Introduce DropColumnSchemaPostProcessor to support drop columns from schema*
   
   ## Verify this pull request
   
   This pull request is already covered by existing tests, such as *(please describe tests)*.
   org.apache.hudi.utilities.TestSchemaPostProcessor#testDeleteColumn
   org.apache.hudi.utilities.TestSchemaPostProcessor#testDeleteColumnThrows
   
   ## Committer checklist
   
    - [ ] Has a corresponding JIRA in PR title & commit
    
    - [ ] Commit message is descriptive of the change
    
    - [ ] CI is green
   
    - [ ] Necessary doc changes done or have another open PR
          
    - [ ] For large changes, please consider breaking it into sub-tasks under an umbrella JIRA.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061536724


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063811958


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063720103


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064184189


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063718670


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   * 6394a89d2faac0ccbb232557699df71b0414d088 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063811958


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060925162


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061419844


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060985033


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "CANCELED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060985033


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "CANCELED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061418575


   @hudi-bot run azure


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062706963


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu merged pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu merged pull request #4972:
URL: https://github.com/apache/hudi/pull/4972


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop c…

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060925162


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062635880


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063948727


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] pratyakshsharma commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
pratyakshsharma commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063741976


   Thank you for another useful processor @wangxianghu . Will merge once the CI passes.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] pratyakshsharma commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
pratyakshsharma commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r823044722



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));

Review comment:
       Better to have it as a HashSet? That would make the .contains() check O(1).




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r823393856



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));
+
+    List<Schema.Field> sourceFields = schema.getFields();
+    List<Schema.Field> targetFields = new LinkedList<>();
+
+    for (Schema.Field sourceField : sourceFields) {
+      if (!columnsToDelete.contains(sourceField.name())) {
+        targetFields.add(new Schema.Field(sourceField.name(), sourceField.schema(), sourceField.doc(), sourceField.defaultVal()));
+      } else {
+        columnsToDelete.remove(sourceField.name());

Review comment:
       > is it needed?
   
   not very necessary :)
   removed




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064202092


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798",
       "triggerID" : "1064201136",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] pratyakshsharma commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
pratyakshsharma commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r823045773



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));
+
+    List<Schema.Field> sourceFields = schema.getFields();
+    List<Schema.Field> targetFields = new LinkedList<>();
+
+    for (Schema.Field sourceField : sourceFields) {
+      if (!columnsToDelete.contains(sourceField.name())) {
+        targetFields.add(new Schema.Field(sourceField.name(), sourceField.schema(), sourceField.doc(), sourceField.defaultVal()));
+      } else {
+        columnsToDelete.remove(sourceField.name());

Review comment:
       is it needed?




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063720103


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064202092


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798",
       "triggerID" : "1064201136",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060934081


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "CANCELED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064201136


   @hudi-bot run azure


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r823393250



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));
+
+    List<Schema.Field> sourceFields = schema.getFields();
+    List<Schema.Field> targetFields = new LinkedList<>();
+
+    for (Schema.Field sourceField : sourceFields) {
+      if (!columnsToDelete.contains(sourceField.name())) {

Review comment:
       > Reference
   
   Addressed

##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));

Review comment:
       > Better to have it as a HashSet? That would make the .contains() check O(1).
   
   Done




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060931186


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060934081


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "CANCELED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [CANCELED](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] pratyakshsharma commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
pratyakshsharma commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r830486075



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+import java.util.Locale;
+import java.util.Set;
+import java.util.stream.Collectors;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    // convert field to lowerCase for compare purpose
+    Set<String> columnsToDelete = Arrays.stream(columnToDeleteStr.split(","))
+        .map(filed -> filed.toLowerCase(Locale.ROOT))
+        .collect(Collectors.toSet());
+
+    List<Schema.Field> sourceFields = schema.getFields();
+    List<Schema.Field> targetFields = new LinkedList<>();
+
+    for (Schema.Field sourceField : sourceFields) {
+      if (!columnsToDelete.contains(sourceField.name().toLowerCase(Locale.ROOT))) {

Review comment:
       Avro schema fields are actually case sensitive. I was reading about this today. But in practical world, any table should not have column names which differ only in case sensitivity. So this conversion to lower case should not break anything. 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061419844


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop c…

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060922057


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop c…

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060922057


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063942716


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063948727


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063824161


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063824161


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063942716


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064308177


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798",
       "triggerID" : "1064201136",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6798) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061536724


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061443825


   hi @XuQianJin-Stars, could you please help review this as well ?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062634216


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   * 03d756b4173274233a6c3547a0bcd88d94df87bc UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062634216


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   * 03d756b4173274233a6c3547a0bcd88d94df87bc UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063821502


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   * 5542a4582e31032113a7b9083f92f913701e6f28 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063947769


   @hudi-bot run azure


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063821502


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 6394a89d2faac0ccbb232557699df71b0414d088 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774) 
   * 5542a4582e31032113a7b9083f92f913701e6f28 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1060931186


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 9245dbb4ad086446c396af680e1f70a5a00abab5 Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652) 
   * 750e07438a19600c55d37e7107f88ba04416e218 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061067561


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1064184189


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6774",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780",
       "triggerID" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "triggerType" : "PUSH"
     }, {
       "hash" : "5542a4582e31032113a7b9083f92f913701e6f28",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793",
       "triggerID" : "1063947769",
       "triggerType" : "MANUAL"
     } ]
   }-->
   ## CI report:
   
   * 5542a4582e31032113a7b9083f92f913701e6f28 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6780) Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6793) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] pratyakshsharma commented on a change in pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
pratyakshsharma commented on a change in pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#discussion_r823043958



##########
File path: hudi-utilities/src/main/java/org/apache/hudi/utilities/schema/DropColumnSchemaPostProcessor.java
##########
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *      http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.hudi.utilities.schema;
+
+import org.apache.hudi.common.config.TypedProperties;
+import org.apache.hudi.common.util.StringUtils;
+import org.apache.hudi.utilities.exception.HoodieSchemaPostProcessException;
+
+import org.apache.avro.Schema;
+import org.apache.log4j.LogManager;
+import org.apache.log4j.Logger;
+import org.apache.spark.api.java.JavaSparkContext;
+
+import java.util.ArrayList;
+import java.util.Arrays;
+import java.util.LinkedList;
+import java.util.List;
+
+/**
+ * A {@link SchemaPostProcessor} that support to delete column(s) from given schema.
+ * <p>
+ * Multiple columns are separated by commas.
+ * For example:
+ * <p>
+ * properties.put("hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns", "column1,column2").
+ */
+public class DropColumnSchemaPostProcessor extends SchemaPostProcessor {
+
+  private static final Logger LOG = LogManager.getLogger(DropColumnSchemaPostProcessor.class);
+
+  public DropColumnSchemaPostProcessor(TypedProperties props, JavaSparkContext jssc) {
+    super(props, jssc);
+  }
+
+  public static class Config {
+    public static final String DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP =
+        "hoodie.deltastreamer.schemaprovider.schema_post_processor.delete.columns";
+  }
+
+  @Override
+  public Schema processSchema(Schema schema) {
+
+    String columnToDeleteStr = this.config.getString(Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP);
+
+    if (StringUtils.isNullOrEmpty(columnToDeleteStr)) {
+      LOG.warn(String.format("Param %s is null or empty, return original schema", Config.DELETE_COLUMN_POST_PROCESSOR_COLUMN_PROP));
+    }
+
+    List<String> columnsToDelete = new ArrayList<>(Arrays.asList(columnToDeleteStr.split(",")));
+
+    List<Schema.Field> sourceFields = schema.getFields();
+    List<Schema.Field> targetFields = new LinkedList<>();
+
+    for (Schema.Field sourceField : sourceFields) {
+      if (!columnsToDelete.contains(sourceField.name())) {

Review comment:
       Small doubt, is it possible to have fields whose names differ only in case sensitivity in avro schemas? 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063718670


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     }, {
       "hash" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "status" : "UNKNOWN",
       "url" : "TBD",
       "triggerID" : "6394a89d2faac0ccbb232557699df71b0414d088",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   * 6394a89d2faac0ccbb232557699df71b0414d088 UNKNOWN
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062706963


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] wangxianghu commented on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
wangxianghu commented on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1063719238


   > Changes look good to me overall. Minor comments. Please address them.
   
   @pratyakshsharma thanks for you review, I have addressed all your concern, please take another look when free


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1061067561


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] hudi-bot removed a comment on pull request #4972: [HUDI-3522] Introduce DropColumnSchemaPostProcessor to support drop columns from schema

Posted by GitBox <gi...@apache.org>.
hudi-bot removed a comment on pull request #4972:
URL: https://github.com/apache/hudi/pull/4972#issuecomment-1062635880


   <!--
   Meta data
   {
     "version" : 1,
     "metaDataEntries" : [ {
       "hash" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "status" : "DELETED",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6652",
       "triggerID" : "9245dbb4ad086446c396af680e1f70a5a00abab5",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "FAILURE",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656",
       "triggerID" : "750e07438a19600c55d37e7107f88ba04416e218",
       "triggerType" : "PUSH"
     }, {
       "hash" : "750e07438a19600c55d37e7107f88ba04416e218",
       "status" : "SUCCESS",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686",
       "triggerID" : "1061418575",
       "triggerType" : "MANUAL"
     }, {
       "hash" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "status" : "PENDING",
       "url" : "https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729",
       "triggerID" : "03d756b4173274233a6c3547a0bcd88d94df87bc",
       "triggerType" : "PUSH"
     } ]
   }-->
   ## CI report:
   
   * 750e07438a19600c55d37e7107f88ba04416e218 Azure: [FAILURE](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6656) Azure: [SUCCESS](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6686) 
   * 03d756b4173274233a6c3547a0bcd88d94df87bc Azure: [PENDING](https://dev.azure.com/apache-hudi-ci-org/785b6ef4-2f42-4a89-8f0e-5f0d7039a0cc/_build/results?buildId=6729) 
   
   <details>
   <summary>Bot commands</summary>
     @hudi-bot supports the following commands:
   
    - `@hudi-bot run azure` re-run the last Azure build
   </details>


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org