Posted to commits@doris.apache.org by "airborne12 (via GitHub)" <gi...@apache.org> on 2023/04/28 09:01:58 UTC

[GitHub] [doris] airborne12 opened a new pull request, #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

airborne12 opened a new pull request, #19207:
URL: https://github.com/apache/doris/pull/19207

   # Proposed changes
   
   This pull request improves compaction performance by merging inverted indices directly.
   It changes the existing compaction implementation and adds index compaction for inverted index columns.
   
   ## Problem summary
   
   - Adds a new configuration option inverted_index_compaction_enable (default: false) to enable or disable inverted index compaction.
   - Modifies construct_output_rowset_writer() to add inverted index columns to the context for compaction.
   - Adds a new method compact_column() for handling the actual index compaction process.
   - Implements the index compaction process in do_compaction_impl().
   - Adds two new files: inverted_index_compaction.cpp and inverted_index_compaction.h for handling index compaction-related functionality.
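   
   As a rough illustration of what "direct merging" means, the sketch below combines the posting lists of several source indices into one output index by remapping row IDs, without re-tokenizing the column data. It is not the code in inverted_index_compaction.cpp: the names TermIndex, PostingList, kDeleted and merge_indexes are hypothetical, and the in-memory std::map stands in for the on-disk index files that the real implementation operates on.

   ```cpp
   #include <algorithm>
   #include <cassert>
   #include <cstdint>
   #include <map>
   #include <string>
   #include <vector>

   // Hypothetical in-memory model: term -> sorted list of row IDs.
   using PostingList = std::vector<uint32_t>;
   using TermIndex = std::map<std::string, PostingList>;

   // Marker for rows that do not survive compaction (assumed convention).
   constexpr uint32_t kDeleted = UINT32_MAX;

   // Merge several source indices into one output index.
   // doc_id_map[src][old_id] gives the row ID in the output rowset,
   // or kDeleted if the row was dropped during compaction.
   TermIndex merge_indexes(const std::vector<TermIndex>& sources,
                           const std::vector<std::vector<uint32_t>>& doc_id_map) {
       assert(sources.size() == doc_id_map.size());
       TermIndex merged;
       for (size_t src = 0; src < sources.size(); ++src) {
           for (const auto& entry : sources[src]) {
               // Translate each posting to its new row ID, skipping deleted rows.
               PostingList remapped;
               for (uint32_t old_id : entry.second) {
                   uint32_t new_id = doc_id_map[src].at(old_id);
                   if (new_id != kDeleted) {
                       remapped.push_back(new_id);
                   }
               }
               if (remapped.empty()) {
                   continue;  // the term has no surviving rows in this source
               }
               PostingList& out = merged[entry.first];
               out.insert(out.end(), remapped.begin(), remapped.end());
           }
       }
       // Restore sorted order, since new IDs from different sources may interleave.
       for (auto& entry : merged) {
           std::sort(entry.second.begin(), entry.second.end());
       }
       return merged;
   }
   ```

   The cost of this approach depends on the number of postings rather than on the size of the original column values, which is the intuition behind skipping a full index rebuild during compaction.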
   
   ## Checklist(Required)
   
   * [ ] Does it affect the original behavior
   * [ ] Have unit tests been added
   * [ ] Has documentation been added or modified
   * [ ] Does it need to update dependencies
   * [ ] Does this PR support rollback (If NO, please explain WHY)
   
   ## Further comments
   
   If this is a relatively large or complex change, kick off the discussion at [dev@doris.apache.org](mailto:dev@doris.apache.org) by explaining why you chose the solution you did and what alternatives you considered, etc...
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@doris.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@doris.apache.org
For additional commands, e-mail: commits-help@doris.apache.org


[GitHub] [doris] qidaye merged pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "qidaye (via GitHub)" <gi...@apache.org>.
qidaye merged PR #19207:
URL: https://github.com/apache/doris/pull/19207


[GitHub] [doris] github-actions[bot] commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1535657017

   PR approved by anyone and no changes requested.


[GitHub] [doris] airborne12 commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "airborne12 (via GitHub)" <gi...@apache.org>.
airborne12 commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1527230479

   run buildall


[GitHub] [doris] airborne12 commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "airborne12 (via GitHub)" <gi...@apache.org>.
airborne12 commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1534018438

   run buildall


[GitHub] [doris] github-actions[bot] commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1527233618

   clang-tidy review says "All clean, LGTM! :+1:"


[GitHub] [doris] github-actions[bot] commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1535657004

   PR approved by at least one committer and no changes requested.


[GitHub] [doris] hello-stephen commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "hello-stephen (via GitHub)" <gi...@apache.org>.
hello-stephen commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1527362154

   TeamCity pipeline, clickbench performance test result:
    the sum of best hot time: 34.67 seconds
    stream load tsv:          427 seconds loaded 74807831229 Bytes, about 167 MB/s
    stream load json:         24 seconds loaded 2358488459 Bytes, about 93 MB/s
    stream load orc:          59 seconds loaded 1101869774 Bytes, about 17 MB/s
    stream load parquet:      30 seconds loaded 861443392 Bytes, about 27 MB/s
    https://doris-community-test-1308700295.cos.ap-hongkong.myqcloud.com/tmp/20230428103848_clickbench_pr_137083.html


[GitHub] [doris] github-actions[bot] commented on pull request #19207: [Improvement](inverted index) Enhance compaction performance through direct inverted index merging

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on PR #19207:
URL: https://github.com/apache/doris/pull/19207#issuecomment-1534021678

   clang-tidy review says "All clean, LGTM! :+1:"

