Posted to commits@doris.apache.org by GitBox <gi...@apache.org> on 2021/01/26 05:40:23 UTC

[GitHub] [incubator-doris] stdpain opened a new issue #5300: [optimization] Optimize hash table build

stdpain opened a new issue #5300:
URL: https://github.com/apache/incubator-doris/issues/5300


   **Describe**
   In the original logic, the hash table stores its actual data in a vector-like structure. While the hash table is being built, roughly a quarter of the time can be spent repeatedly copying that data, and the cost grows as more columns are built. I changed the storage to use raw pointers to avoid the extra copy overhead, which gives a clear improvement in the hash table construction phase.
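
   To illustrate the copy overhead described above, here is a minimal sketch. It is not the actual Doris code: `VectorRowStore` and `ChunkedRowStore` are hypothetical names, and the fixed-size chunk layout is only an assumption about what raw-pointer storage could look like. The first class relocates every stored row whenever its buffer grows; the second hands out raw pointers into chunks that never move.

   ```cpp
   #include <cassert>
   #include <cstddef>
   #include <cstdint>
   #include <cstring>
   #include <memory>
   #include <vector>

   // Hypothetical illustration only; these types are not Doris classes.

   // Vector-like storage: all rows share one contiguous buffer, so each
   // capacity increase copies every previously stored row to a new buffer.
   class VectorRowStore {
   public:
       explicit VectorRowStore(std::size_t row_size) : row_size_(row_size) {}

       // Returns the row index; pointers into buf_ would be invalidated on growth.
       std::size_t append(const std::uint8_t* row) {
           if (buf_.size() + row_size_ > buf_.capacity()) {
               bytes_moved_ += buf_.size();  // existing rows get relocated here
               buf_.reserve(buf_.capacity() == 0 ? row_size_ : buf_.capacity() * 2);
           }
           buf_.insert(buf_.end(), row, row + row_size_);
           return buf_.size() / row_size_ - 1;
       }

       const std::uint8_t* row(std::size_t i) const { return buf_.data() + i * row_size_; }
       std::size_t bytes_moved() const { return bytes_moved_; }

   private:
       std::size_t row_size_;
       std::size_t bytes_moved_ = 0;
       std::vector<std::uint8_t> buf_;
   };

   // Raw-pointer storage: rows are written once into fixed-size chunks.
   // Adding a chunk never touches earlier chunks, so returned pointers stay valid.
   class ChunkedRowStore {
   public:
       ChunkedRowStore(std::size_t row_size, std::size_t rows_per_chunk)
           : row_size_(row_size), rows_per_chunk_(rows_per_chunk) {}

       // Copies the row once and returns a stable raw pointer to the stored copy.
       std::uint8_t* append(const std::uint8_t* row) {
           if (chunks_.empty() || used_in_last_ == rows_per_chunk_) {
               chunks_.push_back(
                   std::make_unique<std::uint8_t[]>(row_size_ * rows_per_chunk_));
               used_in_last_ = 0;
           }
           std::uint8_t* dst = chunks_.back().get() + used_in_last_ * row_size_;
           std::memcpy(dst, row, row_size_);
           ++used_in_last_;
           return dst;
       }

   private:
       std::size_t row_size_;
       std::size_t rows_per_chunk_;
       std::size_t used_in_last_ = 0;
       std::vector<std::unique_ptr<std::uint8_t[]>> chunks_;
   };
   ```

   In this sketch a hash bucket could hold the pointer returned by `ChunkedRowStore::append` directly, whereas with `VectorRowStore` it would have to hold an index, and `bytes_moved()` shows how much data was recopied during the build. Wider rows (more build columns) make each relocation proportionally more expensive.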
   
   Here is my test case; LINE_ORDER and LINE_ORDER_V2 are from the SSB dataset:
   
   ```
   SELECT count(*) FROM LINE_ORDER t1 join LINE_ORDER_V2 t2 WHERE t1.LO_ORDERKEY=t2.LO_ORDERKEY;
   ```
   
   | Type   | Right Table Rows | Build Time | Probe Time | Time Cost (s) |
   | ------ | ---------------- | ---------- | ---------- | ------------- |
   | After  | 6001215          | 658.288ms  | 1s451ms    | 4.07          |
   | Before | 6001215          | 1s428ms    | 1s512ms    | 4.69          |
   
   
   
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@doris.apache.org
For additional commands, e-mail: commits-help@doris.apache.org


[GitHub] [incubator-doris] wangbo closed issue #5300: [optimization] Optimize hash table build

Posted by GitBox <gi...@apache.org>.
wangbo closed issue #5300:
URL: https://github.com/apache/incubator-doris/issues/5300


   



