You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@pinot.apache.org by GitBox <gi...@apache.org> on 2020/09/03 23:53:42 UTC

[GitHub] [incubator-pinot] Jackie-Jiang opened a new issue #5925: Optimize query filtering on the result of another query

Jackie-Jiang opened a new issue #5925:
URL: https://github.com/apache/incubator-pinot/issues/5925


   Example query:
   `SELECT COUNT(*) FROM table1 WHERE id1 IN (SELECT id2 FROM table2 WHERE ...) ...`
   
   A naive solution would be sending the sub-query first, gather the result, then use the result to construct the main query. It works for cases where the result for the sub-query is small (< 100), but when the result size becomes big (> 1000), the cost of ser/de, query compilation and query processing will be too high.
   
   In order to optimize this query, we need to reduce the number of ids to process. We can rely on the partitioning to achieve that. When all the segments for a partition is on a single server, we can solve the query for the partition on the server side without sending and merging the result of the sub-query on the broker.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [incubator-pinot] Jackie-Jiang commented on issue #5925: Optimize query filtering on the result of another query

Posted by GitBox <gi...@apache.org>.
Jackie-Jiang commented on issue #5925:
URL: https://github.com/apache/incubator-pinot/issues/5925#issuecomment-683966432


   Support `IN_ID_SET` predicate: #5952 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [incubator-pinot] Jackie-Jiang commented on issue #5925: Optimize query filtering on the result of another query

Posted by GitBox <gi...@apache.org>.
Jackie-Jiang commented on issue #5925:
URL: https://github.com/apache/incubator-pinot/issues/5925#issuecomment-680622115


   Add `IdSet` and `IdSetAggregationFunction`: #5926 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [incubator-pinot] Jackie-Jiang commented on issue #5925: Optimize query filtering on the result of another query

Posted by GitBox <gi...@apache.org>.
Jackie-Jiang commented on issue #5925:
URL: https://github.com/apache/incubator-pinot/issues/5925#issuecomment-686821662


   Alternative of #5952, model `IN_ID_SET` as a `TransformFunction`: #5973 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [incubator-pinot] Jackie-Jiang closed issue #5925: Optimize query filtering on the result of another query

Posted by GitBox <gi...@apache.org>.
Jackie-Jiang closed issue #5925:
URL: https://github.com/apache/incubator-pinot/issues/5925


   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org