You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@pinot.apache.org by GitBox <gi...@apache.org> on 2022/10/17 21:55:30 UTC

[GitHub] [pinot] Jackie-Jiang commented on issue #7978: Support Structured, Repeated Records

Jackie-Jiang commented on issue #7978:
URL: https://github.com/apache/pinot/issues/7978#issuecomment-1281543027

   It is very smart. There are some extra cares need to be taken:
   - There should be only one main record for each id, or it might cross match other records, and get unexpected result
   - To resolve `IN_PARTITIONED_SUBQUERY` (`IN_ID_SET` after resolving `IdSet`), we need to scan all the ids because inverted index won't be applied for `IN_ID_SET`. When there are very few matching ids, this could be slower than sending 2 queries, where the second query is constructed using the ids from first query in an `IN` clause.
   
   Ideally we can build this natively to handle these inefficiencies. The subquery can happen at segment level instead of partition level (`IN_PARTITIONeD_SUBQUERY`) or table level (`IN_SUBQUERY`)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org