Posted to commits@pinot.apache.org by "egalpin (via GitHub)" <gi...@apache.org> on 2023/03/16 23:32:23 UTC

[GitHub] [pinot] egalpin opened a new issue, #10437: Store validDocIds with each segment when sealed

egalpin opened a new issue, #10437:
URL: https://github.com/apache/pinot/issues/10437

   The set of valid doc IDs (for upsert tables) may already be stored as part of the segments themselves. If not, having access to the bitmap of valid doc IDs (or the likely smaller bitmap of invalid doc IDs) would allow for compaction or other externalized reprocessing of segments, where knowing which docs are valid is needed. Once a document becomes invalid it can never become valid again, so this should be safe to store.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@pinot.apache.org
For additional commands, e-mail: commits-help@pinot.apache.org


[GitHub] [pinot] egalpin closed issue #10437: Store validDocIds with each segment when sealed

Posted by "egalpin (via GitHub)" <gi...@apache.org>.
egalpin closed issue #10437: Store validDocIds with each segment when sealed
URL: https://github.com/apache/pinot/issues/10437




[GitHub] [pinot] Jackie-Jiang commented on issue #10437: Store validDocIds with each segment when sealed

Posted by "Jackie-Jiang (via GitHub)" <gi...@apache.org>.
Jackie-Jiang commented on issue #10437:
URL: https://github.com/apache/pinot/issues/10437#issuecomment-1477126223

   Please take a look at #10052, which added an API to download the validDocIds from the server.

