You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@druid.apache.org by GitBox <gi...@apache.org> on 2020/04/21 20:59:39 UTC

[GitHub] [druid] gianm edited a comment on issue #9736: HLLSketchMerge aggregator failing for some metrics after upgrade to v0.18

gianm edited a comment on issue #9736:
URL: https://github.com/apache/druid/issues/9736#issuecomment-617411096


   @scrawfor Perhaps you could extract and upload just the HLL column? It only contains sketches of data, so it's less sensitive than the entire segment.
   
   A good way to do it is to unzip the segment and look at the `meta.smoosh` file. It has one line per column, where each line has four parts: column name, smoosh file index (usually "0" except for large multipart segments), start byte offset within that smooth file, end byte offset. So for this column:
   
   ```
   diffUrl,0,3316799,5665555
   ```
   
   You could extract it by running:
   
   ```
   dd bs=1 if=00000.smoosh skip=3316799 count=2348756 of=diffUrl
   ```
   
   What I'm after is a binary image of a specific sketch that exhibits the problem — I think once we have that it should be easier to find and fix it.
   
   If it is possible to reproduce this on some test data that would be great too.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@druid.apache.org
For additional commands, e-mail: commits-help@druid.apache.org