You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "micah-white (via GitHub)" <gi...@apache.org> on 2023/05/09 22:14:25 UTC

[GitHub] [arrow] micah-white opened a new issue, #35521: Hashing array scalar with null bitmap and non-null 0s bitmap produces different hashes.

micah-white opened a new issue, #35521:
URL: https://github.com/apache/arrow/issues/35521

   ### Describe the bug, including details regarding any error messages, version, and platform.
   
   Null bitmaps in arrays are generally set to `nullptr` when the null count for the array is 0. However, there are cases where the null bitmap is set to a 0s buffer instead. Semantically, a 0s bitmap and a `nullptr` bitmap are the same thing. However, the hashing algorithm for array scalars will hash the null bitmap if it exists, and the resulting hash is not 0. This behavior can lead to two scalars with the same semantic value, but different internal values of the null bitmap, to hash to different values.
   
   ### Component(s)
   
   C++


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] pitrou closed issue #35521: [C++] Hashing array scalar with null bitmap and non-null 0s bitmap produces different hashes.

Posted by "pitrou (via GitHub)" <gi...@apache.org>.
pitrou closed issue #35521: [C++] Hashing array scalar with null bitmap and non-null 0s bitmap produces different hashes.
URL: https://github.com/apache/arrow/issues/35521


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] jorisvandenbossche commented on issue #35521: [C++] Hashing array scalar with null bitmap and non-null 0s bitmap produces different hashes.

Posted by "jorisvandenbossche (via GitHub)" <gi...@apache.org>.
jorisvandenbossche commented on issue #35521:
URL: https://github.com/apache/arrow/issues/35521#issuecomment-1541447492

   Related issue: https://github.com/apache/arrow/issues/35360 (not strictly related to the null bitmap, but just in general about that there are quite some things lacking about the ScalarHash implementation)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org