You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by "bkietz (via GitHub)" <gi...@apache.org> on 2023/04/18 15:32:12 UTC

[GitHub] [arrow] bkietz commented on issue #22806: [C++] vendor a half precision floating point library

bkietz commented on issue #22806:
URL: https://github.com/apache/arrow/issues/22806#issuecomment-1513370288

   I think if available software implementations are unsatisfactory, it'd be preferable to write a very minimal version of our own and handle any operations by first converting to 32 bit float. This should be sufficient since we'd be able to use a native type most of the time. In that case, we'd only need to maintain:
   - The cmake logic/flags necessary to enable native IEEE 754-2008 half precision floats when available
   - The conversions `soft 16 bit float <=> 32 bit float`
     - Including bitwise equivalence to native 16 bit floats when available


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org