You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2019/09/17 21:00:00 UTC

[jira] [Commented] (ARROW-3263) [R] Use R sentinel values for missingness in addition to bitmask

    [ https://issues.apache.org/jira/browse/ARROW-3263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16931813#comment-16931813 ] 

Wes McKinney commented on ARROW-3263:
-------------------------------------

[~romainfrancois] do you know if it's possible to achieve this in R?

> [R] Use R sentinel values for missingness in addition to bitmask
> ----------------------------------------------------------------
>
>                 Key: ARROW-3263
>                 URL: https://issues.apache.org/jira/browse/ARROW-3263
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Format, R
>            Reporter: Gabriel Becker
>            Priority: Major
>
> R uses sentinal values to indicate missingness within Atomic vectors (read arrays in Arrow parlance, AFAIK). 
> Currently according to [~wesmckinn], the current value in the array in memory is undefined if the bitmap indicating missingness is set to 1. 
> This will force R to copy and modify data whenever adopting Arrow data which has missingness present as a native vector.
> If the value were written to the relevant sentinal values (INT_MIN for 32 bit integers, and NaN with payload 1954 for double precision floats) _in addition to_ the bit mask, then R would be able to use Arrow as intended while not breaking any other systems.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)