Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/11/01 16:38:00 UTC

[jira] [Work logged] (BEAM-13081) Portable representation of "packed bitset indicating null fields" in beam Row format is not compatible with jvm representations

     [ https://issues.apache.org/jira/browse/BEAM-13081?focusedWorklogId=672751&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-672751 ]

ASF GitHub Bot logged work on BEAM-13081:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 01/Nov/21 16:37
            Start Date: 01/Nov/21 16:37
    Worklog Time Spent: 10m 
      Work Description: JiayangWu commented on pull request #15829:
URL: https://github.com/apache/beam/pull/15829#issuecomment-956391582


   R: @reuvenlax




Issue Time Tracking
-------------------

    Worklog Id:     (was: 672751)
    Time Spent: 2h 40m  (was: 2.5h)

> Portable representation of "packed bitset indicating null fields" in beam Row format is not compatible with jvm representations
> -------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: BEAM-13081
>                 URL: https://issues.apache.org/jira/browse/BEAM-13081
>             Project: Beam
>          Issue Type: Bug
>          Components: cross-language
>            Reporter: Steve Niemitz
>            Assignee: Steve Niemitz
>            Priority: P2
>          Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> The JVM RowCoder strips trailing zero bytes from the null-value bitmap, while both the Python and Go coders expect every byte of the bitmap to be present in the encoding. In some circumstances this causes index-out-of-range errors when a row encoded on the JVM is decoded in another language.
> For example, given a Row with 10 nullable fields, if the first 8 are null and the last two are set, the row will fail to decode in Python: the encoded null bitmap has only 1 byte (the second byte is all zeros and is stripped), but the Python coder expects 2.
> As discussed in the thread, the best solution here is probably to change the Python (and Go) coders to accept truncated null bitmaps.
>  
> More discussion here:
> [https://lists.apache.org/thread.html/r2f148e29902bda8bb0ff7106fffb8a5494295450827ad7fd17289383%40%3Cdev.beam.apache.org%3E]
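
To make the 10-field example above concrete, here is a minimal Python sketch of the proposed behaviour: a decoder that treats bytes missing from the end of the null bitmap as zeros. It is not the actual Beam coder code. The function names are hypothetical, the least-significant-bit-first packing within each byte is an assumption, and the length prefix that precedes the bitmap on the wire is left out.

```python
from typing import List


def encode_nulls_truncated(is_null: List[bool]) -> bytes:
    """Pack null flags (assumed LSB-first), then drop trailing zero bytes.

    Hypothetical helper imitating the JVM-side truncation described above.
    """
    packed = bytearray((len(is_null) + 7) // 8)
    for i, flag in enumerate(is_null):
        if flag:
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed).rstrip(b"\x00")


def decode_nulls_tolerant(bitmap: bytes, num_fields: int) -> List[bool]:
    """Read null flags, treating bytes missing from the end as all zeros.

    Hypothetical helper: a strict decoder would index past the end of
    `bitmap` here and fail instead.
    """
    flags = []
    for i in range(num_fields):
        byte_index = i // 8
        byte = bitmap[byte_index] if byte_index < len(bitmap) else 0
        flags.append(bool(byte & (1 << (i % 8))))
    return flags


# The case from the description: 10 nullable fields, first 8 null, last 2 set.
fields_null = [True] * 8 + [False] * 2
encoded = encode_nulls_truncated(fields_null)
decoded = decode_nulls_tolerant(encoded, len(fields_null))
```

Under these assumptions, `encoded` is a single byte even though 10 fields would normally need two, and `decoded` still matches `fields_null` because the absent second byte is read as zero.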


