You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/06/27 23:48:00 UTC

[jira] [Updated] (ARROW-7273) [Python] Non-nullable null field is allowed / crashes when writing to parquet

     [ https://issues.apache.org/jira/browse/ARROW-7273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated ARROW-7273:
----------------------------------
    Labels: parquet pull-request-available  (was: parquet)

> [Python] Non-nullable null field is allowed / crashes when writing to parquet
> -----------------------------------------------------------------------------
>
>                 Key: ARROW-7273
>                 URL: https://issues.apache.org/jira/browse/ARROW-7273
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Python
>            Reporter: Joris Van den Bossche
>            Assignee: Wes McKinney
>            Priority: Major
>              Labels: parquet, pull-request-available
>             Fix For: 1.0.0
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> It seems to be possible to create a "non-nullable null field". While this does not make any sense (so already a reason to disallow this I think), this can also lead to crashed in further operations, such as writing to parquet:
> {code}
> In [18]: table = pa.table([pa.array([None, None], pa.null())], schema=pa.schema([pa.field('a', pa.null(), nullable=False)]))
> In [19]: table
> Out[19]:
> pyarrow.Table
> a: null not null
> In [20]: pq.write_table(table, "test_null.parquet")
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> F1128 14:08:30.267439 27560 column_writer.cc:837]  Check failed: (nullptr) != (values)
> *** Check failure stack trace: ***
> Aborted (core dumped)
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)