You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Wes McKinney (Jira)" <ji...@apache.org> on 2020/06/26 16:00:00 UTC
[jira] [Commented] (ARROW-7273) [Python] Non-nullable null field is
allowed / crashes when writing to parquet
[ https://issues.apache.org/jira/browse/ARROW-7273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17146433#comment-17146433 ]
Wes McKinney commented on ARROW-7273:
-------------------------------------
I assigned this so it doesn't get evicted from the backlog. If someone else can take a look I would be appreciative
> [Python] Non-nullable null field is allowed / crashes when writing to parquet
> -----------------------------------------------------------------------------
>
> Key: ARROW-7273
> URL: https://issues.apache.org/jira/browse/ARROW-7273
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Python
> Reporter: Joris Van den Bossche
> Assignee: Wes McKinney
> Priority: Major
> Labels: parquet
> Fix For: 1.0.0
>
>
> It seems to be possible to create a "non-nullable null field". While this does not make any sense (so already a reason to disallow this I think), this can also lead to crashed in further operations, such as writing to parquet:
> {code}
> In [18]: table = pa.table([pa.array([None, None], pa.null())], schema=pa.schema([pa.field('a', pa.null(), nullable=False)]))
> In [19]: table
> Out[19]:
> pyarrow.Table
> a: null not null
> In [20]: pq.write_table(table, "test_null.parquet")
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> F1128 14:08:30.267439 27560 column_writer.cc:837] Check failed: (nullptr) != (values)
> *** Check failure stack trace: ***
> Aborted (core dumped)
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)