Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/02/27 13:05:00 UTC
[jira] [Updated] (ARROW-10706) [Python][Parquet] Fix a bug when partition filters has empty result
[ https://issues.apache.org/jira/browse/ARROW-10706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joris Van den Bossche updated ARROW-10706:
------------------------------------------
Component/s: Python
> [Python][Parquet] Fix a bug when partition filters has empty result
> -------------------------------------------------------------------
>
> Key: ARROW-10706
> URL: https://issues.apache.org/jira/browse/ARROW-10706
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Weiyang Zhao
> Assignee: Weiyang Zhao
> Priority: Major
>
> The code below raises an IndexError:
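>     import pyarrow.parquet as pq
>
>     # base_path is the root of the partitioned dataset; fs is its filesystem
>     dataset = pq.ParquetDataset(
>         base_path, filesystem=fs,
>         filters=[('string', '=', "notExisted")],
>         use_legacy_dataset=True
>     )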
> when the partition 'string' does not have a matching partition value 'notExisted'.
> The correct behavior is to return an empty dataset with the actual schema.
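
The expected behavior described in the issue can be illustrated with a small, self-contained sketch. This is not the pyarrow implementation: `Piece`, `TinyDataset`, and the file paths are hypothetical stand-ins, and the sketch assumes the schema is known independently of the matching files. Only the equality operator is handled. The point is that a filter matching no partition value yields an empty list of pieces while the schema is preserved, rather than deriving the schema from the first matching piece (which fails on an empty list).

```python
from dataclasses import dataclass


@dataclass
class Piece:
    """Hypothetical stand-in for one file under a partition directory."""
    path: str
    partition_keys: dict


@dataclass
class TinyDataset:
    """Hypothetical toy dataset; not the pyarrow ParquetDataset class."""
    pieces: list
    schema: tuple  # column names, assumed known without reading any piece

    def filter(self, column, op, value):
        if op != "=":
            raise NotImplementedError("this sketch only handles '='")
        kept = [p for p in self.pieces if p.partition_keys.get(column) == value]
        # Carry the schema over unchanged. Deriving it from kept[0]
        # would raise IndexError when no partition value matches.
        return TinyDataset(pieces=kept, schema=self.schema)


dataset = TinyDataset(
    pieces=[
        Piece("base/string=a/part-0.parquet", {"string": "a"}),
        Piece("base/string=b/part-0.parquet", {"string": "b"}),
    ],
    schema=("value", "string"),
)

# No partition has string == "notExisted": the result is empty but keeps its schema.
empty = dataset.filter("string", "=", "notExisted")
```

Here `empty.pieces` is an empty list and `empty.schema` is still the original tuple of column names, which mirrors the "empty dataset with the actual schema" outcome requested above.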
--
This message was sent by Atlassian Jira
(v8.3.4#803005)