Posted to jira@arrow.apache.org by "Pere-Lluís Huguet Cabot (Jira)" <ji...@apache.org> on 2021/01/04 21:18:00 UTC
[jira] [Commented] (ARROW-9612) [Python] Automatically back on larger IO block size when JSON parsing fails
[ https://issues.apache.org/jira/browse/ARROW-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17258501#comment-17258501 ]
Pere-Lluís Huguet Cabot commented on ARROW-9612:
------------------------------------------------
I have attached a file that triggers the same error. It is a dump of Wikipedia abstracts with Wikidata information.
> [Python] Automatically back on larger IO block size when JSON parsing fails
> ---------------------------------------------------------------------------
>
> Key: ARROW-9612
> URL: https://issues.apache.org/jira/browse/ARROW-9612
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 3.0.0
>
> Attachments: wiki_04.jsonl
>
>
> From GitHub issue
> https://github.com/apache/arrow/issues/7835
> This seems like a less-than-ideal failure mode. Perhaps when this occurs, it could automatically switch to processing the file as a single block?
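
For readers hitting this error before a fix is available, the fallback suggested above can be approximated on the user side. The following is only a sketch under stated assumptions, not the change tracked in this issue: the helper names `read_with_growing_block_size` and `read_jsonl` are hypothetical, the starting and maximum block sizes are arbitrary, and it assumes the failure surfaces as `pyarrow.ArrowInvalid`. It retries `pyarrow.json.read_json` with a doubled `block_size` until the read succeeds or a cap is reached.

```python
from typing import Callable, Tuple, Type, TypeVar

T = TypeVar("T")


def read_with_growing_block_size(
    read_fn: Callable[[int], T],
    initial_block_size: int = 1 << 20,  # 1 MiB, an arbitrary starting point
    max_block_size: int = 1 << 30,      # 1 GiB cap, also arbitrary
    retry_on: Tuple[Type[BaseException], ...] = (Exception,),
) -> T:
    """Call read_fn(block_size), doubling block_size after each failure.

    Hypothetical helper: re-raises the last error once the cap is reached.
    """
    block_size = initial_block_size
    while True:
        try:
            return read_fn(block_size)
        except retry_on:
            if block_size >= max_block_size:
                raise
            block_size = min(block_size * 2, max_block_size)


def read_jsonl(path: str):
    """Read a line-delimited JSON file with pyarrow, retrying with larger blocks."""
    # Imported lazily so the generic helper above works without pyarrow installed.
    import pyarrow as pa
    import pyarrow.json as pj

    def read_fn(block_size: int):
        options = pj.ReadOptions(block_size=block_size)
        return pj.read_json(path, read_options=options)

    # Assumption: the block-boundary failure is raised as pa.ArrowInvalid.
    return read_with_growing_block_size(read_fn, retry_on=(pa.ArrowInvalid,))
```

Note that `pyarrow.ArrowInvalid` is also raised for genuinely malformed JSON, so in that case this sketch retries up to the cap before re-raising the original error.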
--
This message was sent by Atlassian Jira
(v8.3.4#803005)