You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Emiel Botman (Jira)" <ji...@apache.org> on 2021/03/17 19:41:00 UTC

[jira] [Created] (ARROW-12007) Loading parquet file returns "Invalid UTF8 payload" error

Emiel Botman created ARROW-12007:
------------------------------------

             Summary: Loading parquet file returns "Invalid UTF8 payload" error
                 Key: ARROW-12007
                 URL: https://issues.apache.org/jira/browse/ARROW-12007
             Project: Apache Arrow
          Issue Type: Bug
          Components: Format
    Affects Versions: 3.0.0
            Reporter: Emiel Botman


While loading a specific parquet file (arrow::read_parquet(file = file)), the following error is returned:

Error in parquet___arrow___FileReader__ReadTable1(self) : 
  Invalid: Invalid UTF8 payload

I managed to load several other parquet files, it is just this specific file due to which I presume it may be due to some syntax used in this file.  As there any known bug in terms of handling the UTF8 encoding?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)