You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Weston Pace (Jira)" <ji...@apache.org> on 2021/09/22 18:44:00 UTC

[jira] [Commented] (ARROW-14066) [C++] Possible issue with cpp StreamWriter

    [ https://issues.apache.org/jira/browse/ARROW-14066?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17418777#comment-17418777 ] 

Weston Pace commented on ARROW-14066:
-------------------------------------

I think the footer should be written out when the parquet::ParquetFileWriter (owned by parquet::StreamWriter) is closed.  I don't see an explicit parquet::StreamWriter::Close but the destructor of parquet::ParquetFileWriter will call close.

So, did you make sure to delete your parquet::StreamWriter instance before you tried to read the file?

> [C++] Possible issue with cpp StreamWriter
> ------------------------------------------
>
>                 Key: ARROW-14066
>                 URL: https://issues.apache.org/jira/browse/ARROW-14066
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++
>            Reporter: Shashank Yadav
>            Priority: Major
>         Attachments: temp.cpp
>
>
> I am using the cpp streamwriter example given here: [https://github.com/apache/arrow/blob/master/cpp/examples/parquet/parquet_stream_api/stream_reader_writer.cc]
>  
> When I try to read the parquet file back in python it says: Could not open parquet input source '<Buffer>': Invalid: Parquet magic bytes not found in footer. Either the file is corrupted or this is not a parquet file



--
This message was sent by Atlassian Jira
(v8.3.4#803005)