You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@parquet.apache.org by "Wes McKinney (JIRA)" <ji...@apache.org> on 2019/05/08 15:52:00 UTC
[jira] [Resolved] (PARQUET-1571) [C++] Can't read data from parquet
file in C++ library
[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney resolved PARQUET-1571.
-----------------------------------
Resolution: Fixed
Fix Version/s: cpp-1.6.0
Issue resolved by pull request 4275
[https://github.com/apache/arrow/pull/4275]
> [C++] Can't read data from parquet file in C++ library
> ------------------------------------------------------
>
> Key: PARQUET-1571
> URL: https://issues.apache.org/jira/browse/PARQUET-1571
> Project: Parquet
> Issue Type: Bug
> Components: parquet-cpp
> Reporter: worker24h
> Priority: Critical
> Labels: pull-request-available
> Fix For: cpp-1.6.0
>
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> Specified the second param *parquet::ReaderProperties* When I used parquet::ParquetFileReader::Open, it can't work.
> The following code:
> {code:java}
> parquet::ReaderProperties _properties;
> _properties = parquet::ReaderProperties();
> _properties.enable_buffered_stream(); // used buffer stream. Don't set buffer-size
> parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
> ...
> int32_t value;
> parquet::Int32Reader* int32_reader =
> static_cast<parquet::Int32Reader*>(column_reader.get());
> int32_reader->Skip(_current_line_of_group);// skip lines of processed.
> rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, &values_read);
> {code}
> The interface *Skip* throw exception:
> {color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid data Deserializing page header failed.}}{color}
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)