Posted to jira@arrow.apache.org by "Sifang Li (Jira)" <ji...@apache.org> on 2022/03/01 18:57:00 UTC

[jira] [Commented] (ARROW-15790) [C++] field's metadata is not written to Parquet file

    [ https://issues.apache.org/jira/browse/ARROW-15790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17499692#comment-17499692 ] 

Sifang Li commented on ARROW-15790:
-----------------------------------

Yes, that worked for me. It would be nice if the field metadata were stored automatically: I cannot imagine it takes up much space, and I do not see why anyone would want that information dropped.

> [C++] field's metadata is not written to Parquet file
> -----------------------------------------------------
>
>                 Key: ARROW-15790
>                 URL: https://issues.apache.org/jira/browse/ARROW-15790
>             Project: Apache Arrow
>          Issue Type: Bug
>         Environment: Ubuntu
>            Reporter: Sifang Li
>            Priority: Blocker
>
> I used this code to test writing metadata to a Parquet file and reading it back:
> [https://gist.github.com/dantrim/33f9f14d0b2d3ec45c022aa05f7a45ee]
>  
> The generated file shows no metadata when I read it back with the code below and print the schema:
>  
> {quote}std::shared_ptr<arrow::io::ReadableFile> infile;
> PARQUET_ASSIGN_OR_THROW(infile,
>     arrow::io::ReadableFile::Open("./test.parquet", arrow::default_memory_pool()));
> std::unique_ptr<parquet::arrow::FileReader> reader;
> PARQUET_THROW_NOT_OK(
>     parquet::arrow::OpenFile(infile, arrow::default_memory_pool(), &reader));
> std::shared_ptr<arrow::Table> table;
> PARQUET_THROW_NOT_OK(reader->ReadTable(&table));
> EXPECT_EQ(frameCount, table->num_rows());
> std::cout << "===" << table->schema()->ToString(true) << std::endl;  // no metadata shown{quote}
> Here is the version info:
> libparquet-dev/focal,now 7.0.0-1 amd64 [installed]
> libparquet-glib-dev/focal,now 7.0.0-1 amd64 [installed]
> libparquet-glib700/focal,now 7.0.0-1 amd64 [installed,automatic]
> libparquet700/focal,now 7.0.0-1 amd64 [installed,automatic]



--
This message was sent by Atlassian Jira
(v8.20.1#820001)