Posted to jira@arrow.apache.org by "Andrew Lamb (Jira)" <ji...@apache.org> on 2022/10/01 11:29:00 UTC

[jira] [Updated] (ARROW-17907) [Website] Blog about Arrow <--> Parquet translation and structured representation

     [ https://issues.apache.org/jira/browse/ARROW-17907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Lamb updated ARROW-17907:
--------------------------------
    Description: 
@tustvold has spent a significant amount of time fixing the Rust implementation of the Parquet <--> Arrow conversion logic so that it handles all the corner cases involving nulls, etc.

During that process, he found relatively little information available on the topic, so we would like to write some blog posts to remedy that and explain the Arrow and Parquet formats.

 

The basic outline is:

Part 1: Intro / Encoding Primitive Arrays in Arrow and Parquet
Part 2: Encoding Structs and Lists in Arrow and Parquet
Part 3: Encoding Arbitrary Structs of Lists, Lists of Structs in Arrow and Parquet 🤯

  was:
@tustvold has spent a significant amount of time fixing the Rust implementation of the Parquet <--> Arrow conversion logic so that it handles all the corner cases involving nulls, etc.

During that process, he found relatively little information available on the topic, so we would like to write some blog posts to remedy that and explain the Arrow and Parquet formats.


> [Website] Blog about Arrow <--> Parquet translation and structured representation 
> ----------------------------------------------------------------------------------
>
>                 Key: ARROW-17907
>                 URL: https://issues.apache.org/jira/browse/ARROW-17907
>             Project: Apache Arrow
>          Issue Type: Task
>            Reporter: Andrew Lamb
>            Assignee: Andrew Lamb
>            Priority: Major
>
> @tustvold has spent a significant amount of time fixing the Rust implementation of the Parquet <--> Arrow conversion logic so that it handles all the corner cases involving nulls, etc.
>
> During that process, he found relatively little information available on the topic, so we would like to write some blog posts to remedy that and explain the Arrow and Parquet formats.
>  
> The basic outline is:
> Part 1: Intro / Encoding Primitive Arrays in Arrow and Parquet
> Part 2: Encoding Structs and Lists in Arrow and Parquet
> Part 3: Encoding Arbitrary Structs of Lists, Lists of Structs in Arrow and Parquet 🤯



--
This message was sent by Atlassian Jira
(v8.20.10#820010)