Posted to jira@arrow.apache.org by "Andrew Lamb (Jira)" <ji...@apache.org> on 2022/10/01 11:30:00 UTC

[jira] [Updated] (ARROW-17907) [Website] Blog about Arrow <--> Parquet translation and nesting

     [ https://issues.apache.org/jira/browse/ARROW-17907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andrew Lamb updated ARROW-17907:
--------------------------------
    Summary: [Website] Blog about Arrow <--> Parquet translation and nesting  (was: [Website] Blog about Arrow <--> Parquet translation and structured representation )

> [Website] Blog about Arrow <--> Parquet translation and nesting
> ---------------------------------------------------------------
>
>                 Key: ARROW-17907
>                 URL: https://issues.apache.org/jira/browse/ARROW-17907
>             Project: Apache Arrow
>          Issue Type: Task
>            Reporter: Andrew Lamb
>            Assignee: Andrew Lamb
>            Priority: Major
>
> @tustvold has spent a significant amount of time fixing the Rust implementation of the Parquet <--> Arrow conversion logic to handle all the corner cases of nulls, etc.
>  
> During that process, he observed that relatively little information on the topic is available, so we would like to write some blog posts to remedy that and explain the Arrow format and Parquet.
>  
> The basic outline is:
> Part 1: Intro / Encoding Primitive Arrays in Arrow and Parquet
> Part 2: Encoding Structs and Lists in Arrow and Parquet
> Part 3: Encoding Arbitrary Structs of Lists, Lists of Structs in Arrow and Parquet 🤯



--
This message was sent by Atlassian Jira
(v8.20.10#820010)