Posted to jira@arrow.apache.org by "Kyle Barron (Jira)" <ji...@apache.org> on 2022/03/04 05:25:00 UTC

[jira] [Commented] (ARROW-2786) [JS] Read Parquet files in JavaScript

    [ https://issues.apache.org/jira/browse/ARROW-2786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17501161#comment-17501161 ] 

Kyle Barron commented on ARROW-2786:
------------------------------------

I have a [working prototype here|https://github.com/kylebarron/parquet-wasm] of the Rust `parquet` and `arrow` crates compiled to WASM. As [described on the Rust repo|https://github.com/apache/arrow-rs/issues/180#issuecomment-1058827130], I've gotten Snappy, Gzip, Brotli, and ZSTD codecs working in WASM, and it looks like LZ4 should be attainable by switching out the dependency.

My idea is that the bindings can stay fairly small: they only parse the Parquet input and produce Arrow IPC data to send back to the JS side. Arrow JS would then parse the IPC data, and everything from that point on could use Arrow JS.
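
A rough sketch of that split, in TypeScript. The two function types are hypothetical stand-ins, not the actual exports of parquet-wasm or Arrow JS: `ParquetToIpc` represents whatever function the WASM module exposes, and `IpcReader` represents the Arrow JS side. The only format detail assumed is that a Parquet file with a plaintext footer starts and ends with the 4-byte magic `PAR1`.

```typescript
// Hypothetical signatures (assumptions for illustration only):
// a WASM export that turns Parquet bytes into Arrow IPC bytes,
// and a JS-side reader that turns Arrow IPC bytes into some table type T.
type ParquetToIpc = (parquetBytes: Uint8Array) => Uint8Array;
type IpcReader<T> = (ipcBytes: Uint8Array) => T;

// ASCII "PAR1": the magic at both ends of a Parquet file with a plaintext footer.
const PARQUET_MAGIC = [0x50, 0x41, 0x52, 0x31];

// Cheap sanity check on the JS side before crossing into WASM.
function hasParquetMagic(bytes: Uint8Array): boolean {
  if (bytes.length < 2 * PARQUET_MAGIC.length) return false;
  const tail = bytes.length - PARQUET_MAGIC.length;
  return PARQUET_MAGIC.every((b, i) => bytes[i] === b && bytes[tail + i] === b);
}

// Compose the two halves: WASM parses Parquet and emits IPC bytes,
// then the JS reader takes over and no further WASM calls are needed.
function makeParquetReader<T>(
  parquetToIpc: ParquetToIpc,
  readIpc: IpcReader<T>,
): (parquetBytes: Uint8Array) => T {
  return (parquetBytes) => {
    if (!hasParquetMagic(parquetBytes)) {
      throw new Error("input does not look like a Parquet file");
    }
    const ipcBytes = parquetToIpc(parquetBytes);
    return readIpc(ipcBytes);
  };
}
```

In a real setup, `readIpc` could be something like `tableFromIPC` from the `apache-arrow` package, and `parquetToIpc` the function exported by the WASM build. The only thing that crosses the WASM boundary is a byte buffer in each direction, which is what keeps the bindings small.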

Unfortunately, there currently seem to be issues with Arrow JS's IPC implementation (the reason I was looking through these issues 😄), so Arrow JS often can't read the IPC data returned from Rust (though PyArrow can read the files, so they appear to be correctly formatted).

> [JS] Read Parquet files in JavaScript
> -------------------------------------
>
>                 Key: ARROW-2786
>                 URL: https://issues.apache.org/jira/browse/ARROW-2786
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: JavaScript
>            Reporter: Wes McKinney
>            Priority: Major
>              Labels: parquet
>
> See question in https://github.com/apache/arrow/issues/2209



--
This message was sent by Atlassian Jira
(v8.20.1#820001)