You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Todd Farmer (Jira)" <ji...@apache.org> on 2022/07/12 14:05:03 UTC

[jira] [Assigned] (ARROW-12629) [C++] Configurable read-ahead in CSV and JSON readers

     [ https://issues.apache.org/jira/browse/ARROW-12629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Todd Farmer reassigned ARROW-12629:
-----------------------------------

    Assignee:     (was: Supun Kamburugamuva)

This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.

> [C++] Configurable read-ahead in CSV and JSON readers
> -----------------------------------------------------
>
>                 Key: ARROW-12629
>                 URL: https://issues.apache.org/jira/browse/ARROW-12629
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Andre Kohn
>            Priority: Major
>              Labels: good-first-issue, pull-request-available
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> We are compiling Arrow C++ to WebAssembly and ran into the following issue with the CSV reader:
> Browsers became very picky about the use of SharedArrayBuffers after the events around Spectre and Meltdown.
> As a result, you have to compile Arrow to WebAssembly without threads if you don't want to run your website with very strict cross-origin isolation.
> Unfortunately, the CSV reader seems to always spawn a thread for the read-ahead in both, the SerialStreamingReader and the SerialTableReader independent of whether use_threads is set.
> Right now, this effectively means that you cannot use the CSV (and JSON) readers in threadless WebAssembly builds.
>  
> [https://github.com/apache/arrow/blob/4363fefe46dc357a9013f0f4bcdc235e1e2e8124/cpp/src/arrow/csv/reader.cc#L839]
> [https://github.com/apache/arrow/blob/4363fefe46dc357a9013f0f4bcdc235e1e2e8124/cpp/src/arrow/csv/reader.cc#L913]
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)