Posted to jira@arrow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/12/01 14:04:00 UTC

[jira] [Updated] (ARROW-14940) [C++] Speed up CSV parser with long CSV cells

     [ https://issues.apache.org/jira/browse/ARROW-14940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

ASF GitHub Bot updated ARROW-14940:
-----------------------------------
    Labels: pull-request-available  (was: )

> [C++] Speed up CSV parser with long CSV cells
> ---------------------------------------------
>
>                 Key: ARROW-14940
>                 URL: https://issues.apache.org/jira/browse/ARROW-14940
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Antoine Pitrou
>            Assignee: Antoine Pitrou
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Some CSV files may have long cells (values), for example if they contain arbitrary text or even things like timestamps. We can speed up parsing such CSV files by scanning multiple bytes at once for state-changing characters such as delimiters.
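
To make the idea in the description concrete, here is a minimal C++ sketch of checking eight bytes at a time for state-changing characters. It is an illustration only and not necessarily the approach taken in the linked pull request: the helper names HasByte and SkipPlainBytes are hypothetical and not part of Arrow's API, the set of special characters (delimiter, quote, CR, LF) is an assumption, and escape characters are ignored for brevity.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <string_view>

// Hypothetical helper (not an Arrow function): returns true if any of the
// eight bytes packed in `word` equals `c`. XOR-ing with `c` repeated in every
// byte turns matching bytes into zero, then the classic "has zero byte"
// bit trick detects whether a zero byte exists.
inline bool HasByte(std::uint64_t word, unsigned char c) {
  constexpr std::uint64_t kOnes = 0x0101010101010101ULL;
  constexpr std::uint64_t kHighs = 0x8080808080808080ULL;
  const std::uint64_t x = word ^ (kOnes * c);
  return ((x - kOnes) & ~x & kHighs) != 0;
}

// Hypothetical helper: returns the offset of the first byte in `data` that
// could change the parser state (delimiter, quote, CR or LF), or data.size()
// if there is none. Assumed character set; a real parser may need more.
inline std::size_t SkipPlainBytes(std::string_view data, char delimiter = ',',
                                  char quote = '"') {
  std::size_t pos = 0;
  // Fast path: test 8 bytes per iteration and skip them if all are plain.
  while (pos + 8 <= data.size()) {
    std::uint64_t word;
    std::memcpy(&word, data.data() + pos, sizeof(word));
    if (HasByte(word, static_cast<unsigned char>(delimiter)) ||
        HasByte(word, static_cast<unsigned char>(quote)) ||
        HasByte(word, '\n') || HasByte(word, '\r')) {
      break;  // A special character is somewhere in this word.
    }
    pos += 8;
  }
  // Slow path: locate the exact byte, and handle the tail of the buffer.
  for (; pos < data.size(); ++pos) {
    const char c = data[pos];
    if (c == delimiter || c == quote || c == '\n' || c == '\r') break;
  }
  return pos;
}
```

A parser could call SkipPlainBytes at the start of each cell and hand only the byte at the returned offset to its regular state machine. For short cells the fast path rarely triggers, so the benefit is expected mainly on long values such as free text.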



--
This message was sent by Atlassian Jira
(v8.20.1#820001)