You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Antoine Pitrou (JIRA)" <ji...@apache.org> on 2019/02/28 09:16:00 UTC

[jira] [Commented] (ARROW-4706) [C++] shared conversion framework for JSON/CSV parsers

    [ https://issues.apache.org/jira/browse/ARROW-4706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16780292#comment-16780292 ] 

Antoine Pitrou commented on ARROW-4706:
---------------------------------------

I wonder how much can be shared. Obviously the parsing will be different, but even the conversion layer will be different too. So perhaps the higher-level orchestration layer can be shared.

Note that ARROW-3410 will imply changing the CSV orchestration layer.

> [C++] shared conversion framework for JSON/CSV parsers
> ------------------------------------------------------
>
>                 Key: ARROW-4706
>                 URL: https://issues.apache.org/jira/browse/ARROW-4706
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Benjamin Kietzman
>            Assignee: Benjamin Kietzman
>            Priority: Major
>
> CSV and JSON both convert strings to values in a Array but there is little code sharing beyond {{arrow::util::StringConverter}}.
> It would be advantageous if a single interface could be shared between CSV and JSON to do the heavy lifting of conversion consistently. This would simplify addition of new parsers as well as allowing all parsers to immediately take advantage of a new conversion strategy.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)