You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Bhaskar Mookerji (JIRA)" <ji...@apache.org> on 2018/12/09 07:28:00 UTC

[jira] [Created] (ARROW-3968) Standalone CSV to Arrow Conversion Tool

Bhaskar Mookerji created ARROW-3968:
---------------------------------------

             Summary: Standalone CSV to Arrow Conversion Tool
                 Key: ARROW-3968
                 URL: https://issues.apache.org/jira/browse/ARROW-3968
             Project: Apache Arrow
          Issue Type: New Feature
          Components: C++
            Reporter: Bhaskar Mookerji


As part of exploring the Arrow C++ implementation, I wrote standalone tool for streaming Arrow to a file from CSV, using the new CSV parser implementation from [~pitrou]. I realize that Arrow's emphasis is on in-memory representation, as opposed to efficient storage to disk, so I'd love to know if this has any utility for the project. At the very least, it seems like a quick way to get into exploring the format from a something easily inspectable/familiar (i.e., CSV).

In either case, I'm making this issue here as a placeholder for an accompanying PR on Github. Also, I think this is my first issue for this project, so please let me know if I should do anything differently.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)