You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2017/09/19 13:19:00 UTC

[jira] [Commented] (ARROW-1209) [C++] Implement converter between Arrow record batches and Avro records

    [ https://issues.apache.org/jira/browse/ARROW-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16171666#comment-16171666 ] 

ASF GitHub Bot commented on ARROW-1209:
---------------------------------------

Github user mariusvniekerk commented on the issue:

    https://github.com/apache/arrow/pull/1026
  
    Some more investigation.
    
    There are two kinds of avro.  Avro memory buffers (streams) and Avro datafiles.
    
    Most of the implementation of the Avro datafile handling is locked up in the private implementation of `avro-c`  (header reading, compression codecs, Avro sync markers).  As such the only way we can interact with that short of reimplementing large parts of it is to restrict ourselves to things that can provide a c `FILE*` reference to the datafile.
    
    The second case is much simpler to handle since it assumes a preshared Avro schema.


> [C++] Implement converter between Arrow record batches and Avro records
> -----------------------------------------------------------------------
>
>                 Key: ARROW-1209
>                 URL: https://issues.apache.org/jira/browse/ARROW-1209
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>              Labels: pull-request-available
>
> This would be useful for streaming systems that need to consume or produce Avro in C/C++



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)