You are viewing a plain text version of this content. The canonical link for it is here.

Posted to mapreduce-issues@hadoop.apache.org by "Jacob Rideout (JIRA)" <ji...@apache.org> on 2009/12/15 07:09:18 UTC

[jira] Commented: (MAPREDUCE-815) Add AvroInputFormat and AvroOutputFormat so that hadoop can use Avro Serialization

    [ https://issues.apache.org/jira/browse/MAPREDUCE-815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12790583#action_12790583 ] 

Jacob Rideout commented on MAPREDUCE-815:
-----------------------------------------

What is the current line of thought on how keys and values will interact with the schema for an avro file? Is the intention that there would be a master schema that encapsulated the key/values similar to:

{code}
{ "type" : "record",
  "fields" : [
  { "name" : "KEY", "type" : "record" },
  { "name" : "VALUE", "type" : "record" }
]}
{code}

What about files created without this "master" schema; would the key return a null object? Byte offset in a schema of type "long" ?

> Add AvroInputFormat and AvroOutputFormat so that hadoop can use Avro Serialization
> ----------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-815
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-815
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>            Reporter: Ravi Gummadi
>            Assignee: Ravi Gummadi
>
> MapReduce needs AvroInputFormat similar to other InputFormats like TextInputFormat to be able to use avro serialization in hadoop. Similarly AvroOutputFormat is needed.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.