Posted to dev@accumulo.apache.org by "John Vines (Updated) (JIRA)" <ji...@apache.org> on 2012/02/19 18:28:36 UTC

[jira] [Updated] (ACCUMULO-418) Make RFiles splittable

     [ https://issues.apache.org/jira/browse/ACCUMULO-418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Vines updated ACCUMULO-418:
--------------------------------

    Affects Version/s:     (was: 1.4.1)
                           (was: 1.3.6)
                           (was: 1.5.0)
        Fix Version/s:     (was: 1.5.0-SNAPSHOT)
                           (was: 1.3.5)
                           (was: 1.4.0)

Similar yet different. Both tickets can be implemented independently, but they benefit from one another. So you're right on, Ivan.
                
> Make RFiles splittable
> ----------------------
>
>                 Key: ACCUMULO-418
>                 URL: https://issues.apache.org/jira/browse/ACCUMULO-418
>             Project: Accumulo
>          Issue Type: New Feature
>          Components: master, tserver
>    Affects Versions: 1.3.5, 1.4.0, 1.5.0-SNAPSHOT
>         Environment: All
>            Reporter: Ivan Bella
>            Assignee: Eric Newton
>              Labels: RFile, hadoop, mapreduce
>             Fix For: 1.5.0, 1.3.6, 1.4.1
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
> There are times when iterating over RFiles is useful in map-reduce jobs.  I know that RFiles can logically be split on block boundaries; however, there is currently no easy way to do this because no RFile RecordReader or InputFormat is provided.
