Posted to dev@giraph.apache.org by "Alessio Arleo (JIRA)" <ji...@apache.org> on 2015/03/25 10:54:53 UTC

[jira] [Created] (GIRAPH-1000) Multi Output support

Alessio Arleo created GIRAPH-1000:
-------------------------------------

             Summary: Multi Output support
                 Key: GIRAPH-1000
                 URL: https://issues.apache.org/jira/browse/GIRAPH-1000
             Project: Giraph
          Issue Type: Improvement
          Components: bsp, conf and scripts, graph
    Affects Versions: 1.0.0, 1.1.0, 1.2.0-SNAPSHOT
            Reporter: Alessio Arleo


Hadoop natively supports multiple outputs through its MultipleOutputs class. The objective is to extend Giraph to support multiple output formats during a single Giraph run.

According to the official Hadoop apidocs*, the pattern for taking advantage of multiple outputs is the following:
- Modify the job submission to declare the additional outputs
- Modify the reducer class to write to the declared outputs

Since Giraph jobs are executed as mappers, with no reducer class to modify, this approach (or at least its second part) is probably not feasible as described, so further investigation is necessary.
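
To make the two steps concrete, here is a minimal stand-alone sketch of the pattern in plain Java. It is only a toy model: NamedOutputs, declare, write, contents and runMapOnlyTask are hypothetical names invented for illustration, not Hadoop or Giraph classes. declare stands in for the MultipleOutputs.addNamedOutput call made at job submission, and write stands in for the MultipleOutputs write call made from the task. The "task" is a plain loop, assumed here to play the role of a map-only task; whether Giraph's own output path can be hooked in the same way is exactly the open question of this issue.

```java
import java.io.StringWriter;
import java.util.LinkedHashMap;
import java.util.Map;

// Toy model of the "declare, then write by name" pattern.
// Hypothetical stand-in only: none of these classes exist in Hadoop or Giraph.
class NamedOutputsSketch {

    static final class NamedOutputs {
        // One in-memory sink per declared output name.
        private final Map<String, StringWriter> sinks = new LinkedHashMap<>();

        // Step 1 (job submission): declare each additional output up front.
        public void declare(String name) {
            sinks.put(name, new StringWriter());
        }

        // Step 2 (task side): write a record to a previously declared output.
        public void write(String name, Object key, Object value) {
            sinkFor(name).write(key + "\t" + value + "\n");
        }

        // Returns everything written so far to the given output.
        public String contents(String name) {
            return sinkFor(name).toString();
        }

        // Looks up a sink and rejects names that were never declared.
        private StringWriter sinkFor(String name) {
            StringWriter sink = sinks.get(name);
            if (sink == null) {
                throw new IllegalArgumentException("Undeclared named output: " + name);
            }
            return sink;
        }
    }

    // Simulates a single map-only task that emits two views of the same vertices.
    static NamedOutputs runMapOnlyTask(long[] vertexIds, int[] degrees, String[] labels) {
        NamedOutputs outputs = new NamedOutputs();
        outputs.declare("degrees");
        outputs.declare("labels");
        for (int i = 0; i < vertexIds.length; i++) {
            outputs.write("degrees", vertexIds[i], degrees[i]);
            outputs.write("labels", vertexIds[i], labels[i]);
        }
        return outputs;
    }

    public static void main(String[] args) {
        NamedOutputs outputs = runMapOnlyTask(
                new long[] {1L, 2L}, new int[] {3, 0}, new String[] {"a", "b"});
        System.out.print(outputs.contents("degrees"));
        System.out.print(outputs.contents("labels"));
    }
}
```

In this model, writing to a name that was never declared fails immediately, which is the reason the job submission has to change before the task code can.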

*https://hadoop.apache.org/docs/r1.2.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)