Posted to dev@giraph.apache.org by "Alessio Arleo (JIRA)" <ji...@apache.org> on 2015/03/25 10:54:53 UTC
[jira] [Created] (GIRAPH-1000) Multi Output support
Alessio Arleo created GIRAPH-1000:
-------------------------------------
Summary: Multi Output support
Key: GIRAPH-1000
URL: https://issues.apache.org/jira/browse/GIRAPH-1000
Project: Giraph
Issue Type: Improvement
Components: bsp, conf and scripts, graph
Affects Versions: 1.0.0, 1.1.0, 1.2.0-SNAPSHOT
Reporter: Alessio Arleo
Hadoop natively supports multiple outputs. The objective is to extend Giraph so that a single Giraph run can write its results in multiple output formats.
According to the official Hadoop API docs*, the pattern for using multiple outputs is the following:
- Modify the job submission to declare the additional outputs
- Modify the reducer class to write to the declared outputs
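The two steps above can be modeled in plain Java, without Hadoop on the classpath. This is only a rough sketch under assumptions: NamedOutputs and MultiOutputSketch are hypothetical names, not Hadoop or Giraph classes, and the in-memory lists stand in for real output files. In Hadoop itself the corresponding calls are MultipleOutputs.addNamedOutput (at job submission) and MultipleOutputs.write (inside the task), as described in the apidocs referenced here.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for Hadoop's MultipleOutputs; not part of Hadoop or Giraph. */
class NamedOutputs {
    // Named output -> lines written to it (an in-memory stand-in for an output file).
    private final Map<String, List<String>> sinks = new LinkedHashMap<>();

    /** Step 1 (job submission): declare an output before any task runs. */
    void addNamedOutput(String name) {
        sinks.put(name, new ArrayList<>());
    }

    /** Step 2 (inside the task): write a record to a previously declared output. */
    void write(String name, long vertexId, String value) {
        List<String> sink = sinks.get(name);
        if (sink == null) {
            throw new IllegalArgumentException("Undeclared named output: " + name);
        }
        sink.add(vertexId + "\t" + value);
    }

    /** Returns the lines written so far, or null if the output was never declared. */
    List<String> contents(String name) {
        return sinks.get(name);
    }
}

class MultiOutputSketch {
    /** Declares two outputs, then writes one vertex's values to both. */
    static NamedOutputs run() {
        NamedOutputs outputs = new NamedOutputs();
        outputs.addNamedOutput("ranks");    // assumed output name, for illustration
        outputs.addNamedOutput("degrees");  // assumed output name, for illustration
        outputs.write("ranks", 1L, "0.25");
        outputs.write("degrees", 1L, "3");
        return outputs;
    }

    public static void main(String[] args) {
        NamedOutputs outputs = run();
        System.out.println(outputs.contents("ranks"));
        System.out.println(outputs.contents("degrees"));
    }
}
```

The point of the sketch is the split between declaration and use: an output has to be declared up front, and a write to an undeclared name is rejected. Where the write step would live in a Giraph job is exactly the open question of this issue.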
Since Giraph jobs are executed as mappers, with no reduce phase, this approach (or at least its second part) is probably not feasible, so further investigation is necessary.
*https://hadoop.apache.org/docs/r1.2.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html