You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-issues@hadoop.apache.org by "Tom White (JIRA)" <ji...@apache.org> on 2009/09/09 10:02:03 UTC
[jira] Updated: (MAPREDUCE-775) Add input/output formatters for
Vertica clustered ADBMS.
[ https://issues.apache.org/jira/browse/MAPREDUCE-775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tom White updated MAPREDUCE-775:
--------------------------------
Assignee: Omer Trajman
Status: Open (was: Patch Available)
> Add input/output formatters for Vertica clustered ADBMS.
> --------------------------------------------------------
>
> Key: MAPREDUCE-775
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-775
> Project: Hadoop Map/Reduce
> Issue Type: New Feature
> Reporter: Omer Trajman
> Assignee: Omer Trajman
> Fix For: 0.21.0
>
> Attachments: MAPREDUCE-775.patch
>
>
> Add native support for Vertica as an input or output format taking advantage of parallel read and write properties of the DBMS.
>
> On the input side allow for parametrized queries (a la prepared statements) and create a split for each combination of parameters. Also support the parameter list to be generated from a sql statement. For example - return metrics for all dimensions that meet criteria X with one input split for each dimension. Divide the read among any number of hosts in the Vertica cluster.
>
> On the output side, support Vertica streaming load to any number of hosts in the Vertica cluster. Output may be to a different cluster than input.
>
> Also includes Input and Output formatters that support streaming interface.
> Code has been tested and run on live systems under 19 and 20. Patch for 21 with new API will be ready end of this week.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.