You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@sqoop.apache.org by "Sqoop QA bot (JIRA)" <ji...@apache.org> on 2014/11/17 08:54:35 UTC

[jira] [Commented] (SQOOP-1588) TO-side: Write data to HDFS

    [ https://issues.apache.org/jira/browse/SQOOP-1588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14214394#comment-14214394 ] 

Sqoop QA bot commented on SQOOP-1588:
-------------------------------------

Here are the results of testing the latest attachment
https://issues.apache.org/jira/secure/attachment/12681860/SQOOP-1588.5.patch against branch sqoop2.

{color:green}Overall:{color} +1 all checks pass

{color:green}SUCCESS:{color} Clean was successful
{color:green}SUCCESS:{color} Patch applied correctly
{color:green}SUCCESS:{color} Patch add/modify test case
{color:green}SUCCESS:{color} Patch compiled
{color:green}SUCCESS:{color} All tests passed

Console output: https://builds.apache.org/job/PreCommit-SQOOP-Build/622/console

This message is automatically generated.

> TO-side: Write data to HDFS
> ---------------------------
>
>                 Key: SQOOP-1588
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1588
>             Project: Sqoop
>          Issue Type: Sub-task
>          Components: connectors
>            Reporter: Qian Xu
>            Assignee: Qian Xu
>             Fix For: 1.99.5
>
>         Attachments: SQOOP-1588.5.patch
>
>
> Create a basic Kite connector that can write data (i.e. from a jdbc connection) to HDFS. 
> The scope is defined as follows:
> - Destination: HDFS
> - File Format: Avro Parquet and CSV.
> - Compression Codec: Use default
> - Partitioner Strategy: Not supported
> - Column Mapping: Not supported
> Exposed Configuration:
> - [Link] File Format (Enum)
> - [To] Dataset URI (String, has a validation check) 
> Workflow:
> - Create a link to Kite Connector
> - Create a job with valid configuration (see above)
> - Start a job {{KiteToInitializer}} will check dataset existence 
> - Sqoop will create N {{KiteLoader}} instances.
> - Kite requires an Avro schema for data manipulation, {{KiteLoader}} will create an Avro schema from Sqoop schema provided by {{LoaderContext}}. As Sqoop schema types are not identical to Avro types, some types will be mapped. The original Sqoop type information will be kept as {{SqoopType}} in schema field, which can be used for a reversed type mapping.
> - {{KiteLoader}} will create a temporary dataset and writes data records into it. If any error occurs, the dataset will be deleted.
> - {{KiteToDestroy}} will merge all temporary datasets as one dataset.
> Further features will be implemented in follow-up JIRAs.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)