Posted to dev@sqoop.apache.org by "ASF subversion and git services (JIRA)" <ji...@apache.org> on 2015/04/15 08:08:59 UTC

[jira] [Commented] (SQOOP-2299) Sqoop2: Store context classes in repository

    [ https://issues.apache.org/jira/browse/SQOOP-2299?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14495712#comment-14495712 ] 

ASF subversion and git services commented on SQOOP-2299:
--------------------------------------------------------

Commit d5584f27aa178c4457ad24ad736be0908b2d2f9b in sqoop's branch refs/heads/sqoop2 from [~abec]
[ https://git-wip-us.apache.org/repos/asf?p=sqoop.git;h=d5584f2 ]

SQOOP-2299: Sqoop2: Store context classes in repository

(Jarek Jarcec Cecho via Abraham Elmahrek)


> Sqoop2: Store context classes in repository
> -------------------------------------------
>
>                 Key: SQOOP-2299
>                 URL: https://issues.apache.org/jira/browse/SQOOP-2299
>             Project: Sqoop
>          Issue Type: Bug
>    Affects Versions: 1.99.5
>            Reporter: Jarek Jarcec Cecho
>            Assignee: Jarek Jarcec Cecho
>             Fix For: 1.99.7
>
>         Attachments: SQOOP-2299.patch, SQOOP-2299.patch
>
>
> While looking into persisting state from an incremental job (SQOOP-1803), I've uncovered a Hadoop bug where any Hadoop 2 release will return an incorrect {{job.xml}} when using the {{JobClient}} APIs to get a job's details. The issue is harder to track because it was initially fixed in Hadoop 2.7.0 via MAPREDUCE-5875 but subsequently reverted because of MAPREDUCE-6288, and it's not clear to me when/if the fix will be provided. This is relevant to us as we are storing our {{Context}} classes in the job conf. I've looked into why nobody has seen this problem before, and it seems that projects generally persist properties in their own repositories rather than using Hadoop APIs to retrieve the {{Configuration}} object back.
> Thinking about it a bit more, I think it would be useful to keep track of the context classes, as they contain additional information that can be useful for debugging purposes. I'm not yet sure whether we should expose those objects over the REST interface, as they can possibly contain sensitive information, but it seems useful to at least persist them.
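
For illustration only, the idea of persisting context key/value pairs in a repository, instead of reading them back from the job conf, could be sketched roughly as below. This is not the code from the attached patch: the class name ContextSnapshot, its methods, and the key "example.last.value" are hypothetical, and java.util.Properties is used only as a stand-in text encoding for whatever storage format the repository actually uses.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.StringWriter;
import java.util.Map;
import java.util.Properties;
import java.util.TreeMap;

// Hypothetical helper, not part of Sqoop or Hadoop.
public class ContextSnapshot {

    // Turn context key/value pairs into text that could be stored in a repository column.
    public static String serialize(Map<String, String> context) throws IOException {
        Properties props = new Properties();
        props.putAll(context);
        StringWriter out = new StringWriter();
        props.store(out, null); // writes a timestamp comment line followed by key=value lines
        return out.toString();
    }

    // Rebuild the context from the stored text, without touching job.xml or JobClient.
    public static Map<String, String> deserialize(String stored) throws IOException {
        Properties props = new Properties();
        props.load(new StringReader(stored)); // comment lines are ignored on load
        Map<String, String> context = new TreeMap<>();
        for (String name : props.stringPropertyNames()) {
            context.put(name, props.getProperty(name));
        }
        return context;
    }

    public static void main(String[] args) throws IOException {
        Map<String, String> context = new TreeMap<>();
        context.put("example.last.value", "42"); // assumed key, for illustration only
        String stored = serialize(context);
        Map<String, String> restored = deserialize(stored);
        System.out.println(restored.equals(context));
    }
}
```

The point of the sketch is only that the values round-trip through storage the project controls, so they do not depend on what Hadoop returns for a finished job.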



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)