You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flume.apache.org by "Ryan Blue (JIRA)" <ji...@apache.org> on 2014/03/08 02:05:45 UTC

[jira] [Updated] (FLUME-2343) Add user impersonation to DatasetSink

     [ https://issues.apache.org/jira/browse/FLUME-2343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ryan Blue updated FLUME-2343:
-----------------------------

    Attachment: FLUME-2343-1.patch

This patch adds user impersonation using a utility class for kerberos functions, KerberosUtil.

I originally added a `DatasetWriter` implementation that ran all FS operations as the proxy user, but apparently that isn't necessary. The `FileSystem` instance must be created with the desired privileges and all actions using that `FileSystem` will use the privileges it was created with (also see [this discussion|http://comments.gmane.org/gmane.comp.jakarta.lucene.hadoop.user/32795]). Only the `Dataset` loading must be wrapped by `doAs`, which doesn't agree with the HDFS implementation but works when I test against a quickstart VM.

> Add user impersonation to DatasetSink
> -------------------------------------
>
>                 Key: FLUME-2343
>                 URL: https://issues.apache.org/jira/browse/FLUME-2343
>             Project: Flume
>          Issue Type: Improvement
>          Components: Sinks+Sources
>    Affects Versions: v1.4.0
>            Reporter: Ryan Blue
>            Assignee: Ryan Blue
>             Fix For: v1.5.0
>
>         Attachments: FLUME-2343-1.patch
>
>
> Users creating datasets with the DatasetSink rather than the HDFS sink will still need user impersonation and authentication to HDFS using a flume principal.



--
This message was sent by Atlassian JIRA
(v6.2#6252)