You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "bb (Jira)" <ji...@apache.org> on 2019/10/01 20:10:00 UTC

[jira] [Created] (ARROW-6756) Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`

bb created ARROW-6756:
-------------------------

             Summary: Include HDFS `getfacl` in `pyarrow.hdfs.HadoopFileSystem`
                 Key: ARROW-6756
                 URL: https://issues.apache.org/jira/browse/ARROW-6756
             Project: Apache Arrow
          Issue Type: Wish
          Components: Python
    Affects Versions: 0.13.0
            Reporter: bb


Extended HDFS filesystem attributes such as role-based access through Cloudera's Sentry are exposed through the `getfacl` command.

It would be immensely help to have this exposed through the `pyarrow.hdfs.HadoopFileSystem`.

Link to the official Hadoop docs where this is discussed in more detail:

[https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/FileSystemShell.html#getfacl]

Sample output from the *nix shell:

```$ hadoop fs -getfacl /path/to/hdfs/dir
# file: /path/to/hdfs/dir
# owner: hive
# group: hive
user::rwx
group:unix_group_with_acl_privs_defined:rwx
group::---
user:hive:rwx
group:hive:rwx
mask::rwx
other::--x```



--
This message was sent by Atlassian Jira
(v8.3.4#803005)