You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@impala.apache.org by "Aman Sinha (Jira)" <ji...@apache.org> on 2020/12/28 00:55:00 UTC

[jira] [Resolved] (IMPALA-2920) Provide an HDFS pseudotable in Impala

     [ https://issues.apache.org/jira/browse/IMPALA-2920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aman Sinha resolved IMPALA-2920.
--------------------------------
    Resolution: Won't Fix

Quite a lot of the file system related information can be retrieved through various SHOW and DESCRIBE commands.  e.g SHOW FILES IN [database_name.]table_name.  It is not clear if providing a SQL interface to query HDFS pseudo tables will add much additional value. 

> Provide an HDFS pseudotable in Impala
> -------------------------------------
>
>                 Key: IMPALA-2920
>                 URL: https://issues.apache.org/jira/browse/IMPALA-2920
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Frontend
>    Affects Versions: Impala 2.2.4
>            Reporter: Joe Slagel
>            Priority: Minor
>              Labels: impala
>
> Would it be possible to implement some sort of pseudo table(s) in Impala that one could query to get HDFS information?  For example, instead of having to do some sort of "hdfs dfs -ls *" from the command line (or through an hdfs api), cut, sort, filter and dump to a file to operate on, it'd be nice to do something like select filename from hdfs where path like '/tmp/some/path/%'.  Similarly you could then run queries to do things like group by files in directories, sizes of files, etc.  The possibilities are pretty endless of how useful it could be.  And all easily done through the Hue interface.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)