You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Zoltán Borók-Nagy (Jira)" <ji...@apache.org> on 2022/06/23 12:49:00 UTC

[jira] [Updated] (IMPALA-801) Add function or virtual column for file name

     [ https://issues.apache.org/jira/browse/IMPALA-801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Zoltán Borók-Nagy updated IMPALA-801:
-------------------------------------
    Epic Link: IMPALA-11386

> Add function or virtual column for file name
> --------------------------------------------
>
>                 Key: IMPALA-801
>                 URL: https://issues.apache.org/jira/browse/IMPALA-801
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Catalog
>    Affects Versions: Impala 1.2.3
>            Reporter: Udai Kiran Potluri
>            Assignee: Zoltán Borók-Nagy
>            Priority: Minor
>              Labels: built-in-function, impala-iceberg, ramp-up
>             Fix For: Impala 4.2.0
>
>
> Hive can list the data files in a table. For eg the following query lists all the data files for the table or partition:
> {noformat}
> select INPUT__FILE__NAME, count(*) from <table_name> where dt='20140210' group by INPUT__FILE__NAME;
> {noformat}
> This has two advantages over the existing "show files" functionality:
> * The output can be used in arbitrary SQL statements.
> * You can see which record came from which file.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org