Posted to issues@spark.apache.org by "Yin Huai (JIRA)" <ji...@apache.org> on 2016/09/20 20:54:21 UTC

[jira] [Updated] (SPARK-17613) PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at parent

     [ https://issues.apache.org/jira/browse/SPARK-17613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Yin Huai updated SPARK-17613:
-----------------------------
    Target Version/s: 2.0.1, 2.1.0

> PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at parent
> ---------------------------------------------------------------------------------
>
>                 Key: SPARK-17613
>                 URL: https://issues.apache.org/jira/browse/SPARK-17613
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.0.0
>            Reporter: Burak Yavuz
>
> Suppose you have a bucket at
> {code}
> s3a://some-bucket
> {code}
> and under it you have files:
> {code}
> s3a://some-bucket/file1.parquet
> s3a://some-bucket/file2.parquet
> {code}
> Getting the parent path of {code}s3a://some-bucket/file1.parquet{code}
> yields
> {code}s3a://some-bucket/{code}
> (with a trailing slash), and the ListingFileCatalog uses this as the key in its hash map.
> When catalog.allFiles is called, we look up {code}s3a://some-bucket{code} (no trailing slash) to get the list of files. That key does not match the one stored in the map, so we're left with an empty list!
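
The mismatch described above can be reproduced with a small stand-alone model. This is only a sketch under stated assumptions: it uses a plain Python dict keyed by strings rather than Spark's catalog classes, and the helper names parent_of and normalize are hypothetical, introduced here for illustration. Normalizing the trailing slash is one possible way to make the two spellings agree, not necessarily the fix adopted in Spark.

```python
# Simplified model of the lookup described in the issue; NOT Spark's actual code.

def parent_of(file_uri: str) -> str:
    """Hypothetical helper: keep everything up to and including the last '/'."""
    return file_uri[: file_uri.rfind("/") + 1]


def normalize(dir_uri: str) -> str:
    """Hypothetical helper: drop trailing slashes so both spellings share one key."""
    return dir_uri.rstrip("/")


files = [
    "s3a://some-bucket/file1.parquet",
    "s3a://some-bucket/file2.parquet",
]

# Index the files by parent directory; the keys end up with a trailing slash.
by_parent = {}
for f in files:
    by_parent.setdefault(parent_of(f), []).append(f)

# The root path as the caller specified it, without a trailing slash.
root = "s3a://some-bucket"

# The stored key is "s3a://some-bucket/", so this lookup finds nothing.
missed = by_parent.get(root, [])

# Same index, but with the stored keys and the lookup key both normalized.
by_normalized_parent = {}
for f in files:
    by_normalized_parent.setdefault(normalize(parent_of(f)), []).append(f)

found = by_normalized_parent.get(normalize(root), [])

print(missed)
print(found)
```

In this model the first lookup misses only because of the one-character difference between the stored key and the requested key; once both sides go through the same normalization, both files are returned.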



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
