You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2016/09/20 21:49:20 UTC

[jira] [Commented] (SPARK-17613) PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at parent

    [ https://issues.apache.org/jira/browse/SPARK-17613?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15507897#comment-15507897 ] 

Apache Spark commented on SPARK-17613:
--------------------------------------

User 'brkyvz' has created a pull request for this issue:
https://github.com/apache/spark/pull/15169

> PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at parent
> ---------------------------------------------------------------------------------
>
>                 Key: SPARK-17613
>                 URL: https://issues.apache.org/jira/browse/SPARK-17613
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.0.0
>            Reporter: Burak Yavuz
>
> Consider you have a bucket as 
> {code}
> s3a://some-bucket
> {code}
> and under it you have files:
> {code}
> s3a://some-bucket/file1.parquet
> s3a://some-bucket/file2.parquet
> {code}
> Getting the parent path of {code}s3a://some-bucket/file1.parquet{code}
> yields
> {code}s3a://some-bucket/{code}
> and the ListingFileCatalog uses this as the key in the hash map.
> When catalog.allFiles is called, we use {code}s3a://some-bucket{code} (no slash at the end) to get the list of files, and we're left with an empty list!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org