You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Burak Yavuz (JIRA)" <ji...@apache.org> on 2016/09/20 20:00:22 UTC
[jira] [Created] (SPARK-17613)
PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at
parent
Burak Yavuz created SPARK-17613:
-----------------------------------
Summary: PartitioningAwareFileCatalog.allFiles doesn't handle URI specified path at parent
Key: SPARK-17613
URL: https://issues.apache.org/jira/browse/SPARK-17613
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 2.0.0
Reporter: Burak Yavuz
Consider you have a bucket as
{code}
s3a://some-bucket
{code}
and under it you have files:
{code}
s3a://some-bucket/file1.parquet
s3a://some-bucket/file2.parquet
{code}
Getting the parent path of {code}s3a://some-bucket/file1.parquet{code}
yields
{code}s3a://some-bucket/{code}
and the ListingFileCatalog uses this as the key in the hash map.
When catalog.allFiles is called, we use {code}s3a://some-bucket{code} (no slash at the end) to get the list of files, and we're left with an empty list!
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org