You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Tathagata Das (JIRA)" <ji...@apache.org> on 2016/04/29 04:04:12 UTC

[jira] [Created] (SPARK-14997) Files in subdirectories are incorrectly considered in sqlContext.read.json()

Tathagata Das created SPARK-14997:
-------------------------------------

             Summary: Files in subdirectories are incorrectly considered in sqlContext.read.json()
                 Key: SPARK-14997
                 URL: https://issues.apache.org/jira/browse/SPARK-14997
             Project: Spark
          Issue Type: Bug
            Reporter: Tathagata Das


Lets says there are json files in the following directories structure

xyz/file0.json
xyz/subdir1/file1.json
xyz/subdir2/file2.json
xyz/subdir1/subsubdir1/file3.json

sqlContext.read.json("xyz") should read only file0.json according to behavior in Spark 1.6.1. However in current master, all the 4 files are read. 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org