You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Anshul (JIRA)" <ji...@apache.org> on 2016/09/22 11:09:20 UTC
[jira] [Created] (SPARK-17633) texFile() and wholeTextFiles() count
difference
Anshul created SPARK-17633:
------------------------------
Summary: texFile() and wholeTextFiles() count difference
Key: SPARK-17633
URL: https://issues.apache.org/jira/browse/SPARK-17633
Project: Spark
Issue Type: Bug
Components: Input/Output
Affects Versions: 1.6.2
Environment: Unix/Linux
Reporter: Anshul
sc.textFile() creates an RDD of string from a text file.
After that when count is performed, the line count is correct, but if more than one line is appended to the file manually and counting the same RDD of string increments the output/result only by 1.
But in case of sc.wholeTextFiles() the output/result is correct.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org