You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Hingorani, Vineet (JIRA)" <ji...@apache.org> on 2014/08/25 15:52:57 UTC

[jira] [Created] (SPARK-3202) Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD

Hingorani, Vineet created SPARK-3202:
----------------------------------------

             Summary: Manipulating columns in CSV file or Transpose of Array[Array[String]] RDD
                 Key: SPARK-3202
                 URL: https://issues.apache.org/jira/browse/SPARK-3202
             Project: Spark
          Issue Type: Documentation
          Components: Documentation
            Reporter: Hingorani, Vineet


Hello all,

Could someone help me with the manipulation of csv file data. I have 'semicolon' separated csv data including doubles and strings. I want to calculate the maximum/average of a column. When I read the file using sc.textFile(test.csv).map(_.split(";"), each field is read as string. Could someone help me with the above manipulation and how to do that.

Or may be if there is some way to take the transpose of the data and then manipulating the rows in some way?

Thank you in advance, I am struggling with this thing for quite sometime

Regards,
Vineet



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org