You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2018/10/19 03:16:00 UTC

[jira] [Resolved] (SPARK-25493) CRLF Line Separators don't work in multiline CSVs

     [ https://issues.apache.org/jira/browse/SPARK-25493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hyukjin Kwon resolved SPARK-25493.
----------------------------------
       Resolution: Fixed
    Fix Version/s: 3.0.0

Issue resolved by pull request 22503
[https://github.com/apache/spark/pull/22503]

> CRLF Line Separators don't work in multiline CSVs
> -------------------------------------------------
>
>                 Key: SPARK-25493
>                 URL: https://issues.apache.org/jira/browse/SPARK-25493
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.1
>            Reporter: Justin Uang
>            Assignee: Justin Uang
>            Priority: Major
>             Fix For: 3.0.0
>
>
> CSVs with windows style crlf (carriage return line feed) don't work in multiline mode. They work fine in single line mode because the line separation is done by Hadoop, which can handle all the different types of line separators. In multiline mode, the Univocity parser is used to also handle splitting of records.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org