Posted to issues@spark.apache.org by "Xiangrui Meng (JIRA)" <ji...@apache.org> on 2014/09/01 07:10:20 UTC

[jira] [Resolved] (SPARK-3205) input format for text records saved with in-record delimiter and newline characters escaped

     [ https://issues.apache.org/jira/browse/SPARK-3205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Xiangrui Meng resolved SPARK-3205.
----------------------------------
    Resolution: Later

Moved the implementation to https://github.com/mengxr/redshift-input-format. If people feel this input format is common, we can move it to Spark Core later.

> input format for text records saved with in-record delimiter and newline characters escaped
> -------------------------------------------------------------------------------------------
>
>                 Key: SPARK-3205
>                 URL: https://issues.apache.org/jira/browse/SPARK-3205
>             Project: Spark
>          Issue Type: New Feature
>          Components: Spark Core, SQL
>            Reporter: Xiangrui Meng
>            Assignee: Xiangrui Meng
>
> Text records may contain in-record delimiter or newline characters. In such cases, we can either encode or escape them. Escaping is simpler, and it is what Redshift's UNLOAD does with the ESCAPE option. The problem is that an escaped record can span multiple lines, so a line-based input format would split it in the wrong places. We need an input format that handles such records.
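
To illustrate the parsing problem described above, here is a minimal Python sketch. It is not the code from the linked repository and not a Hadoop/Spark InputFormat. It assumes a backslash escape character and a "|" field delimiter, and the function name split_escaped_records is hypothetical. It only shows why a record must be split on unescaped newlines rather than on every newline.

```python
def split_escaped_records(text, delimiter="|", escape="\\"):
    """Split text into records (lists of fields), honoring escapes.

    Assumption for illustration: any character that follows the escape
    character is taken literally, so an escaped delimiter or newline
    stays inside the current field.
    """
    records = []
    fields = []
    current = []
    i = 0
    n = len(text)
    while i < n:
        ch = text[i]
        if ch == escape and i + 1 < n:
            # Escaped character: keep it as field content, skip the escape.
            current.append(text[i + 1])
            i += 2
            continue
        if ch == delimiter:
            # Unescaped delimiter ends the current field.
            fields.append("".join(current))
            current = []
        elif ch == "\n":
            # Unescaped newline ends the current record.
            fields.append("".join(current))
            records.append(fields)
            fields, current = [], []
        else:
            current.append(ch)
        i += 1
    # Flush a final record that has no trailing newline.
    if current or fields:
        fields.append("".join(current))
        records.append(fields)
    return records


if __name__ == "__main__":
    # Two records; the first contains an escaped "|" and an escaped newline.
    data = "a\\|b|c\\\nd\ne|f\n"
    for record in split_escaped_records(data):
        print(record)
```

A real input format would also have to cope with file splits that begin in the middle of a record, which this in-memory sketch does not attempt.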



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
