Posted to commits@nifi.apache.org by "Karthik Narayanan (JIRA)" <ji...@apache.org> on 2016/04/30 06:48:12 UTC

[jira] [Commented] (NIFI-436) SplitText should allow changing the endline regex

    [ https://issues.apache.org/jira/browse/NIFI-436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15265152#comment-15265152 ] 

Karthik Narayanan commented on NIFI-436:
----------------------------------------

Hi Sönke Liebau, I tried to tackle this issue. I did not think Auto Detect was a good idea, especially if \n is embedded in the header or column values: it would be difficult to guess whether it is part of the data or the metadata. So I set the property to default to '\n'. Users will have the option to select either '\r' or '\r\n' instead.
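
To illustrate why a fixed, user-selected line ending matters here, the following is a minimal sketch and not the actual SplitText change. The class name LineEndingSplitSketch and the helper splitOn are hypothetical and exist only for this example. It splits a CRLF-terminated CSV that has a bare LF inside a quoted value, once on '\r\n' and once on '\n'.

```java
import java.util.ArrayList;
import java.util.List;

public class LineEndingSplitSketch {

    // Hypothetical helper (not part of NiFi): split text on one fixed
    // line-ending string, such as "\n", "\r" or "\r\n".
    static List<String> splitOn(String text, String lineEnding) {
        List<String> lines = new ArrayList<>();
        int start = 0;
        int idx;
        while ((idx = text.indexOf(lineEnding, start)) >= 0) {
            lines.add(text.substring(start, idx));
            start = idx + lineEnding.length();
        }
        if (start < text.length()) {
            // Keep a trailing record that has no terminator.
            lines.add(text.substring(start));
        }
        return lines;
    }

    public static void main(String[] args) {
        // Records end with CRLF; the quoted comment contains a bare LF.
        String csv = "id,comment\r\n1,\"first\nsecond\"\r\n2,plain\r\n";

        // Splitting on CRLF keeps the embedded LF inside its record.
        System.out.println(splitOn(csv, "\r\n").size()); // prints 3

        // Splitting on LF breaks the quoted comment into two pieces.
        System.out.println(splitOn(csv, "\n").size());   // prints 4
    }
}
```

With '\r\n' selected, the record containing the embedded '\n' stays intact. With the default '\n' it is cut in two, which is the behaviour reported in the issue.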

> SplitText should allow changing the endline regex
> -------------------------------------------------
>
>                 Key: NIFI-436
>                 URL: https://issues.apache.org/jira/browse/NIFI-436
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Jon Parise
>            Assignee: Sönke Liebau
>              Labels: beginner
>
> I have a CSV file in a format that indicates the end of a line with a CRLF. This file has embedded comments that have LF in them.
> When I run this file through the split text processor, it is splitting at the LF characters.
> I think it would be nice to have a setting to change the line ending characters for splitting text.
> I can't find anything in the documentation that indicates how I would change this behavior, so I assume it does not exist.
> Also, I would be willing to try and implement this improvement, but I can't seem to find the source for the SplitTextProcessor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)