You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Randy Gelhausen (JIRA)" <ji...@apache.org> on 2017/07/02 03:29:00 UTC

[jira] [Created] (NIFI-4146) SplitRecord does not gracefully convert medium sized CSV into individual FlowFiles

Randy Gelhausen created NIFI-4146:
-------------------------------------

             Summary: SplitRecord does not gracefully convert medium sized CSV into individual FlowFiles
                 Key: NIFI-4146
                 URL: https://issues.apache.org/jira/browse/NIFI-4146
             Project: Apache NiFi
          Issue Type: Bug
          Components: Core Framework
            Reporter: Randy Gelhausen
         Attachments: flow.xml.gz, nifi-app.log, ubuntu.nifi-app.log

SplitRecord fails to split a ~= 150k line (57 Mb) CSV file into individual FlowFiles.

This could be configuration issues, but with a build from master today, I run into problems out of the box on macOS and Linux: 

On macOS Sierra, I get a too many open files error (See attached nifi-app.log). On Ubuntu 17.04, I get OOMs (See attached ubuntu.nifi-app.log) and the Web UI fails.

The CSV file I'm using is available [here|https://opendata.arcgis.com/datasets/229220ee14c147659e1049bd517c0b78_16.csv] and I've attached the flow: [^flow.xml.gz].



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)