You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Kenneth Knowles (JIRA)" <ji...@apache.org> on 2017/06/26 17:49:00 UTC

[jira] [Created] (BEAM-2516) User reports 4 minutes to process 1 million line CSV in DirectRunner

Kenneth Knowles created BEAM-2516:
-------------------------------------

             Summary: User reports 4 minutes to process 1 million line CSV in DirectRunner
                 Key: BEAM-2516
                 URL: https://issues.apache.org/jira/browse/BEAM-2516
             Project: Beam
          Issue Type: Bug
          Components: runner-direct
            Reporter: Kenneth Knowles
            Assignee: Thomas Groh
            Priority: Minor


https://stackoverflow.com/questions/44736414/simple-apache-beam-manipulations-work-very-slow

I don't know what the expectation are here, so I wasn't ready to say this is WAI. Low priority since it isn't what the runner is for anyhow, but this seems like the scale of data that should be snappy. Worth investigating, or maybe you can quickly indicate why it is expected?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)