You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Ankur Goenka (JIRA)" <ji...@apache.org> on 2017/12/05 19:57:00 UTC

[jira] [Updated] (BEAM-3189) Python Fnapi - Worker speedup

     [ https://issues.apache.org/jira/browse/BEAM-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ankur Goenka updated BEAM-3189:
-------------------------------
    External issue URL: https://docs.google.com/document/d/1mHFaNgHA71RVGLVNrGrHIlWHgJb4tKCJ2qQzS13REY8

> Python Fnapi - Worker speedup
> -----------------------------
>
>                 Key: BEAM-3189
>                 URL: https://issues.apache.org/jira/browse/BEAM-3189
>             Project: Beam
>          Issue Type: Improvement
>          Components: sdk-py-harness
>    Affects Versions: 2.3.0
>            Reporter: Ankur Goenka
>            Assignee: Ankur Goenka
>            Priority: Minor
>              Labels: performance, portability
>
> Beam Python SDK is couple of magnitude slower than Java SDK when it comes to stream processing.
> There are two related issues:
> # Given a single core, currently we are not fully utilizing the core because the single thread spends a lot of time on the IO. This is more of a limitation of our implementation rather than a limitation of Python.
> # Given a machine with multiple cores, single Python process could only utilize one core.
> In this task we will only address 1. 2 will be good for future optimization.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)