You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Ankur Goenka (JIRA)" <ji...@apache.org> on 2017/12/05 19:57:00 UTC
[jira] [Updated] (BEAM-3189) Python Fnapi - Worker speedup
[ https://issues.apache.org/jira/browse/BEAM-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ankur Goenka updated BEAM-3189:
-------------------------------
External issue URL: https://docs.google.com/document/d/1mHFaNgHA71RVGLVNrGrHIlWHgJb4tKCJ2qQzS13REY8
> Python Fnapi - Worker speedup
> -----------------------------
>
> Key: BEAM-3189
> URL: https://issues.apache.org/jira/browse/BEAM-3189
> Project: Beam
> Issue Type: Improvement
> Components: sdk-py-harness
> Affects Versions: 2.3.0
> Reporter: Ankur Goenka
> Assignee: Ankur Goenka
> Priority: Minor
> Labels: performance, portability
>
> Beam Python SDK is couple of magnitude slower than Java SDK when it comes to stream processing.
> There are two related issues:
> # Given a single core, currently we are not fully utilizing the core because the single thread spends a lot of time on the IO. This is more of a limitation of our implementation rather than a limitation of Python.
> # Given a machine with multiple cores, single Python process could only utilize one core.
> In this task we will only address 1. 2 will be good for future optimization.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)