You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/09/03 15:16:00 UTC

[jira] [Work logged] (BEAM-7945) Allow runner to configure "semi_persist_dir" which is used in the SDK harness

     [ https://issues.apache.org/jira/browse/BEAM-7945?focusedWorklogId=305641&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-305641 ]

ASF GitHub Bot logged work on BEAM-7945:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 03/Sep/19 15:15
            Start Date: 03/Sep/19 15:15
    Worklog Time Spent: 10m 
      Work Description: mxm commented on pull request #9452: [BEAM-7945] Allow runner to configure semi_persist_dir which is used …
URL: https://github.com/apache/beam/pull/9452#discussion_r320328068
 
 

 ##########
 File path: model/fn-execution/src/main/proto/beam_fn_api.proto
 ##########
 @@ -815,6 +815,7 @@ message StartWorkerRequest {
   org.apache.beam.model.pipeline.v1.ApiServiceDescriptor logging_endpoint = 3;
   org.apache.beam.model.pipeline.v1.ApiServiceDescriptor artifact_endpoint = 4;
   org.apache.beam.model.pipeline.v1.ApiServiceDescriptor provision_endpoint = 5;
+  string semi_persist_dir = 6;
 
 Review comment:
   Should this be dynamic or rather configured up front for the worker pool?
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 305641)
    Time Spent: 0.5h  (was: 20m)

> Allow runner to configure "semi_persist_dir" which is used in the SDK harness
> -----------------------------------------------------------------------------
>
>                 Key: BEAM-7945
>                 URL: https://issues.apache.org/jira/browse/BEAM-7945
>             Project: Beam
>          Issue Type: Sub-task
>          Components: java-fn-execution, sdk-go, sdk-java-core, sdk-py-core
>            Reporter: sunjincheng
>            Assignee: sunjincheng
>            Priority: Major
>             Fix For: 2.16.0
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Currently "semi_persist_dir" is not configurable. This may become a problem in certain scenarios. For example, the default value of "semi_persist_dir" is "/tmp" ([https://github.com/apache/beam/blob/master/sdks/python/container/boot.go#L48]) in Python SDK harness. When the environment type is "PROCESS", the disk of "/tmp" may be filled up and unexpected issues will occur in production environment. We should provide a way to configure "semi_persist_dir" in EnvironmentFactory at the runner side. 



--
This message was sent by Atlassian Jira
(v8.3.2#803003)