Posted to commits@beam.apache.org by "Ben Sidhom (JIRA)" <ji...@apache.org> on 2018/04/13 01:03:00 UTC

[jira] [Created] (BEAM-4063) Flink runner supports cluster-wide artifact deployments through the Distributed Cache

Ben Sidhom created BEAM-4063:
--------------------------------

             Summary: Flink runner supports cluster-wide artifact deployments through the Distributed Cache
                 Key: BEAM-4063
                 URL: https://issues.apache.org/jira/browse/BEAM-4063
             Project: Beam
          Issue Type: New Feature
          Components: runner-flink
            Reporter: Ben Sidhom
            Assignee: Aljoscha Krettek


Currently, Flink effectively depends on an external storage system for artifact management. The Flink Distributed Cache does not actually distribute and cache blobs itself; instead, it expects every node in a running cluster to have access to a well-known artifact resource.
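
To make the limitation concrete, here is a minimal toy model in Python. It is not Flink or Beam code: the `Worker` class, the `register_cached_file` helper, and the URIs are all hypothetical and only illustrate the behavior described above, where registration records a location and each node must be able to read that location on its own.

```python
class Worker:
    """Toy stand-in for a cluster node (hypothetical, not a Flink class)."""

    def __init__(self, reachable_storage):
        # URI -> bytes for every storage location this node can read.
        self.reachable_storage = reachable_storage
        self.local_cache = {}

    def fetch(self, registry, name):
        """Resolve a registered name to its URI and pull the bytes locally."""
        uri = registry[name]
        if uri not in self.reachable_storage:
            # Nothing shipped the blob to this node, so the lookup fails.
            raise FileNotFoundError(f"{uri} is not reachable from this node")
        self.local_cache[name] = self.reachable_storage[uri]
        return self.local_cache[name]


def register_cached_file(registry, name, uri):
    """Record only the name -> URI mapping; no artifact bytes are shipped."""
    registry[name] = uri
```

In this sketch, a worker whose `reachable_storage` contains a shared URI can fetch the artifact, while an artifact registered under a client-local URI raises `FileNotFoundError` on the worker. That is the gap that shared storage currently has to fill.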

We should get this for free once [https://github.com/apache/flink/pull/5580] is merged (likely in Flink 1.5). Until then, we will have to rely on external storage systems such as GCS or HDFS.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)