You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by Oleksandr Nitavskyi <o....@criteo.com> on 2018/07/02 12:30:44 UTC

CoreOptions.TMP_DIRS bug

Hello guys,

We have discovered minor issue with Flink 1.5 on YARN particularly which was related with the way Flink manages temp paths (io.tmp.dirs
) in configuration: https://ci.apache.org/projects/flink/flink-docs-master/ops/config.html#io-tmp-dirs


1.       From what we can see in the code, default option doesn’t correspond to reality on YARN or on Mesos deployments. Looks like it equals to env variable ‘_FLINK_TMP_DIR’ on Mesos and to `LOCAL_DIRS` on Yarn.

2.       The issue on Yarn is that it is impossible to have different LOCAL_DIRS on JobManager and TaskManager, despite LOCAL_DIRS value depends on the container.

The issue is that CoreOptions.TMP_DIRS is configured to the default value during JobManager initialization and added to the configuration object. When TaskManager is launched the appropriate configuration object is cloned with LOCAL_DIRS which makes sense only for Job Manager container. When YARN container with TaskManager from his point of view CoreOptions.TMP_DIRS is always equal either to path in flink.yml or to the or to the LOCAL_DIRS of Job Manager (default behaviour). Is TaskManager’s container do not have an access to another folders, that folders allocated by YARN TaskManager cannot be started.

Could you please confirm that it is a bug and I will create a Jira ticket to track it?

Thanks
Kind Regards
Oleksandr Nitavskyi



Re: CoreOptions.TMP_DIRS bug

Posted by Till Rohrmann <tr...@apache.org>.
Hi Oleksandr,

sorry for my late reply. This sounds indeed like a bug. Thanks for opening
FLINK-9762 and the corresponding PR. I'll give it a review shortly so that
we can include it in Flink 1.6.

Cheers,
Till

On Mon, Jul 2, 2018 at 2:31 PM Oleksandr Nitavskyi <o....@criteo.com>
wrote:

> Hello guys,
>
> We have discovered minor issue with Flink 1.5 on YARN particularly which
> was related with the way Flink manages temp paths (io.tmp.dirs
> ) in configuration:
> https://ci.apache.org/projects/flink/flink-docs-master/ops/config.html#io-tmp-dirs
>
>
> 1.       From what we can see in the code, default option doesn’t
> correspond to reality on YARN or on Mesos deployments. Looks like it equals
> to env variable ‘_FLINK_TMP_DIR’ on Mesos and to `LOCAL_DIRS` on Yarn.
>
> 2.       The issue on Yarn is that it is impossible to have different
> LOCAL_DIRS on JobManager and TaskManager, despite LOCAL_DIRS value depends
> on the container.
>
> The issue is that CoreOptions.TMP_DIRS is configured to the default value
> during JobManager initialization and added to the configuration object.
> When TaskManager is launched the appropriate configuration object is cloned
> with LOCAL_DIRS which makes sense only for Job Manager container. When YARN
> container with TaskManager from his point of view CoreOptions.TMP_DIRS is
> always equal either to path in flink.yml or to the or to the LOCAL_DIRS of
> Job Manager (default behaviour). Is TaskManager’s container do not have an
> access to another folders, that folders allocated by YARN TaskManager
> cannot be started.
>
> Could you please confirm that it is a bug and I will create a Jira ticket
> to track it?
>
> Thanks
> Kind Regards
> Oleksandr Nitavskyi
>
>
>