You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Xuannan Su (Jira)" <ji...@apache.org> on 2022/08/15 06:16:00 UTC
[jira] [Created] (FLINK-28964) Release Testing: Verify FLIP-205 Cache in DataStream for Batch Processing
Xuannan Su created FLINK-28964:
----------------------------------
Summary: Release Testing: Verify FLIP-205 Cache in DataStream for Batch Processing
Key: FLINK-28964
URL: https://issues.apache.org/jira/browse/FLINK-28964
Project: Flink
Issue Type: Sub-task
Components: API / DataStream
Reporter: Xuannan Su
Fix For: 1.16.0
DataStream API provides the `cache` method to cache the result of a DataStream and reuse it in later jobs with batch execution mode.
I think we should verify:
# Follow the doc to write a Flink job that produces cache and a job that consumes cache and submit it to a session cluster(standalone or yarn).
# You can remove the source physically after the cache-producing job is finished to verify that the cache-consuming job is not reading from the source. For example, delete the file in the filesystem if you are using a file source.
# You can restart the TaskManager after the cache-producing job is finished to verify that the cache-consuming job will re-compute the result.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)