You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@flink.apache.org by "Xuannan Su (Jira)" <ji...@apache.org> on 2022/08/15 06:16:00 UTC

[jira] [Created] (FLINK-28964) Release Testing: Verify FLIP-205 Cache in DataStream for Batch Processing

Xuannan Su created FLINK-28964:
----------------------------------

             Summary: Release Testing: Verify FLIP-205 Cache in DataStream for Batch Processing
                 Key: FLINK-28964
                 URL: https://issues.apache.org/jira/browse/FLINK-28964
             Project: Flink
          Issue Type: Sub-task
          Components: API / DataStream
            Reporter: Xuannan Su
             Fix For: 1.16.0


DataStream API provides the `cache` method to cache the result of a DataStream and reuse it in later jobs with batch execution mode.

I think we should verify:
 # Follow the doc to write a Flink job that produces cache and a job that consumes cache and submit it to a session cluster(standalone or yarn).
 # You can remove the source physically after the cache-producing job is finished to verify that the cache-consuming job is not reading from the source. For example, delete the file in the filesystem if you are using a file source. 
 # You can restart the TaskManager after the cache-producing job is finished to verify that the cache-consuming job will re-compute the result.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)