Posted to commits@samza.apache.org by "Lakshmi Manasa Gaduputi (Jira)" <ji...@apache.org> on 2022/02/18 23:50:00 UTC
[jira] [Commented] (SAMZA-2689) [Elasticity] introduce ElasticJobModel which has SystemStreamPartitionKeyHash
[ https://issues.apache.org/jira/browse/SAMZA-2689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17494851#comment-17494851 ]
Lakshmi Manasa Gaduputi commented on SAMZA-2689:
------------------------------------------------
[https://github.com/apache/samza/pull/1580] merged
> [Elasticity] introduce ElasticJobModel which has SystemStreamPartitionKeyHash
> -----------------------------------------------------------------------------
>
> Key: SAMZA-2689
> URL: https://issues.apache.org/jira/browse/SAMZA-2689
> Project: Samza
> Issue Type: Sub-task
> Reporter: Lakshmi Manasa Gaduputi
> Assignee: Lakshmi Manasa Gaduputi
> Priority: Major
> Time Spent: 4h 40m
> Remaining Estimate: 0h
>
> # For a job with elasticity factor = X, each input SystemStreamPartition (SSP) is split into X parts, called SystemStreamPartitionKeyHashes (SSPKh). ElasticJobModel holds elastic task models: each original task (in the original JobModel) now has X elastic tasks, each of which consumes one SSPKh of the original task's SSP.
> # Create an ElasticJobModel and thereby assign elastic tasks to containers. To achieve this, the groupers that group the input SSPs into task models need to be updated.
> # The ElasticJobModel created in the earlier sub-task creates X elastic tasks for each original task, where elasticity.factor=X. Each elastic task consumes one SSPKh, i.e. 1/X of the input SSP. This sub-task ensures that each elastic task actually processes only the messages within its SSPKh.
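
The splitting described above can be illustrated with a minimal sketch. It is not taken from the Samza code base or from the linked pull request: the class ElasticitySketch, its methods, the "_<bucket>" task naming, and the use of hashCode() modulo X to pick a key bucket are all assumptions made here for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: every name in this class is hypothetical, not Samza's API. */
final class ElasticitySketch {

    /**
     * Maps a message key to one of elasticityFactor buckets within its SSP.
     * Assumption: the bucket is the key's hashCode modulo the factor.
     */
    static int keyBucket(Object messageKey, int elasticityFactor) {
        if (elasticityFactor < 1) {
            throw new IllegalArgumentException("elasticity factor must be >= 1");
        }
        int hash = (messageKey == null) ? 0 : messageKey.hashCode();
        // floorMod keeps the result in [0, elasticityFactor) for negative hashes.
        return Math.floorMod(hash, elasticityFactor);
    }

    /** Derives X elastic task names from one original task (naming is assumed). */
    static List<String> elasticTaskNames(String originalTask, int elasticityFactor) {
        if (elasticityFactor < 1) {
            throw new IllegalArgumentException("elasticity factor must be >= 1");
        }
        List<String> names = new ArrayList<>();
        for (int bucket = 0; bucket < elasticityFactor; bucket++) {
            names.add(originalTask + "_" + bucket);
        }
        return names;
    }

    /** True if the elastic task that owns ownedBucket should process the message. */
    static boolean shouldProcess(Object messageKey, int ownedBucket, int elasticityFactor) {
        return keyBucket(messageKey, elasticityFactor) == ownedBucket;
    }
}
```

Under these assumptions, every message key falls into exactly one of the X buckets, so exactly one of the X elastic tasks derived from an original task accepts a given message and the others skip it.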
--
This message was sent by Atlassian Jira
(v8.20.1#820001)