You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@samza.apache.org by "ASF GitHub Bot (JIRA)" <ji...@apache.org> on 2018/07/19 05:44:00 UTC
[jira] [Commented] (SAMZA-1773) Side inputs for local stores
[ https://issues.apache.org/jira/browse/SAMZA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16548817#comment-16548817 ]
ASF GitHub Bot commented on SAMZA-1773:
---------------------------------------
GitHub user bharathkk opened a pull request:
https://github.com/apache/samza/pull/570
SAMZA-1773: Side inputs for local stores
@prateekm @vjagadish
Please take a look.
I will update the PR with the unit tests for SideInputStorageManager and the util functions.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/bharathkk/samza side-input-v3
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/samza/pull/570.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #570
----
commit 5393812b8ac0e67febaa9641e2db89073a17f434
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-10T00:49:51Z
Initial prototype for side input
commit 9ebdc1a15d50ae97519a4ba04a38677a34342056
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-12T23:36:24Z
Consolidated the side input logic into TaskStorageManager
commit fa793cb92b4e71fea2e6d21dfd8e23966f2f9ffd
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-13T00:38:12Z
Include the initialization of starting offsets for side input stores
commit 34675938cbf4e43c374038fd920fef08617d1d56
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-13T23:01:41Z
Alternate prototype for side input.
commit c6fd64fc0b45e5b7148166f19305d2027b75fa86
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-13T23:15:57Z
Extracted getting the start offset in TaskInstance to a method
commit fbc808f78308ba611ae199fb95e34f236cafab03
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-14T01:01:48Z
Merge the task context impl changes and fix some typos
commit c025eb6b62f5a886242e59c7ba48b93a9f4ff48b
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-16T17:57:55Z
Fix checkstyle issues and add copyrights notes
commit e4f66b32401e08e42db04ef24b884f76970de164
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-16T21:09:53Z
Fix unit tests and apply persistence store check for side input stores as well
commit 9993f76bc93b5e26541b60c67f90d8b08a0b04bc
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-17T21:28:15Z
Make the side input processor factory store level configuration and minor changes to validation
commit cf14cd54df920cc8c9fe6c8de6edc49e7d83c080
Author: Bharath Kumarasubramanian <bk...@...>
Date: 2018-07-19T05:17:22Z
Addressed Prateek's comments, merged his suggestions and added documentation for the new configs
----
> Side inputs for local stores
> ----------------------------
>
> Key: SAMZA-1773
> URL: https://issues.apache.org/jira/browse/SAMZA-1773
> Project: Samza
> Issue Type: Task
> Components: kv-store
> Reporter: Bharath Kumarasubramanian
> Assignee: Bharath Kumarasubramanian
> Priority: Major
>
> We want a way to populate the local store for jobs from a secondary data source (e.g hadoop), but is otherwise read-only. Typical use cases of such stores are scoring data for relevance jobs, etc
>
> The rows (keys) in this dataset are independent of each other, and there's no requirement for the entire dataset to be updated atomically. This means that once the store is bootstrapped, it can be updated in change capture mode in the background while processing continues.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)