You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@gobblin.apache.org by "Zhixiong Chen (JIRA)" <ji...@apache.org> on 2017/09/07 23:09:08 UTC
[jira] [Updated] (GOBBLIN-245) Create topic specific extract for a
WorkUnit in KafkaSource
[ https://issues.apache.org/jira/browse/GOBBLIN-245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zhixiong Chen updated GOBBLIN-245:
----------------------------------
Issue Type: Task (was: Bug)
> Create topic specific extract for a WorkUnit in KafkaSource
> -----------------------------------------------------------
>
> Key: GOBBLIN-245
> URL: https://issues.apache.org/jira/browse/GOBBLIN-245
> Project: Apache Gobblin
> Issue Type: Task
> Reporter: Zhixiong Chen
> Assignee: Zhixiong Chen
> Labels: Bug:Generic, Core:JobManagement, Core:Publisher
>
> runOnce feature of Gobblin does not work correctly. A job without a schedule will be re-run upon Gobblin restart or modifications on the file. Additionally, `*.done` files are not being written.
> Root cause:
> In JobScheduler, Gobblin checks whether the job has a schedule in the method `scheduleJob(Properties, JobListener, Map, Class)` and sets the key `ConfigurationKeys.JOB_RUN_ONCE_KEY` accordingly. On the other hand, the method `scheduleGeneralConfiguredJobs()` checks the key `ConfigurationKeys.JOB_RUN_ONCE_KEY` and if runonce, creates the `RunOnceJobListener` that creates the `*.done` file. However, `scheduleGeneralConfiguredJobs()` is called before `scheduleJob(Properties, JobListener, Map, Class)`, so the property has not been set yet, and the `*.done` file is never written.
> On Gobblin restart, Gobblin checks for presence of done files, and skips jobs that have already been executed. However, the done file is not present, so the job gets repeated.
>
> *Github Url* : https://github.com/linkedin/gobblin/issues/1195
> *Github Reporter* : [~ibuenros]
> *Github Created At* : 2016-08-11T21:03:24Z
> *Github Updated At* : 2017-01-12T04:59:43Z
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)