Posted to user@oozie.apache.org by Marcelo Valle <mv...@redoop.org> on 2013/12/04 11:19:28 UTC
Synchronous Datasets
Hello,
I'm having some trouble using synchronous datasets in Oozie 4.0.0.
My configuration files are below:
coordinator.xml
....
<datasets>
  <dataset name="cuarter" frequency="${coord:minutes(15)}"
           initial-instance="${startTime}" timezone="${timezone}">
    <uri-template>${inputDir}/${YEAR}/${MONTH}/${DAY}/${HOUR}/${MINUTE}</uri-template>
    <done-flag></done-flag>
  </dataset>
</datasets>
<input-events>
  <data-in name="coordInput1" dataset="cuarter">
    <start-instance>${coord:current(-3)}</start-instance>
    <end-instance>${coord:current(0)}</end-instance>
  </data-in>
</input-events>
....
With this dataset I expect the following paths to be included as input:
hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/20
hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/05
hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/12/50
hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/12/35
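
To make that expectation concrete, here is a minimal Python sketch (not Oozie code) of how I understand coord:current(-3) through coord:current(0) to resolve with a 15-minute frequency. It assumes the nominal time of the action is 2013-12-03 13:20 and that it is aligned with the dataset's initial instance; the function name expected_paths and its parameters are hypothetical, used only for illustration.

```python
from datetime import datetime, timedelta


def expected_paths(input_dir, nominal_time, frequency_minutes, start_offset, end_offset):
    """Return one URI per dataset instance, newest first.

    Mirrors the uri-template ${inputDir}/${YEAR}/${MONTH}/${DAY}/${HOUR}/${MINUTE}
    for instance offsets start_offset..end_offset (e.g. -3..0).
    """
    paths = []
    for n in range(end_offset, start_offset - 1, -1):
        # Each offset n shifts the nominal time by n * frequency.
        instance = nominal_time + timedelta(minutes=n * frequency_minutes)
        paths.append(input_dir + "/" + instance.strftime("%Y/%m/%d/%H/%M"))
    return paths


# Assumed nominal time, taken from the newest path I expect to see.
paths = expected_paths(
    "hdfs://mvalle1:8020/data/input/temperatures",
    datetime(2013, 12, 3, 13, 20),
    frequency_minutes=15,
    start_offset=-3,
    end_offset=0,
)
for p in paths:
    print(p)
```

Under those assumptions the sketch prints the four paths listed above.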
But in the job logs I can see that only the first one is included:
<property>
<name>mapreduce.input.fileinputformat.inputdir</name>
<value>hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/20</value>
</property>
Can you please shed some light on this?
Thanks very much!
~ Marcelo