You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@oozie.apache.org by Marcelo Valle <mv...@redoop.org> on 2013/12/04 11:19:28 UTC

Synchronous Datasets

Hello,

I'm having some trouble using syncrhronous datasets in Oozie 4.0.0

My configuration files below:

coordinator.xml

....
<datasets>
         <dataset name="cuarter" frequency="${coord:minutes(15)}"
initial-instance="${startTime}" timezone="${timezone}">

 <uri-template>${inputDir}/${YEAR}/${MONTH}/${DAY}/${HOUR}/${MINUTE}</uri-template>
             <done-flag></done-flag>
         </dataset>
</datasets>
   <input-events>
         <data-in name="coordInput1" dataset="cuarter">
              <start-instance>${coord:current(-3)}</start-instance>
              <end-instance>${coord:current(0)}</end-instance>
         </data-in>
  </input-events>
....

With this dataset I hope the following paths in the dataset:
      hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/20
      hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/05
      hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/12/50
      hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/12/35

but in job logs i can see that only the first one is included:
    <property>
      <name>mapreduce.input.fileinputformat.inputdir</name>

<value>hdfs://mvalle1:8020/data/input/temperatures/2013/12/03/13/20</value>
    </property>

Can you please help me with some light?
Thanks too much!

~ Marcelo