You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hadoop.apache.org by Rex X <dn...@gmail.com> on 2016/01/21 00:43:15 UTC

Hadoop Streaming: How to parition output into subfolders?

Dear all,

To be specific, for example, given

    hadoop jar hadoop-streaming.jar \
      -input myInputDirs \
      -output myOutputDir \
      -mapper /bin/cat \
      -reducer /usr/bin/wc

Where myInputDirs has a *dated* subfolder structure of

       /input_dir/yyyy/mm/dd/part-*

I want myOutputDir has the same *dated* subfolder structure:

       /output_dir/yyyy/mm/dd/part-*

Guess there should be an option to do this. Can "-partitioner" or any "-D"
option make this?


Thanks & regards,
Rex

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Hi Camusensei,

Thank you. That's very helpful!

Rex


On Thu, Jan 21, 2016 at 1:41 AM, Namikaze Minato <ll...@gmail.com>
wrote:

> Hi Rex X,
>
> We are using the -outputFormat <classname> option of hadoop-streaming.
> Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat
>
> Regards,
> Camusensei
>
> On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> > Thank you, Rohit!
> >
> > Any multiple outputs sample code in python?
> >
> > Rex
> >
> >
> > On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> > wrote:
> >>
> >> Hi Rex
> >>
> >> Please explore multiple outputs.
> >>
> >> Regards
> >> Rohit Sarewar
> >>
> >>
> >> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
> >>>
> >>> Dear all,
> >>>
> >>> To be specific, for example, given
> >>>
> >>>     hadoop jar hadoop-streaming.jar \
> >>>       -input myInputDirs \
> >>>       -output myOutputDir \
> >>>       -mapper /bin/cat \
> >>>       -reducer /usr/bin/wc
> >>>
> >>> Where myInputDirs has a dated subfolder structure of
> >>>
> >>>        /input_dir/yyyy/mm/dd/part-*
> >>>
> >>> I want myOutputDir has the same dated subfolder structure:
> >>>
> >>>        /output_dir/yyyy/mm/dd/part-*
> >>>
> >>> Guess there should be an option to do this. Can "-partitioner" or any
> >>> "-D" option make this?
> >>>
> >>>
> >>> Thanks & regards,
> >>> Rex
> >>
> >>
> >
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Hi Camusensei,

Thank you. That's very helpful!

Rex


On Thu, Jan 21, 2016 at 1:41 AM, Namikaze Minato <ll...@gmail.com>
wrote:

> Hi Rex X,
>
> We are using the -outputFormat <classname> option of hadoop-streaming.
> Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat
>
> Regards,
> Camusensei
>
> On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> > Thank you, Rohit!
> >
> > Any multiple outputs sample code in python?
> >
> > Rex
> >
> >
> > On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> > wrote:
> >>
> >> Hi Rex
> >>
> >> Please explore multiple outputs.
> >>
> >> Regards
> >> Rohit Sarewar
> >>
> >>
> >> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
> >>>
> >>> Dear all,
> >>>
> >>> To be specific, for example, given
> >>>
> >>>     hadoop jar hadoop-streaming.jar \
> >>>       -input myInputDirs \
> >>>       -output myOutputDir \
> >>>       -mapper /bin/cat \
> >>>       -reducer /usr/bin/wc
> >>>
> >>> Where myInputDirs has a dated subfolder structure of
> >>>
> >>>        /input_dir/yyyy/mm/dd/part-*
> >>>
> >>> I want myOutputDir has the same dated subfolder structure:
> >>>
> >>>        /output_dir/yyyy/mm/dd/part-*
> >>>
> >>> Guess there should be an option to do this. Can "-partitioner" or any
> >>> "-D" option make this?
> >>>
> >>>
> >>> Thanks & regards,
> >>> Rex
> >>
> >>
> >
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Hi Camusensei,

Thank you. That's very helpful!

Rex


On Thu, Jan 21, 2016 at 1:41 AM, Namikaze Minato <ll...@gmail.com>
wrote:

> Hi Rex X,
>
> We are using the -outputFormat <classname> option of hadoop-streaming.
> Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat
>
> Regards,
> Camusensei
>
> On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> > Thank you, Rohit!
> >
> > Any multiple outputs sample code in python?
> >
> > Rex
> >
> >
> > On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> > wrote:
> >>
> >> Hi Rex
> >>
> >> Please explore multiple outputs.
> >>
> >> Regards
> >> Rohit Sarewar
> >>
> >>
> >> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
> >>>
> >>> Dear all,
> >>>
> >>> To be specific, for example, given
> >>>
> >>>     hadoop jar hadoop-streaming.jar \
> >>>       -input myInputDirs \
> >>>       -output myOutputDir \
> >>>       -mapper /bin/cat \
> >>>       -reducer /usr/bin/wc
> >>>
> >>> Where myInputDirs has a dated subfolder structure of
> >>>
> >>>        /input_dir/yyyy/mm/dd/part-*
> >>>
> >>> I want myOutputDir has the same dated subfolder structure:
> >>>
> >>>        /output_dir/yyyy/mm/dd/part-*
> >>>
> >>> Guess there should be an option to do this. Can "-partitioner" or any
> >>> "-D" option make this?
> >>>
> >>>
> >>> Thanks & regards,
> >>> Rex
> >>
> >>
> >
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Hi Camusensei,

Thank you. That's very helpful!

Rex


On Thu, Jan 21, 2016 at 1:41 AM, Namikaze Minato <ll...@gmail.com>
wrote:

> Hi Rex X,
>
> We are using the -outputFormat <classname> option of hadoop-streaming.
> Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat
>
> Regards,
> Camusensei
>
> On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> > Thank you, Rohit!
> >
> > Any multiple outputs sample code in python?
> >
> > Rex
> >
> >
> > On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> > wrote:
> >>
> >> Hi Rex
> >>
> >> Please explore multiple outputs.
> >>
> >> Regards
> >> Rohit Sarewar
> >>
> >>
> >> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
> >>>
> >>> Dear all,
> >>>
> >>> To be specific, for example, given
> >>>
> >>>     hadoop jar hadoop-streaming.jar \
> >>>       -input myInputDirs \
> >>>       -output myOutputDir \
> >>>       -mapper /bin/cat \
> >>>       -reducer /usr/bin/wc
> >>>
> >>> Where myInputDirs has a dated subfolder structure of
> >>>
> >>>        /input_dir/yyyy/mm/dd/part-*
> >>>
> >>> I want myOutputDir has the same dated subfolder structure:
> >>>
> >>>        /output_dir/yyyy/mm/dd/part-*
> >>>
> >>> Guess there should be an option to do this. Can "-partitioner" or any
> >>> "-D" option make this?
> >>>
> >>>
> >>> Thanks & regards,
> >>> Rex
> >>
> >>
> >
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Namikaze Minato <ll...@gmail.com>.
Hi Rex X,

We are using the -outputFormat <classname> option of hadoop-streaming.
Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat

Regards,
Camusensei

On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> Thank you, Rohit!
>
> Any multiple outputs sample code in python?
>
> Rex
>
>
> On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> wrote:
>>
>> Hi Rex
>>
>> Please explore multiple outputs.
>>
>> Regards
>> Rohit Sarewar
>>
>>
>> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>>>
>>> Dear all,
>>>
>>> To be specific, for example, given
>>>
>>>     hadoop jar hadoop-streaming.jar \
>>>       -input myInputDirs \
>>>       -output myOutputDir \
>>>       -mapper /bin/cat \
>>>       -reducer /usr/bin/wc
>>>
>>> Where myInputDirs has a dated subfolder structure of
>>>
>>>        /input_dir/yyyy/mm/dd/part-*
>>>
>>> I want myOutputDir has the same dated subfolder structure:
>>>
>>>        /output_dir/yyyy/mm/dd/part-*
>>>
>>> Guess there should be an option to do this. Can "-partitioner" or any
>>> "-D" option make this?
>>>
>>>
>>> Thanks & regards,
>>> Rex
>>
>>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
For additional commands, e-mail: user-help@hadoop.apache.org


Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Namikaze Minato <ll...@gmail.com>.
Hi Rex X,

We are using the -outputFormat <classname> option of hadoop-streaming.
Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat

Regards,
Camusensei

On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> Thank you, Rohit!
>
> Any multiple outputs sample code in python?
>
> Rex
>
>
> On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> wrote:
>>
>> Hi Rex
>>
>> Please explore multiple outputs.
>>
>> Regards
>> Rohit Sarewar
>>
>>
>> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>>>
>>> Dear all,
>>>
>>> To be specific, for example, given
>>>
>>>     hadoop jar hadoop-streaming.jar \
>>>       -input myInputDirs \
>>>       -output myOutputDir \
>>>       -mapper /bin/cat \
>>>       -reducer /usr/bin/wc
>>>
>>> Where myInputDirs has a dated subfolder structure of
>>>
>>>        /input_dir/yyyy/mm/dd/part-*
>>>
>>> I want myOutputDir has the same dated subfolder structure:
>>>
>>>        /output_dir/yyyy/mm/dd/part-*
>>>
>>> Guess there should be an option to do this. Can "-partitioner" or any
>>> "-D" option make this?
>>>
>>>
>>> Thanks & regards,
>>> Rex
>>
>>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
For additional commands, e-mail: user-help@hadoop.apache.org


Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Namikaze Minato <ll...@gmail.com>.
Hi Rex X,

We are using the -outputFormat <classname> option of hadoop-streaming.
Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat

Regards,
Camusensei

On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> Thank you, Rohit!
>
> Any multiple outputs sample code in python?
>
> Rex
>
>
> On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> wrote:
>>
>> Hi Rex
>>
>> Please explore multiple outputs.
>>
>> Regards
>> Rohit Sarewar
>>
>>
>> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>>>
>>> Dear all,
>>>
>>> To be specific, for example, given
>>>
>>>     hadoop jar hadoop-streaming.jar \
>>>       -input myInputDirs \
>>>       -output myOutputDir \
>>>       -mapper /bin/cat \
>>>       -reducer /usr/bin/wc
>>>
>>> Where myInputDirs has a dated subfolder structure of
>>>
>>>        /input_dir/yyyy/mm/dd/part-*
>>>
>>> I want myOutputDir has the same dated subfolder structure:
>>>
>>>        /output_dir/yyyy/mm/dd/part-*
>>>
>>> Guess there should be an option to do this. Can "-partitioner" or any
>>> "-D" option make this?
>>>
>>>
>>> Thanks & regards,
>>> Rex
>>
>>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
For additional commands, e-mail: user-help@hadoop.apache.org


Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Namikaze Minato <ll...@gmail.com>.
Hi Rex X,

We are using the -outputFormat <classname> option of hadoop-streaming.
Here is the detail: http://www.infoq.com/articles/HadoopOutputFormat

Regards,
Camusensei

On 21 January 2016 at 07:21, Rex X <dn...@gmail.com> wrote:
> Thank you, Rohit!
>
> Any multiple outputs sample code in python?
>
> Rex
>
>
> On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
> wrote:
>>
>> Hi Rex
>>
>> Please explore multiple outputs.
>>
>> Regards
>> Rohit Sarewar
>>
>>
>> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>>>
>>> Dear all,
>>>
>>> To be specific, for example, given
>>>
>>>     hadoop jar hadoop-streaming.jar \
>>>       -input myInputDirs \
>>>       -output myOutputDir \
>>>       -mapper /bin/cat \
>>>       -reducer /usr/bin/wc
>>>
>>> Where myInputDirs has a dated subfolder structure of
>>>
>>>        /input_dir/yyyy/mm/dd/part-*
>>>
>>> I want myOutputDir has the same dated subfolder structure:
>>>
>>>        /output_dir/yyyy/mm/dd/part-*
>>>
>>> Guess there should be an option to do this. Can "-partitioner" or any
>>> "-D" option make this?
>>>
>>>
>>> Thanks & regards,
>>> Rex
>>
>>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscribe@hadoop.apache.org
For additional commands, e-mail: user-help@hadoop.apache.org


Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Thank you, Rohit!

Any multiple outputs sample code in python?

Rex

On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
wrote:

> Hi Rex
>
> Please explore multiple outputs
> <https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
> .
>
> Regards
> Rohit Sarewar
>
>
> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>
>> Dear all,
>>
>> To be specific, for example, given
>>
>>     hadoop jar hadoop-streaming.jar \
>>       -input myInputDirs \
>>       -output myOutputDir \
>>       -mapper /bin/cat \
>>       -reducer /usr/bin/wc
>>
>> Where myInputDirs has a *dated* subfolder structure of
>>
>>        /input_dir/yyyy/mm/dd/part-*
>>
>> I want myOutputDir has the same *dated* subfolder structure:
>>
>>        /output_dir/yyyy/mm/dd/part-*
>>
>> Guess there should be an option to do this. Can "-partitioner" or any
>> "-D" option make this?
>>
>>
>> Thanks & regards,
>> Rex
>>
>
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Thank you, Rohit!

Any multiple outputs sample code in python?

Rex

On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
wrote:

> Hi Rex
>
> Please explore multiple outputs
> <https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
> .
>
> Regards
> Rohit Sarewar
>
>
> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>
>> Dear all,
>>
>> To be specific, for example, given
>>
>>     hadoop jar hadoop-streaming.jar \
>>       -input myInputDirs \
>>       -output myOutputDir \
>>       -mapper /bin/cat \
>>       -reducer /usr/bin/wc
>>
>> Where myInputDirs has a *dated* subfolder structure of
>>
>>        /input_dir/yyyy/mm/dd/part-*
>>
>> I want myOutputDir has the same *dated* subfolder structure:
>>
>>        /output_dir/yyyy/mm/dd/part-*
>>
>> Guess there should be an option to do this. Can "-partitioner" or any
>> "-D" option make this?
>>
>>
>> Thanks & regards,
>> Rex
>>
>
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Thank you, Rohit!

Any multiple outputs sample code in python?

Rex

On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
wrote:

> Hi Rex
>
> Please explore multiple outputs
> <https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
> .
>
> Regards
> Rohit Sarewar
>
>
> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>
>> Dear all,
>>
>> To be specific, for example, given
>>
>>     hadoop jar hadoop-streaming.jar \
>>       -input myInputDirs \
>>       -output myOutputDir \
>>       -mapper /bin/cat \
>>       -reducer /usr/bin/wc
>>
>> Where myInputDirs has a *dated* subfolder structure of
>>
>>        /input_dir/yyyy/mm/dd/part-*
>>
>> I want myOutputDir has the same *dated* subfolder structure:
>>
>>        /output_dir/yyyy/mm/dd/part-*
>>
>> Guess there should be an option to do this. Can "-partitioner" or any
>> "-D" option make this?
>>
>>
>> Thanks & regards,
>> Rex
>>
>
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by Rex X <dn...@gmail.com>.
Thank you, Rohit!

Any multiple outputs sample code in python?

Rex

On Wed, Jan 20, 2016 at 10:04 PM, rohit sarewar <ro...@gmail.com>
wrote:

> Hi Rex
>
> Please explore multiple outputs
> <https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
> .
>
> Regards
> Rohit Sarewar
>
>
> On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:
>
>> Dear all,
>>
>> To be specific, for example, given
>>
>>     hadoop jar hadoop-streaming.jar \
>>       -input myInputDirs \
>>       -output myOutputDir \
>>       -mapper /bin/cat \
>>       -reducer /usr/bin/wc
>>
>> Where myInputDirs has a *dated* subfolder structure of
>>
>>        /input_dir/yyyy/mm/dd/part-*
>>
>> I want myOutputDir has the same *dated* subfolder structure:
>>
>>        /output_dir/yyyy/mm/dd/part-*
>>
>> Guess there should be an option to do this. Can "-partitioner" or any
>> "-D" option make this?
>>
>>
>> Thanks & regards,
>> Rex
>>
>
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by rohit sarewar <ro...@gmail.com>.
Hi Rex

Please explore multiple outputs
<https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
.

Regards
Rohit Sarewar


On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:

> Dear all,
>
> To be specific, for example, given
>
>     hadoop jar hadoop-streaming.jar \
>       -input myInputDirs \
>       -output myOutputDir \
>       -mapper /bin/cat \
>       -reducer /usr/bin/wc
>
> Where myInputDirs has a *dated* subfolder structure of
>
>        /input_dir/yyyy/mm/dd/part-*
>
> I want myOutputDir has the same *dated* subfolder structure:
>
>        /output_dir/yyyy/mm/dd/part-*
>
> Guess there should be an option to do this. Can "-partitioner" or any "-D"
> option make this?
>
>
> Thanks & regards,
> Rex
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by rohit sarewar <ro...@gmail.com>.
Hi Rex

Please explore multiple outputs
<https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
.

Regards
Rohit Sarewar


On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:

> Dear all,
>
> To be specific, for example, given
>
>     hadoop jar hadoop-streaming.jar \
>       -input myInputDirs \
>       -output myOutputDir \
>       -mapper /bin/cat \
>       -reducer /usr/bin/wc
>
> Where myInputDirs has a *dated* subfolder structure of
>
>        /input_dir/yyyy/mm/dd/part-*
>
> I want myOutputDir has the same *dated* subfolder structure:
>
>        /output_dir/yyyy/mm/dd/part-*
>
> Guess there should be an option to do this. Can "-partitioner" or any "-D"
> option make this?
>
>
> Thanks & regards,
> Rex
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by rohit sarewar <ro...@gmail.com>.
Hi Rex

Please explore multiple outputs
<https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
.

Regards
Rohit Sarewar


On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:

> Dear all,
>
> To be specific, for example, given
>
>     hadoop jar hadoop-streaming.jar \
>       -input myInputDirs \
>       -output myOutputDir \
>       -mapper /bin/cat \
>       -reducer /usr/bin/wc
>
> Where myInputDirs has a *dated* subfolder structure of
>
>        /input_dir/yyyy/mm/dd/part-*
>
> I want myOutputDir has the same *dated* subfolder structure:
>
>        /output_dir/yyyy/mm/dd/part-*
>
> Guess there should be an option to do this. Can "-partitioner" or any "-D"
> option make this?
>
>
> Thanks & regards,
> Rex
>

Re: Hadoop Streaming: How to parition output into subfolders?

Posted by rohit sarewar <ro...@gmail.com>.
Hi Rex

Please explore multiple outputs
<https://hadoop.apache.org/docs/r2.4.1/api/org/apache/hadoop/mapreduce/lib/output/MultipleOutputs.html>
.

Regards
Rohit Sarewar


On Thu, Jan 21, 2016 at 5:13 AM, Rex X <dn...@gmail.com> wrote:

> Dear all,
>
> To be specific, for example, given
>
>     hadoop jar hadoop-streaming.jar \
>       -input myInputDirs \
>       -output myOutputDir \
>       -mapper /bin/cat \
>       -reducer /usr/bin/wc
>
> Where myInputDirs has a *dated* subfolder structure of
>
>        /input_dir/yyyy/mm/dd/part-*
>
> I want myOutputDir has the same *dated* subfolder structure:
>
>        /output_dir/yyyy/mm/dd/part-*
>
> Guess there should be an option to do this. Can "-partitioner" or any "-D"
> option make this?
>
>
> Thanks & regards,
> Rex
>