Posted to dev@nifi.apache.org by SARIT GHOSH <sa...@gmail.com> on 2017/06/20 12:13:06 UTC

NIFI Flow Query

Hi Team,

I'm new to Apache NiFi and am trying to build a flow as a POC. I need your
guidance on this.

I have a gzip-compressed file, say 'sample.gz', containing a file, say
'sample_file'.
I need to decompress sample.gz and store 'sample_file' in an HDFS
location.

I'm using a GetFile processor to pick up sample.gz, a CompressContent
processor in decompress mode to decompress it, and a PutHDFS
processor to put the decompressed file in the HDFS location.

After running the flow, I find that only the original sample.gz file is
copied to the HDFS location, whereas I need the sample_file inside the
gz file to be copied.

So decompression does not seem to have worked for me.

I hope I have explained the issue clearly. Please suggest whether I need to
change my approach.

-- 
Thanks & Regards,
Sarit Ghosh,

Re: NIFI Flow Query

Posted by Pierre Villard <pi...@gmail.com>.
Hi Sarit,

Did you set the "Update Filename" property to true in the CompressContent
processor?
I believe the file is correctly decompressed but the file name is not
changed, which is why you still see the .gz extension in HDFS.
You can check whether the file is still compressed by getting it from
HDFS and running 'cat' on it.
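
As a more precise alternative to eyeballing 'cat' output, here is a minimal
Python sketch that checks whether a file's content still starts with the gzip
magic number (bytes 0x1f 0x8b). The helper names is_gzip and strip_gz_suffix
are hypothetical and only for illustration; strip_gz_suffix merely mimics the
kind of rename you would expect once the filename is updated, and is not
NiFi's actual implementation.

```python
import gzip

# Every gzip stream begins with these two bytes (RFC 1952).
GZIP_MAGIC = b"\x1f\x8b"


def is_gzip(data: bytes) -> bool:
    """Return True if the data starts with the gzip magic number."""
    return data[:2] == GZIP_MAGIC


def strip_gz_suffix(filename: str) -> str:
    """Hypothetical helper: drop a trailing '.gz' from a file name, if present."""
    return filename[:-3] if filename.lower().endswith(".gz") else filename


if __name__ == "__main__":
    # Stand-in for the content of 'sample_file' (made-up data).
    original = b"hello from sample_file\n"
    compressed = gzip.compress(original)

    # Compressed content carries the magic number; decompressed content does not.
    print(is_gzip(compressed))
    print(is_gzip(gzip.decompress(compressed)))

    # The content can be decompressed while the name still ends in '.gz'.
    print(strip_gz_suffix("sample.gz"))
```

If the first two bytes of the file fetched from HDFS are not 0x1f 0x8b, the
content was decompressed and only the name kept its .gz extension.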

Thanks!
Pierre

