You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Jeff Storck (JIRA)" <ji...@apache.org> on 2018/08/08 18:49:00 UTC

[jira] [Resolved] (NIFI-4434) ListHDFS applies File Filter also to subdirectory names in recursive search

     [ https://issues.apache.org/jira/browse/NIFI-4434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeff Storck resolved NIFI-4434.
-------------------------------
    Resolution: Fixed

> ListHDFS applies File Filter also to subdirectory names in recursive search
> ---------------------------------------------------------------------------
>
>                 Key: NIFI-4434
>                 URL: https://issues.apache.org/jira/browse/NIFI-4434
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>            Reporter: Holger Frydrych
>            Assignee: Jeff Storck
>            Priority: Major
>
> The File Filter regex configured in the ListHDFS processor is applied not just to files found, but also to subdirectories. 
> If you try to set up a recursive search to list e.g. all csv files in a directory hierarchy via a regex like ".*\.csv", it will only pick up csv files in the base directory, not in any subdirectory. This is because subdirectories don't typically match that regex pattern.
> To fix this, either subdirectories should not be matched against the file filter, or the file filter should be applied to the full path of all files (relative to the base directory). The GetHDFS processor offers both options via a switch.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)