You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-issues@hadoop.apache.org by "Gil Vernik (JIRA)" <ji...@apache.org> on 2015/06/16 10:13:01 UTC

[jira] [Created] (HADOOP-12091) Issues with directories handling

Gil Vernik created HADOOP-12091:
-----------------------------------

             Summary: Issues with directories handling
                 Key: HADOOP-12091
                 URL: https://issues.apache.org/jira/browse/HADOOP-12091
             Project: Hadoop Common
          Issue Type: Improvement
          Components: fs/swift
            Reporter: Gil Vernik
            Assignee: Gil Vernik


OpenStack Swift doesn't have notion of directories. In Swift everything is object, stored in some container, that belongs to account.
Current implementation contains many code that handle directories structure in Swift, in particular functions with code that zero length object is a directory.
While it might be true for certain cases, there are also many cases where "directory" handling cases problems and highly reduce performance.
For example, if Swift's container has dozens of objects and one of them has 0 length size, than Swift driver thinks it's a directory and report it to
upper layer as directory. In consequence, this leads to various exceptions and crashes in client side / upper Hadoop layer.

The propose of this Jira topic is to make directories handling in driver as an optional and configurable. The driver will behave the same, but there
will be a configurable option that will disable directories handling and so everything will be objects, even those with 0 length size.

This will cover cases, where clients doesn't care about directories structures.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)