Posted to common-issues@hadoop.apache.org by "Da Zhou (JIRA)" <ji...@apache.org> on 2018/08/24 23:02:00 UTC

[jira] [Updated] (HADOOP-15663) ABFS: Simplify configuration

     [ https://issues.apache.org/jira/browse/HADOOP-15663?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Da Zhou updated HADOOP-15663:
-----------------------------
    Attachment: HADOOP-15663-HADOOP-15407-001.patch
        Status: Patch Available  (was: Open)

Attaching patch: HADOOP-15663-HADOOP-15407-001.patch :

These config property changes are for *TEST* purposes only; they do not affect production code.
 - Merged ABFS and WASB configuration files into a single file.

 - Removed *"fs.azure.test.account.name"* as it is a duplicate entry of *"fs.azure.test.account.name"*

 - Added new test config properties *"fs.azure.wasb.account.name"* and *"fs.azure.abfs.account.name"* to resolve the property overlap issues.
 Meanwhile, the original property "fs.azure.account.name" is still supported when running only ABFS tests or only WASB tests.

 - Added accountName/key verification to catch misconfigurations early.

 - Added new test enable/disable control properties *"fs.azure.wasb.tests.enabled"* and *"fs.azure.abfs.tests.enabled"*. I first tried to enable/disable tests by checking whether the WASB/ABFS test account name was missing. However, many tests do not depend on the account name, and people may want to run those, so I gave up on that approach and added these two properties instead, which makes the intent clearer to developers.

 - Removed the ABFS emulator config and added *"fs.azure.abfs.endpoint"* to support endpoints in the format *IP:PORT*

 - Some tests were set to run sequentially, which is not necessary. Updated the pom to run them in parallel.
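
For illustration, a merged test configuration using the properties listed above might look roughly like the fragment below. This is only a sketch: the property names are the ones named in this comment, but every value is a placeholder, the "true" values for the enable flags are an assumption about the accepted format, and the account key properties are left out because they are not named here. See the patch for the actual file contents.

```xml
<!-- Sketch only: placeholder values, not taken from the patch. -->
<configuration>

  <!-- Separate account names, so WASB and ABFS no longer share one property. -->
  <property>
    <name>fs.azure.wasb.account.name</name>
    <value>WASB_ACCOUNT_NAME_PLACEHOLDER</value>
  </property>
  <property>
    <name>fs.azure.abfs.account.name</name>
    <value>ABFS_ACCOUNT_NAME_PLACEHOLDER</value>
  </property>

  <!-- Explicit switches for each test suite (value format assumed). -->
  <property>
    <name>fs.azure.wasb.tests.enabled</name>
    <value>true</value>
  </property>
  <property>
    <name>fs.azure.abfs.tests.enabled</name>
    <value>true</value>
  </property>

  <!-- Optional endpoint in IP:PORT form; the address below is a placeholder. -->
  <property>
    <name>fs.azure.abfs.endpoint</name>
    <value>127.0.0.1:10000</value>
  </property>

</configuration>
```

When running only WASB tests or only ABFS tests, the original "fs.azure.account.name" property could presumably be used in place of the two account-specific names.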

> ABFS: Simplify configuration
> ----------------------------
>
>                 Key: HADOOP-15663
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15663
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/azure
>            Reporter: Thomas Marquardt
>            Assignee: Da Zhou
>            Priority: Major
>         Attachments: HADOOP-15663-HADOOP-15407-001.patch
>
>
> Configuration for WASB and ABFS is too complex.  The current approach is to use four files for test configuration. 
> Both WASB and ABFS have basic test configuration which is committed to the repo (azure-test.xml and azure-bfs-test.xml).  Currently these contain the fs.AbstractFileSystem.[scheme].impl configuration, but otherwise are empty except for an include reference to a file containing the endpoint credentials. 
> Both WASB and ABFS have endpoint credential configuration files (azure-auth-keys.xml and azure-bfs-auth-keys.xml).  These have been added to .gitignore to prevent them from accidentally being submitted in a patch, which would leak the developer's storage account credentials.  These files contain account names, storage account keys, and service endpoints.
> There is some overlap of the configuration for WASB and ABFS, where they use the same property name but use different values.  
> 1) Let's reduce the number of test configuration files to one, if possible.
> 2) Let's simplify the account name, key, and endpoint configuration for WASB and ABFS if possible, but still support the legacy way of doing it, which is very error-prone.
> 3) Let's improve error handling, so that typos or misconfiguration are not so difficult to troubleshoot.


