Posted to commits@flink.apache.org by se...@apache.org on 2018/09/24 18:51:31 UTC

[flink] 02/03: [hotfix] [s3] Remove obsolete READMEs from hadoop- and presto- s3

This is an automated email from the ASF dual-hosted git repository.

sewen pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/flink.git

commit 8aa128b75713ef56192cb287b3e5c017e29c5ed6
Author: Stephan Ewen <se...@apache.org>
AuthorDate: Thu Sep 20 19:11:57 2018 +0200

    [hotfix] [s3] Remove obsolete READMEs from hadoop- and presto- s3
    
    The Hadoop shading logic is now consolidated in the flink-fs-hadoop-shaded project.
---
 flink-filesystems/flink-s3-fs-hadoop/README.md | 36 -------------------------
 flink-filesystems/flink-s3-fs-presto/README.md | 37 --------------------------
 2 files changed, 73 deletions(-)

diff --git a/flink-filesystems/flink-s3-fs-hadoop/README.md b/flink-filesystems/flink-s3-fs-hadoop/README.md
deleted file mode 100644
index f65ee61..0000000
--- a/flink-filesystems/flink-s3-fs-hadoop/README.md
+++ /dev/null
@@ -1,36 +0,0 @@
-This project is a wrapper around Hadoop's s3a file system. By pulling a smaller dependency tree and
-shading all dependencies away, this keeps the appearance of Flink being Hadoop-free,
-from a dependency perspective.
-
-We also relocate the shaded Hadoop version to allow running in a different
-setup. For this to work, however, we needed to adapt Hadoop's `Configuration`
-class to load a (shaded) `core-default-shaded.xml` configuration with the
-relocated class names of classes loaded via reflection
-(in the future, we may need to extend this to `mapred-default.xml` and `hdfs-defaults.xml` and their respective configuration classes).
-
-# Changing the Hadoop Version
-
-If you want to change the Hadoop version this project depends on, the following
-steps are required to keep the shading correct:
-
-1. from the respective Hadoop jar (currently 2.8.1 as of the `s3hadoop.hadoop.version` property our `pom.xml`),
-  - copy `org/apache/hadoop/conf/Configuration.java` to `src/main/java/org/apache/hadoop/conf/` and
-    - replace `core-default.xml` with `core-default-shaded.xml`.
-  - copy `org/apache/hadoop/util/NativeCodeLoader.java` to `src/main/java/org/apache/hadoop/util/` and
-    - replace the static initializer with
-    ```
-  static {
-    LOG.info("Skipping native-hadoop library for flink-s3-fs-hadoop's relocated Hadoop... " +
-             "using builtin-java classes where applicable");
-  }
-```
-  - copy `core-default.xml` to `src/main/resources/core-default-shaded.xml` and
-    - change every occurrence of `org.apache.hadoop` into `org.apache.flink.fs.s3hadoop.shaded.org.apache.hadoop`
-  - copy `core-site.xml` to `src/test/resources/core-site.xml` (as is)
-2. verify the shaded jar:
-  - does not contain any unshaded classes except for `org.apache.flink.fs.s3hadoop.S3FileSystemFactory`
-  - all other classes should be under `org.apache.flink.fs.s3hadoop.shaded`
-  - there should be a `META-INF/services/org.apache.flink.core.fs.FileSystemFactory` file pointing to two classes: `org.apache.flink.fs.s3hadoop.S3FileSystemFactory` and `org.apache.flink.fs.s3hadoop.S3AFileSystemFactory`
-  - other service files under `META-INF/services` should have their names and contents in the relocated `org.apache.flink.fs.s3hadoop.shaded` package
-  - contains a `core-default-shaded.xml` file
-  - does not contain a `core-default.xml` or `core-site.xml` file
diff --git a/flink-filesystems/flink-s3-fs-presto/README.md b/flink-filesystems/flink-s3-fs-presto/README.md
deleted file mode 100644
index 7244f5b..0000000
--- a/flink-filesystems/flink-s3-fs-presto/README.md
+++ /dev/null
@@ -1,37 +0,0 @@
-This project is a wrapper around the S3 file system from the Presto project which shades all dependencies.
-Initial simple tests seem to indicate that it responds slightly faster
-and in a bit more lightweight manner to write/read/list requests, compared
-to the Hadoop s3a FS, but it has some semantic differences.
-
-We also relocate the shaded Hadoop version to allow running in a different
-setup. For this to work, however, we needed to adapt Hadoop's `Configuration`
-class to load a (shaded) `core-default-shaded.xml` configuration with the
-relocated class names of classes loaded via reflection
-(in the future, we may need to extend this to `mapred-default.xml` and `hdfs-defaults.xml` and their respective configuration classes).
-
-# Changing the Hadoop Version
-
-If you want to change the Hadoop version this project depends on, the following
-steps are required to keep the shading correct:
-
-1. from the respective Hadoop jar (from the `com.facebook.presto.hadoop/hadoop-apache2` resource, currently version 2.7.3-1 as of our `pom.xml`),
-  - copy `org/apache/hadoop/conf/Configuration.java` to `src/main/java/org/apache/hadoop/conf/` and
-    - replace `core-default.xml` with `core-default-shaded.xml`.
-  - copy `org/apache/hadoop/util/NativeCodeLoader.java` to `src/main/java/org/apache/hadoop/util/` and
-    - replace the static initializer with
-    ```
-  static {
-    LOG.info("Skipping native-hadoop library for flink-s3-fs-presto's relocated Hadoop... " +
-             "using builtin-java classes where applicable");
-  }
-```
-  - copy `core-default.xml` to `src/main/resources/core-default-shaded.xml` and
-    - change every occurrence of `org.apache.hadoop` into `org.apache.flink.fs.s3presto.shaded.org.apache.hadoop`
-  - copy `core-site.xml` to `src/test/resources/core-site.xml` (as is)
-2. verify the shaded jar:
-  - does not contain any unshaded classes except for `org.apache.flink.fs.s3presto.S3FileSystemFactory`
-  - all other classes should be under `org.apache.flink.fs.s3presto.shaded`
-  - there should be a `META-INF/services/org.apache.flink.core.fs.FileSystemFactory` file pointing to the `org.apache.flink.fs.s3presto.S3FileSystemFactory` class
-  - other service files under `META-INF/services` should have their names and contents in the relocated `org.apache.flink.fs.s3presto.shaded` package
-  - contains a `core-default-shaded.xml` file
-  - does not contain a `core-default.xml` or `core-site.xml` file
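
The "verify the shaded jar" checklists in the two removed READMEs can be sketched as a small check over the jar's entry names. This is only an illustration and not part of the commit: the class `ShadedJarCheck` and its `violations` method are hypothetical names, `core-default-shaded.xml` is assumed to sit at the jar root, and only entry names are inspected, so the contents of the service files are not checked. The allow-list is matched exactly, so for the Hadoop variant it would presumably need both factory classes named in the service-file bullet.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.jar.JarEntry;
import java.util.jar.JarFile;
import java.util.stream.Collectors;

// Hypothetical helper, not part of Flink: checks jar entry names against the README checklist.
class ShadedJarCheck {

    private static final String SERVICES_DIR = "META-INF/services/";
    private static final String FACTORY_SERVICE = "org.apache.flink.core.fs.FileSystemFactory";

    /**
     * Returns one message per violated checklist item.
     *
     * @param entries         jar entry names, e.g. "org/apache/hadoop/conf/Configuration.class"
     * @param shadedPrefix    relocation prefix as a path, e.g. "org/apache/flink/fs/s3hadoop/shaded/"
     * @param allowedUnshaded class entries allowed outside the prefix (exact match)
     */
    static List<String> violations(List<String> entries, String shadedPrefix, List<String> allowedUnshaded) {
        List<String> problems = new ArrayList<>();
        String dottedPrefix = shadedPrefix.replace('/', '.');
        boolean hasShadedDefaults = false;
        boolean hasFactoryService = false;

        for (String name : entries) {
            if (name.endsWith(".class") && !name.startsWith(shadedPrefix) && !allowedUnshaded.contains(name)) {
                problems.add("unshaded class: " + name);
            } else if (name.equals("core-default.xml") || name.equals("core-site.xml")) {
                problems.add("unexpected config file: " + name);
            } else if (name.equals("core-default-shaded.xml")) {
                hasShadedDefaults = true; // assumption: the file sits at the jar root
            } else if (name.startsWith(SERVICES_DIR) && !name.endsWith("/")) {
                String service = name.substring(SERVICES_DIR.length());
                if (service.equals(FACTORY_SERVICE)) {
                    hasFactoryService = true;
                } else if (!service.startsWith(dottedPrefix)) {
                    problems.add("unrelocated service file: " + name);
                }
            }
        }
        if (!hasShadedDefaults) {
            problems.add("missing core-default-shaded.xml");
        }
        if (!hasFactoryService) {
            problems.add("missing " + SERVICES_DIR + FACTORY_SERVICE);
        }
        return problems;
    }

    // Usage: java ShadedJarCheck.java <jar> <shaded-prefix> [allowed-class-entry ...]
    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: ShadedJarCheck <jar> <shaded-prefix> [allowed-class-entry ...]");
            return;
        }
        List<String> names;
        try (JarFile jar = new JarFile(args[0])) {
            names = jar.stream().map(JarEntry::getName).collect(Collectors.toList());
        }
        List<String> allowed = Arrays.asList(args).subList(2, args.length);
        violations(names, args[1], allowed).forEach(System.out::println);
    }
}
```

Run against a built jar, the program prints nothing when every name-based check passes. Nested classes of an allowed factory (entries containing `$`) would have to be added to the allow-list explicitly.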