You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by GitBox <gi...@apache.org> on 2019/12/12 06:20:38 UTC

[GitHub] [flink] JingsongLi opened a new pull request #10540: [FLINK-15185][hive] Shade flink-hadoop-fs to run hive in standalone mode

JingsongLi opened a new pull request #10540: [FLINK-15185][hive] Shade flink-hadoop-fs to run hive in standalone mode
URL: https://github.com/apache/flink/pull/10540
 
 
   
   ## What is the purpose of the change
   
   Now in hive HadoopFileSystemFactory, we use org.apache.flink.runtime.fs.hdfs.HadoopFileSystem to get FileSystem.
   
   But it should not work after we setting default child first class loader. Because in standalone mode, the cluster has no hadoop dependency. So the solution is:
   - Add `flink-hadoop-fs` dependency to hive module, not work, because classes with "org.apache.flink" prefix will always be loaded by parent class loader 
   - User add hadoop dependency to standalone cluster, it breaks out-of-the-box.
   - Shade hadoop FileSystem in hive module, not complex, good.
   
   ## Brief change log
   
   Shade hadoop FileSystem in hive module
   
   ## Verifying this change
   
   Manually verified the change
   
   ## Does this pull request potentially affect one of the following parts:
   
     - Dependencies (does it add or upgrade a dependency): no
     - The public API, i.e., is any changed class annotated with `@Public(Evolving)`: no
     - The serializers: no
     - The runtime per-record code paths (performance sensitive): no
     - Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Yarn/Mesos, ZooKeeper: no
     - The S3 file system connector: no
   
   ## Documentation
   
     - Does this pull request introduce a new feature? no

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services