Posted to user-zh@flink.apache.org by Z-Z <zz...@qq.com> on 2020/07/15 02:47:39 UTC

[Help] Flink Hadoop dependency issue

I'm running Flink 1.11.0, deployed with docker-compose. The docker-compose file is as follows:
version: "2.1"
services:
  jobmanager:
    image: flink:1.11.0-scala_2.12
    expose:
      - "6123"
    ports:
      - "8081:8081"
    command: jobmanager
    environment:
      - JOB_MANAGER_RPC_ADDRESS=jobmanager
      - HADOOP_CLASSPATH=/data/hadoop-2.9.2/etc/hadoop:/data/hadoop-2.9.2/share/hadoop/common/lib/*:/data/hadoop-2.9.2/share/hadoop/common/*:/data/hadoop-2.9.2/share/hadoop/hdfs:/data/hadoop-2.9.2/share/hadoop/hdfs/lib/*:/data/hadoop-2.9.2/share/hadoop/hdfs/*:/data/hadoop-2.9.2/share/hadoop/yarn:/data/hadoop-2.9.2/share/hadoop/yarn/lib/*:/data/hadoop-2.9.2/share/hadoop/yarn/*:/data/hadoop-2.9.2/share/hadoop/mapreduce/lib/*:/data/hadoop-2.9.2/share/hadoop/mapreduce/*:/contrib/capacity-scheduler/*.jar
    volumes:
      - ./jobmanager/conf:/opt/flink/conf
      - ./data:/data

  taskmanager:
    image: flink:1.11.0-scala_2.12
    expose:
      - "6121"
      - "6122"
    depends_on:
      - jobmanager
    command: taskmanager
    links:
      - "jobmanager:jobmanager"
    environment:
      - JOB_MANAGER_RPC_ADDRESS=jobmanager
    volumes:
      - ./taskmanager/conf:/opt/flink/conf
networks:
  default:
    external:
      name: flink-network



hadoop-2.9.2 is already in the data directory, and HADOOP_CLASSPATH has been added to the environment variables of both the jobmanager and the taskmanager, but whether the job is submitted via the CLI or the web UI, the jobmanager still reports "Could not find a file system implementation for scheme 'hdfs'". Does anyone know what's going on?
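
For reference, the variable can be confirmed from the host with something like the following (service names as in the compose file above):

    docker-compose exec jobmanager env | grep HADOOP_CLASSPATH
    docker-compose exec taskmanager env | grep HADOOP_CLASSPATH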

Re: [Help] Flink Hadoop dependency issue

Posted by Yang Wang <da...@gmail.com>.
You can check inside the Pod whether the /data directory is mounted properly. You also need to run ps inside the Pod and look at the classpath of the launched JVM process to see whether it includes the Hadoop jars.
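
A rough sketch of those checks with docker-compose (service name taken from the compose file above; assumes ps is available in the image):

    # is the Hadoop distribution visible where HADOOP_CLASSPATH points?
    docker-compose exec jobmanager ls /data/hadoop-2.9.2/share/hadoop/hdfs
    # what classpath was the JVM actually started with? ([j]ava keeps grep from matching itself)
    docker-compose exec jobmanager sh -c "ps -ef | grep '[j]ava'"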


Of course, adding flink-shaded-hadoop and putting it under $FLINK_HOME/lib, as Roc Marshal suggested, will also solve the problem.

Best,
Yang


Re: [Help] Flink Hadoop dependency issue

Posted by Roc Marshal <fl...@126.com>.


Hi Z-Z,

You can try downloading the matching uber jar from https://repo1.maven.org/maven2/org/apache/flink/flink-shaded-hadoop-2-uber/, put the downloaded jar file under the ${FLINK_HOME}/lib path of the Flink image, and then start the orchestrated containers.
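
A minimal sketch of baking the jar into the image (the uber-jar version below is only an example; pick the one matching your Hadoop line from the link above):

    FROM flink:1.11.0-scala_2.12
    # example artifact name; substitute the flink-shaded-hadoop-2-uber jar you downloaded
    COPY flink-shaded-hadoop-2-uber-2.8.3-10.0.jar /opt/flink/lib/

Point the image lines in the compose file at the rebuilt image afterwards.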
Best regards,
Roc Marshal.