Posted to issues@flink.apache.org by "Dawid Wysakowicz (JIRA)" <ji...@apache.org> on 2019/01/30 10:57:00 UTC

[jira] [Resolved] (FLINK-10910) Harden Kubernetes e2e test

     [ https://issues.apache.org/jira/browse/FLINK-10910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dawid Wysakowicz resolved FLINK-10910.
--------------------------------------
       Resolution: Fixed
    Fix Version/s: 1.7.2
                   1.6.4

Fixed in:
master: e0ee3e5c4bede32d89492a5bc3bd91accd13f46e
1.7.2: e79bd7ac4b219a71a708084a63470d7f9309531e
1.6.4: 01e3c7273f4c4a37524f870f08e2cea05281e963

> Harden Kubernetes e2e test
> --------------------------
>
>                 Key: FLINK-10910
>                 URL: https://issues.apache.org/jira/browse/FLINK-10910
>             Project: Flink
>          Issue Type: Bug
>          Components: E2E Tests
>    Affects Versions: 1.7.0
>            Reporter: Till Rohrmann
>            Assignee: Dawid Wysakowicz
>            Priority: Critical
>              Labels: pull-request-available, test-stability
>             Fix For: 1.6.4, 1.7.2, 1.8.0
>
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> The {{Kubernetes test}} (e2e test) sometimes fails with the following output:
> {code}
> ==============================================================================
> Running 'Run Kubernetes test'
> ==============================================================================
> TEST_DATA_DIR: /home/admin/flink/flink-end-to-end-tests/test-scripts/temp-test-directory-40594844780
> Flink dist directory: /home/admin/flink-1.7.0
> /home/admin/flink/flink-end-to-end-tests/test-scripts/test_kubernetes_embedded_job.sh: line 44: none: command not found
> Using flink dist: ../../flink-dist/target/flink-*-bin
> ./
> ./flink-1.7-SNAPSHOT/
> ./flink-1.7-SNAPSHOT/LICENSE
> ./flink-1.7-SNAPSHOT/examples/
> ./flink-1.7-SNAPSHOT/examples/gelly/
> ./flink-1.7-SNAPSHOT/examples/gelly/flink-gelly-examples_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/
> ./flink-1.7-SNAPSHOT/examples/streaming/WindowJoin.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/SocketWindowWordCount.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/StateMachineExample.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/Kafka010Example.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/Kafka011Example.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/SessionWindowing.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/IncrementalLearning.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/KafkaExample.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/WordCount.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/Twitter.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/Iteration.jar
> ./flink-1.7-SNAPSHOT/examples/streaming/TopSpeedWindowing.jar
> ./flink-1.7-SNAPSHOT/examples/batch/
> ./flink-1.7-SNAPSHOT/examples/batch/KMeans.jar
> ./flink-1.7-SNAPSHOT/examples/batch/PageRank.jar
> ./flink-1.7-SNAPSHOT/examples/batch/WebLogAnalysis.jar
> ./flink-1.7-SNAPSHOT/examples/batch/WordCount.jar
> ./flink-1.7-SNAPSHOT/examples/batch/EnumTriangles.jar
> ./flink-1.7-SNAPSHOT/examples/batch/DistCp.jar
> ./flink-1.7-SNAPSHOT/examples/batch/TransitiveClosure.jar
> ./flink-1.7-SNAPSHOT/examples/batch/ConnectedComponents.jar
> ./flink-1.7-SNAPSHOT/examples/python/
> ./flink-1.7-SNAPSHOT/examples/python/streaming/
> ./flink-1.7-SNAPSHOT/examples/python/streaming/fibonacci.py
> ./flink-1.7-SNAPSHOT/examples/python/streaming/word_count.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/
> ./flink-1.7-SNAPSHOT/examples/python/batch/TriangleEnumeration.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/TPCHQuery3.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/WebLogAnalysis.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/TPCHQuery10.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/WordCount.py
> ./flink-1.7-SNAPSHOT/examples/python/batch/__init__.py
> ./flink-1.7-SNAPSHOT/log/
> ./flink-1.7-SNAPSHOT/opt/
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-ganglia-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-table_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-dropwizard-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-prometheus-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-cep_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-graphite-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-gelly-scala_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-s3-fs-hadoop-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-statsd-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-queryable-state-runtime_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-streaming-python_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-ml_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-gelly_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-swift-fs-hadoop-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-cep-scala_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-slf4j-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-s3-fs-presto-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-metrics-datadog-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/opt/flink-sql-client_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/NOTICE
> ./flink-1.7-SNAPSHOT/README.txt
> ./flink-1.7-SNAPSHOT/lib/
> ./flink-1.7-SNAPSHOT/lib/flink-python_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/lib/log4j-1.2.17.jar
> ./flink-1.7-SNAPSHOT/lib/flink-shaded-hadoop2-uber-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/lib/flink-dist_2.11-1.7-SNAPSHOT.jar
> ./flink-1.7-SNAPSHOT/lib/slf4j-log4j12-1.7.15.jar
> ./flink-1.7-SNAPSHOT/conf/
> ./flink-1.7-SNAPSHOT/conf/log4j-cli.properties
> ./flink-1.7-SNAPSHOT/conf/logback-console.xml
> ./flink-1.7-SNAPSHOT/conf/logback.xml
> ./flink-1.7-SNAPSHOT/conf/logback-yarn.xml
> ./flink-1.7-SNAPSHOT/conf/flink-conf.yaml
> ./flink-1.7-SNAPSHOT/conf/sql-client-defaults.yaml
> ./flink-1.7-SNAPSHOT/conf/log4j-yarn-session.properties
> ./flink-1.7-SNAPSHOT/conf/masters
> ./flink-1.7-SNAPSHOT/conf/log4j.properties
> ./flink-1.7-SNAPSHOT/conf/zoo.cfg
> ./flink-1.7-SNAPSHOT/conf/log4j-console.properties
> ./flink-1.7-SNAPSHOT/conf/slaves
> ./flink-1.7-SNAPSHOT/bin/
> ./flink-1.7-SNAPSHOT/bin/mesos-appmaster-job.sh
> ./flink-1.7-SNAPSHOT/bin/flink
> ./flink-1.7-SNAPSHOT/bin/flink.bat
> ./flink-1.7-SNAPSHOT/bin/taskmanager.sh
> ./flink-1.7-SNAPSHOT/bin/mesos-taskmanager.sh
> ./flink-1.7-SNAPSHOT/bin/stop-zookeeper-quorum.sh
> ./flink-1.7-SNAPSHOT/bin/flink-console.sh
> ./flink-1.7-SNAPSHOT/bin/zookeeper.sh
> ./flink-1.7-SNAPSHOT/bin/start-cluster.sh
> ./flink-1.7-SNAPSHOT/bin/standalone-job.sh
> ./flink-1.7-SNAPSHOT/bin/pyflink-stream.sh
> ./flink-1.7-SNAPSHOT/bin/jobmanager.sh
> ./flink-1.7-SNAPSHOT/bin/historyserver.sh
> ./flink-1.7-SNAPSHOT/bin/config.sh
> ./flink-1.7-SNAPSHOT/bin/start-zookeeper-quorum.sh
> ./flink-1.7-SNAPSHOT/bin/start-scala-shell.sh
> ./flink-1.7-SNAPSHOT/bin/yarn-session.sh
> ./flink-1.7-SNAPSHOT/bin/pyflink.bat
> ./flink-1.7-SNAPSHOT/bin/flink-daemon.sh
> ./flink-1.7-SNAPSHOT/bin/pyflink.sh
> ./flink-1.7-SNAPSHOT/bin/sql-client.sh
> ./flink-1.7-SNAPSHOT/bin/mesos-appmaster.sh
> ./flink-1.7-SNAPSHOT/bin/stop-cluster.sh
> ./flink-1.7-SNAPSHOT/bin/start-cluster.bat
> Sending build context to Docker daemon   472 MB
> Step 1/16 : FROM java:8-jre-alpine
>  ---> fdc893b19a14
> Step 2/16 : RUN apk add --no-cache bash snappy
>  ---> Running in 9b6a826cdbd6
> fetch http://dl-cdn.alpinelinux.org/alpine/v3.4/main/x86_64/APKINDEX.tar.gz
> fetch http://dl-cdn.alpinelinux.org/alpine/v3.4/community/x86_64/APKINDEX.tar.gz
> (1/7) Installing ncurses-terminfo-base (6.0_p20171125-r0)
> (2/7) Installing ncurses-terminfo (6.0_p20171125-r0)
> (3/7) Installing ncurses-libs (6.0_p20171125-r0)
> (4/7) Installing readline (6.3.008-r4)
> (5/7) Installing bash (4.3.42-r5)
> Executing bash-4.3.42-r5.post-install
> (6/7) Installing libstdc++ (5.3.0-r0)
> (7/7) Installing snappy (1.1.3-r0)
> Executing busybox-1.24.2-r13.trigger
> OK: 115 MiB in 41 packages
>  ---> 194f5306312e
> Removing intermediate container 9b6a826cdbd6
> Step 3/16 : ENV FLINK_INSTALL_PATH /opt
>  ---> Running in f44ed28c3ecd
>  ---> 61495cda8d7e
> Removing intermediate container f44ed28c3ecd
> Step 4/16 : ENV FLINK_HOME $FLINK_INSTALL_PATH/flink
>  ---> Running in 106db4c7c3e3
>  ---> 04fcb3422dfa
> Removing intermediate container 106db4c7c3e3
> Step 5/16 : ENV FLINK_LIB_DIR $FLINK_HOME/lib
>  ---> Running in 51030ee2a57e
>  ---> 7c290e53f0f1
> Removing intermediate container 51030ee2a57e
> Step 6/16 : ENV PATH $PATH:$FLINK_HOME/bin
>  ---> Running in 1f89c045de97
>  ---> 81cec8fe782c
> Removing intermediate container 1f89c045de97
> Step 7/16 : ARG flink_dist=NOT_SET
>  ---> Running in bfd5fac37dea
>  ---> 1dfd4009b044
> Removing intermediate container bfd5fac37dea
> Step 8/16 : ARG job_jar=NOT_SET
>  ---> Running in 712432ce7471
>  ---> 17cba67f1748
> Removing intermediate container 712432ce7471
> Step 9/16 : ADD $flink_dist $FLINK_INSTALL_PATH
>  ---> be5be560416b
> Removing intermediate container b6b1c4ae0855
> Step 10/16 : ADD $job_jar $FLINK_INSTALL_PATH/job.jar
>  ---> 1299878dc57c
> Removing intermediate container 0fea418feed1
> Step 11/16 : RUN set -x &&   ln -s $FLINK_INSTALL_PATH/flink-* $FLINK_HOME &&   ln -s $FLINK_INSTALL_PATH/job.jar $FLINK_LIB_DIR &&   addgroup -S flink && adduser -D -S -H -G flink -h $FLINK_HOME flink &&   chown -R flink:flink $FLINK_INSTALL_PATH/flink-* &&   chown -h flink:flink $FLINK_HOME
>  ---> Running in 53d9785301eb
> + ln -s /opt/flink-1.7-SNAPSHOT /opt/flink
> + ln -s /opt/job.jar /opt/flink/lib
> + addgroup -S flink
> + adduser -D -S -H -G flink -h /opt/flink flink
> + chown -R flink:flink /opt/flink-1.7-SNAPSHOT
> + chown -h flink:flink /opt/flink
>  ---> 98c9da70c69c
> Removing intermediate container 53d9785301eb
> Step 12/16 : COPY docker-entrypoint.sh /
>  ---> 8491578301f5
> Removing intermediate container 6798845d05c7
> Step 13/16 : USER flink
>  ---> Running in c0af68cc8c68
>  ---> f6073e2f0aee
> Removing intermediate container c0af68cc8c68
> Step 14/16 : EXPOSE 8081 6123
>  ---> Running in 9048cc411ae5
>  ---> 765a20289069
> Removing intermediate container 9048cc411ae5
> Step 15/16 : ENTRYPOINT /docker-entrypoint.sh
>  ---> Running in bfa44c4924c3
>  ---> e542e35e7612
> Removing intermediate container bfa44c4924c3
> Step 16/16 : CMD --help
>  ---> Running in d2ac82b536b2
>  ---> effc5b265663
> Removing intermediate container d2ac82b536b2
> Successfully built effc5b265663
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> error: Filespec must match the canonical format: [[namespace/]pod:]file/path
> sort: cannot read: '/home/admin/flink/flink-end-to-end-tests/test-scripts/temp-test-directory-40594844780/out/kubernetes_wc_out*': No such file or directory
> FAIL WordCount: Output hash mismatch.  Got d41d8cd98f00b204e9800998ecf8427e, expected e682ec6622b5e83f2eb614617d5ab2cf.
> head hexdump of actual:
> head: cannot open '/home/admin/flink/flink-end-to-end-tests/test-scripts/temp-test-directory-40594844780/out/kubernetes_wc_out*' for reading: No such file or directory
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> The connection to the server 172.31.47.225:8443 was refused - did you specify the right host or port?
> [FAIL] Test script contains errors.
> Checking for errors...
> No errors in log files.
> Checking for exceptions...
> No exceptions in log files.
> Checking for non-empty .out files...
> grep: /home/admin/flink-1.7.0/log/*.out: No such file or directory
> No non-empty .out files.
> [FAIL] 'Run Kubernetes test' failed after 1 minutes and 6 seconds! Test exited with exit code 1
> {code}
> We should try to harden this test.
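
In the log above, the Kubernetes API server at 172.31.47.225:8443 refuses every connection, yet the script carries on to the output check. The reported hash d41d8cd98f00b204e9800998ecf8427e is the MD5 of empty input, which is consistent with no output file ever being written. One possible way to harden such a script is to wait for the API server before submitting anything and to abort early if it never comes up. The following is a minimal sketch only: the helper name retry_until_success and its parameters are hypothetical, and it is not the change made in the commits listed in this message.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not taken from the Flink test scripts): rerun a command
# until it succeeds or the attempt budget is used up.
#   $1 = maximum number of attempts
#   $2 = seconds to sleep between attempts
#   remaining arguments = the command to run
retry_until_success() {
    local max_attempts=$1
    local delay_seconds=$2
    shift 2
    local attempt
    for ((attempt = 1; attempt <= max_attempts; attempt++)); do
        if "$@"; then
            return 0
        fi
        echo "Attempt ${attempt}/${max_attempts} failed: $*" >&2
        # Do not sleep after the final attempt.
        if ((attempt < max_attempts)); then
            sleep "${delay_seconds}"
        fi
    done
    return 1
}
```

A test script could then guard its cluster-dependent steps, assuming kubectl is on the PATH and configured for the test cluster, with something like: retry_until_success 30 2 kubectl get nodes || exit 1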


